Trending repositories for topic natural-language-processing
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Unsupervised text tokenizer for Neural Network-based text generation.
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
A starting take on a fast and fully local NLP file organizer that organizes files based on their content.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
A curated list of awesome online courses about Large Langage Models (LLMs)
对豆瓣影评进行文本分类情感分析,利用爬虫豆瓣爬取评论,进行数据清洗,分词,采用BERT、CNN、LSTM等模型进行训练,采用tensorboardX可视化训练过程,自然语言处理项目\A project for text classification, based on torch 1.7.1
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Home of the AI workforce - Multi-agent system, AI agents & tools
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
Realtime Sign Language Detection: Deep learning model for accurate, real-time recognition of sign language gestures using Python and TensorFlow.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
The official GitHub page for the survey paper "A Survey of Large Language Models".
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
A starting take on a fast and fully local NLP file organizer that organizes files based on their content.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classification tasks using the Stanford Sentiment Treebank (SST-2) data...
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
An unobtrusive Obsidian plugin that quietly processes equations and patterns in real time
A curated list of awesome online courses about Large Langage Models (LLMs)
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
Realtime Sign Language Detection: Deep learning model for accurate, real-time recognition of sign language gestures using Python and TensorFlow.
🔡 List of Tools, Libraries, Models, Datasets and other resources for Turkish NLP.
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
SentimentArcs: a large ensemble of dozens of sentiment analysis models to analyze emotion in text over time
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
Learn how to design, develop, deploy and iterate on production-grade ML applications.
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Chatbot for documentation, that allows you to chat with your data. Privately deployable, provides AI knowledge sharing and integrates knowledge into your AI workflow
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary
VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using posteriori model)
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
A compute framework for building Search, RAG, Recommendations and Analytics over complex structured & unstructured data.
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
Realtime Sign Language Detection: Deep learning model for accurate, real-time recognition of sign language gestures using Python and TensorFlow.
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
A starting take on a fast and fully local NLP file organizer that organizes files based on their content.
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)
A curated list of awesome online courses about Large Langage Models (LLMs)
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents. https://oasis.camel-ai.org
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
A curated list of awesome online courses about Large Langage Models (LLMs)
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
This is a collection of DS, AI, ML, DL, NLP, Computer Vision job interview questions.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
📺 Discover the latest machine learning / AI courses on YouTube.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
The official GitHub page for the survey paper "A Survey of Large Language Models".
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
💫 Industrial-strength Natural Language Processing (NLP) in Python
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on Apple's MobileCLIP-S0 architecture, it ensures optimal performan...
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
This repository is an AI Bootcamp material that consist of a workflow for LLM
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
AI-powered YouTube Notes Generator: Create detailed notes from YouTube videos. Streamlit UI for easy use.
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
A starting take on a fast and fully local NLP file organizer that organizes files based on their content.
[EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation"