Trending repositories for topic natural-language-processing
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
The official GitHub page for the survey paper "A Survey of Large Language Models".
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Resume Matcher is an open source, free tool to improve your resume. It works by using language models to compare and rank resumes with job descriptions.
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
TopicGPT allows to integrate the benefits of LLMs into Topic Modelling
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
This repository introduces MentaLLaMA, the first open-source instruction following large language model for interpretable mental health analysis.
[ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models
Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG enables dynamic, interactive document conversations, making it i...
A legal knowledge search and Q&A application based on Vietnam's Legal Code and legal document database ⚖️
🔡 List of Tools, Libraries, Models, Datasets and other resources for Turkish NLP.
对豆瓣影评进行文本分类情感分析,利用爬虫豆瓣爬取评论,进行数据清洗,分词,采用BERT、CNN、LSTM等模型进行训练,采用tensorboardX可视化训练过程,自然语言处理项目\A project for text classification, based on torch 1.7.1
A simple RoadMap to Natural Language Processing(NLP)
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
The official GitHub page for the survey paper "A Survey of Large Language Models".
Learn how to design, develop, deploy and iterate on production-grade ML applications.
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
TopicGPT allows to integrate the benefits of LLMs into Topic Modelling
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
📚 Curated collection of engineering blogs detailing real-world applications of LLMs in solving specific business problems.
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
A curated list of awesome online courses about Large Langage Models (LLMs)
AI-Generated Text Detection: A BERT-powered solution for accurately identifying AI-generated text. Seamlessly integrated, highly accurate, and user-friendly.🚀
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
The official GitHub page for the survey paper "A Survey of Large Language Models".
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Chatbot for documentation, that allows you to chat with your data. Privately deployable, provides AI knowledge sharing and integrates knowledge into your AI workflow
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)
ML Nexus is an open-source collection of machine learning projects, covering topics like neural networks, computer vision, and NLP. Whether you're a beginner or expert, contribute, collaborate, and gr...
📚 Curated collection of engineering blogs detailing real-world applications of LLMs in solving specific business problems.
TopicGPT allows to integrate the benefits of LLMs into Topic Modelling
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Realtime Sign Language Detection: Deep learning model for accurate, real-time recognition of sign language gestures using Python and TensorFlow.
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.
A curated list of awesome online courses about Large Langage Models (LLMs)
This repository is used to collect papers and code in the field of AI.
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Collection of training data management explorations for large language models
A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evolving RAG ecosystem.
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning
Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey"
[ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
[EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.
:zap: Cloud-native, AI-powered, document processing pipelines on AWS.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
📺 Discover the latest machine learning / AI courses on YouTube.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
The official GitHub page for the survey paper "A Survey of Large Language Models".
⚡ Automatically decrypt encryptions without knowing the key or cipher, decode encodings, and crack hashes ⚡
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
💫 Industrial-strength Natural Language Processing (NLP) in Python
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"
WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing,...
Collection of training data management explorations for large language models
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where...
CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on Apple's MobileCLIP-S0 architecture, it ensures optimal performan...
library supporting NLP and CV research on scientific papers
[EMNLP2024 Demo] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports audio moment retrieval.
This repository is an AI Bootcamp material that consist of a workflow for LLM
A legal knowledge search and Q&A application based on Vietnam's Legal Code and legal document database ⚖️
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes