177 results found Sort:
- Filter by Primary Language:
- Python (84)
- Jupyter Notebook (47)
- JavaScript (10)
- C++ (3)
- HTML (3)
- Java (3)
- C# (2)
- Go (2)
- Kotlin (1)
- Cython (1)
- Rust (1)
- TypeScript (1)
- Vue (1)
- +
An open source library for deep learning end-to-end dialog systems and chatbots.
Created
2017-11-17
2,711 commits to master branch, last one 7 days ago
An Open-Source Framework for Prompt-Learning.
Created
2021-09-30
264 commits to main branch, last one about a year ago
Data processing with ML, LLM and Vision LLM
Created
2022-01-08
505 commits to main branch, last one a day ago
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
Created
2022-12-31
254 commits to main branch, last one 2 days ago
ChatGPT带火了聊天机器人,主流的趋势都调整到了GPT类模式,本项目也与时俱进,会在近期更新GPT类版本。基于本项目和自己的语料可以训练出自己想要的聊天机器人,用于智能客服、在线问答、闲聊等场景。
Created
2018-01-08
102 commits to master branch, last one 8 months ago
精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总
Created
2018-12-04
14,369 commits to master branch, last one 7 months ago
Datasets, tools, and benchmarks for representation learning of code.
This repository has been archived
(exclude archived)
Created
2019-02-28
286 commits to master branch, last one 2 years ago
Text Classification Algorithms: A Survey
deep-learning
random-forest
decision-trees
text-processing
rocchio-algorithm
boosting-algorithms
deep-belief-network
deep-neural-network
logistic-regression
text-classification
k-nearest-neighbours
nlp-machine-learning
naive-bayes-classifier
document-classification
support-vector-machines
dimensionality-reduction
conditional-random-fields
recurrent-neural-networks
convolutional-neural-networks
hierarchical-attention-networks
Created
2018-07-06
230 commits to master branch, last one about a month ago
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Created
2014-06-26
475 commits to master branch, last one about a year ago
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
Created
2020-04-04
569 commits to master branch, last one 10 months ago
自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Created
2020-09-05
197 commits to master branch, last one 11 months ago
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
Created
2020-11-27
105 commits to main branch, last one 3 months ago
End-to-end neural table-text understanding models.
Created
2020-03-31
65 commits to master branch, last one 2 years ago
A deep dive into embeddings starting from fundamentals
Created
2023-05-23
281 commits to main branch, last one 15 days ago
Rasa UI is a frontend for the Rasa Framework
Created
2017-05-09
267 commits to master branch, last one 4 years ago
Python AI assistant 🧠
Created
2019-05-16
651 commits to develop branch, last one 15 days ago
skweak: A software toolkit for weak supervision applied to NLP tasks
Created
2021-03-16
180 commits to main branch, last one 3 months ago
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
Created
2020-06-17
435 commits to main branch, last one 14 days ago
We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 Turbo (April 2024) and GPT-4o.
Created
2024-05-13
52 commits to main branch, last one 4 months ago
This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera
Created
2020-07-06
20 commits to master branch, last one 4 years ago
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Created
2023-07-24
540 commits to main branch, last one 2 months ago
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Created
2018-11-15
534 commits to main branch, last one about a month ago
BabyAI platform. A testbed for training agents to understand and execute language commands.
Created
2018-10-02
1,319 commits to master branch, last one about a year ago
Grounded search engine (i.e. with source reference) based on LLM / ChatGPT / OpenAI API. It supports web search, file content search etc.
Created
2023-02-11
212 commits to master branch, last one 3 months ago
Converse with book - Built with GPT-3
Created
2023-01-03
66 commits to main branch, last one about a year ago
Resources for learning about Text Mining and Natural Language Processing
Created
2016-11-09
245 commits to master branch, last one about a year ago
The Schema-Guided Dialogue Dataset
Created
2019-06-13
51 commits to master branch, last one about a year ago
The hands-on NLTK tutorial for NLP in Python
Created
2018-03-25
42 commits to main branch, last one about a year ago
Repository with all what is necessary for sentiment analysis and related areas
Created
2017-03-06
46 commits to master branch, last one about a year ago
Compendium of the resources available from top NLP conferences.
Created
2019-08-31
16 commits to master branch, last one 9 months ago