120 results found

864
12.7k
apache-2.0
132
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
Created 2021-08-08
645 commits to main branch, last one a day ago
689
10.8k
mit
70
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Created 2021-06-18
45 commits to main branch, last one 10 months ago
499
10.6k
mit
145
AI Code Completions
Created 2018-11-06
349 commits to master branch, last one 4 months ago
This repository contains demos I made with the Transformers library by HuggingFace.
Created 2020-08-31
431 commits to master branch, last one about a month ago
953
8.2k
mit
177
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
This repository has been archived
Created 2020-07-05
884 commits to master branch, last one 2 years ago
1.7k
7.5k
mit
161
Chinese version of GPT2 training code, using BERT tokenizer.
Created 2019-05-31
69 commits to old_gpt_2_chinese_before_2021_4_22 branch, last one 7 months ago
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
Created 2019-05-25
412 commits to main branch, last one 6 days ago
315
4.3k
mit
116
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Created 2024-04-01
43 commits to main branch, last one about a month ago
417
3.3k
mit
28
An unnecessarily tiny implementation of GPT-2 in NumPy.
Created 2023-01-21
14 commits to main branch, last one about a year ago
526
3.0k
apache-2.0
76
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Created 2019-04-10
1,058 commits to master branch, last one 6 months ago
GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)
Created 2019-12-09
25 commits to master branch, last one about a year ago
215
2.7k
apache-2.0
41
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Created 2020-01-25
1,118 commits to main branch, last one about a month ago
207
2.6k
apache-2.0
33
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJ...
Created 2023-03-19
593 commits to main branch, last one about a month ago
441
2.4k
apache-2.0
65
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
Created 2019-01-19
955 commits to v2-main branch, last one 3 years ago
374
2.4k
apache-2.0
79
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Created 2017-07-22
1,719 commits to master branch, last one 4 years ago
340
2.4k
mit
55
Large-scale pretraining for dialogue
Created 2019-08-29
83 commits to master branch, last one 2 years ago
Simple UI for LLM Model Finetuning
Created 2023-03-22
46 commits to master branch, last one 11 months ago
255
1.8k
mit
28
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Created 2020-06-08
51 commits to master branch, last one 2 years ago
Guide to using pre-trained large language models of source code
Created 2021-11-25
55 commits to main branch, last one about a year ago
🦄 State-of-the-Art Conversational AI with Transfer Learning
Created 2019-05-07
37 commits to master branch, last one 4 years ago
334
1.7k
apache-2.0
38
GPT2 for Multiple Languages, including pretrained models. Multilingual GPT2 support, with a 1.5-billion-parameter Chinese pretrained model
Created 2019-11-05
98 commits to master branch, last one 3 years ago
Visual Studio Code client for Tabnine. https://marketplace.visualstudio.com/items?itemName=TabNine.tabnine-vscode
Created 2018-10-01
1,686 commits to master branch, last one 4 months ago
87
1.4k
mit
17
Llama and other large language models on iOS and macOS, offline, using the GGML library.
Created 2023-06-14
308 commits to main branch, last one a day ago
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Created 2019-07-26
1,478 commits to master branch, last one 5 months ago
This Discord chatbot is incredibly versatile. Powered by the incredibly fast Groq API
This repository has been archived
Created 2023-04-29
1,297 commits to main branch, last one 6 months ago
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
Created 2019-01-13
264 commits to master branch, last one 25 days ago
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Created 2022-09-26
110 commits to main branch, last one 6 months ago
This Word Does Not Exist
Created 2020-03-09
172 commits to master branch, last one 2 years ago
Simple Text-Generator with OpenAI gpt-2 Pytorch Implementation
Created 2019-02-18
27 commits to master branch, last one 5 years ago
This is a repository that aims to provide updates on the status of jailbreaking the OpenAI GPT language model.
Created 2023-03-01
410 commits to main branch, last one 9 months ago