123 results found Sort:

911
13.5k
apache-2.0
133
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN a...
Created 2021-08-08
717 commits to main branch, last one 7 days ago
742
11.7k
mit
69
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Created 2021-06-18
46 commits to main branch, last one 3 months ago
505
10.8k
mit
141
AI Code Completions
Created 2018-11-06
349 commits to master branch, last one 9 months ago
This repository contains demos I made with the Transformers library by HuggingFace.
Created 2020-08-31
435 commits to master branch, last one 3 months ago
960
8.3k
mit
177
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
This repository has been archived (exclude archived)
Created 2020-07-05
884 commits to master branch, last one 3 years ago
1.7k
7.6k
mit
162
Chinese version of GPT2 training code, using BERT tokenizer.
Created 2019-05-31
69 commits to old_gpt_2_chinese_before_2021_4_22 branch, last one about a year ago
465
7.5k
mit
100
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultr...
Created 2024-04-01
58 commits to main branch, last one 23 days ago
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Created 2019-05-25
428 commits to main branch, last one 23 days ago
433
3.3k
mit
29
An unnecessarily tiny implementation of GPT-2 in NumPy.
Created 2023-01-21
14 commits to main branch, last one about a year ago
523
3.1k
apache-2.0
74
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Created 2019-04-10
1,058 commits to master branch, last one 11 months ago
GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI思想)
Created 2019-12-09
25 commits to master branch, last one 2 years ago
222
2.8k
apache-2.0
40
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
Created 2020-01-25
1,118 commits to main branch, last one 6 months ago
205
2.6k
apache-2.0
34
Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJ...
Created 2023-03-19
593 commits to main branch, last one 6 months ago
435
2.4k
apache-2.0
64
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.
Created 2019-01-19
955 commits to v2-main branch, last one 3 years ago
372
2.4k
apache-2.0
77
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Created 2017-07-22
1,719 commits to master branch, last one 4 years ago
346
2.4k
mit
55
Large-scale pretraining for dialogue
Created 2019-08-29
83 commits to master branch, last one 2 years ago
Simple UI for LLM Model Finetuning
Created 2023-03-22
46 commits to master branch, last one about a year ago
256
1.8k
mit
28
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
Created 2020-06-08
51 commits to master branch, last one 2 years ago
Guide to using pre-trained large language models of source code
Created 2021-11-25
55 commits to main branch, last one about a year ago
🦄 State-of-the-Art Conversational AI with Transfer Learning
Created 2019-05-07
37 commits to master branch, last one 4 years ago
127
1.7k
mit
24
llama and other large language models on iOS and MacOS offline using GGML library.
Created 2023-06-14
313 commits to main branch, last one about a month ago
333
1.7k
apache-2.0
37
GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型
Created 2019-11-05
98 commits to master branch, last one 4 years ago
Visual Studio Code client for Tabnine. https://marketplace.visualstudio.com/items?itemName=TabNine.tabnine-vscode
Created 2018-10-01
1,687 commits to master branch, last one 2 months ago
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Created 2019-07-26
1,484 commits to master branch, last one 2 months ago
This Discord chatbot is incredibly versatile. Powered incredibly fast Groq API
This repository has been archived (exclude archived)
Created 2023-04-29
1,297 commits to main branch, last one 11 months ago
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Created 2024-11-29
38 commits to main branch, last one about a month ago
A curated list of NLP resources focused on Transformer networks, attention mechanism, GPT, BERT, ChatGPT, LLMs, and transfer learning.
Created 2019-01-13
264 commits to master branch, last one 5 months ago
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
Created 2022-09-26
110 commits to main branch, last one 11 months ago
97
1.1k
apache-2.0
44
This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.
Created 2024-12-20
6 commits to master branch, last one 2 months ago
This Word Does Not Exist
Created 2020-03-09
172 commits to master branch, last one 3 years ago