148 results found

4.5k
36.5k
apache-2.0
218
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Created 2023-05-28
2,482 commits to main branch, last one 2 days ago
1.9k
18.5k
apache-2.0
184
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Created 2023-03-15
556 commits to main branch, last one 7 months ago
1.1k
13.1k
mit
124
Faster Whisper transcription with CTranslate2
Created 2023-02-11
242 commits to master branch, last one 9 days ago
1.2k
8.4k
mit
109
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant
Created 2022-11-23
143 commits to main branch, last one about a month ago
487
5.3k
other
131
Lossy PNG compressor — pngquant command based on libimagequant library
Created 2009-09-17
1,206 commits to main branch, last one 5 months ago
491
4.6k
mit
31
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Created 2023-04-13
764 commits to main branch, last one 6 days ago
801
4.4k
apache-2.0
132
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
This repository has been archived
Created 2018-04-24
643 commits to master branch, last one about a year ago
309
3.5k
mit
60
Fast inference engine for Transformer models
Created 2019-09-23
2,189 commits to master branch, last one 3 days ago
176
3.1k
other
58
Sparsity-aware deep learning inference runtime for CPUs
Created 2020-12-14
1,052 commits to main branch, last one 5 months ago
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
Created 2019-12-02
162 commits to master branch, last one 11 months ago
448
2.9k
apache-2.0
164
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
This repository has been archived
Created 2018-05-17
957 commits to master branch, last one 2 years ago
Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)
Created 2017-04-28
17 commits to master branch, last one 4 years ago
486
2.6k
apache-2.0
57
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
Created 2021-07-20
1,152 commits to main branch, last one a day ago
207
2.6k
apache-2.0
33
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJ...
Created 2023-03-19
593 commits to main branch, last one 2 months ago
Run Mixtral-8x7B models in Colab or consumer desktops
Created 2023-12-15
86 commits to master branch, last one 11 months ago
258
2.3k
apache-2.0
33
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,699 commits to master branch, last one 2 days ago
478
2.2k
mit
41
micronet, a model compression and deploy lib. compression: 1. quantization: quantization-aware-training (QAT), High-Bit (>2b) (DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
Created 2019-12-04
295 commits to master branch, last one 3 years ago
391
2.2k
other
51
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Created 2020-04-21
2,434 commits to develop branch, last one 17 hours ago
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and the project is continuously being improved. PRs adding new works are welcome (pape...
Created 2018-10-18
299 commits to master branch, last one about a month ago
188
1.7k
bsd-3-clause
43
PyTorch native quantization and sparsity for training and inference
Created 2023-11-03
866 commits to main branch, last one 18 hours ago
A Python package that extends the official PyTorch to make it easy to obtain better performance on Intel platforms
Created 2020-04-15
2,396 commits to main branch, last one 3 days ago
239
1.6k
apache-2.0
17
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Created 2021-12-30
291 commits to master branch, last one 9 months ago
347
1.6k
apache-2.0
91
PaddleSlim is an open-source library for deep model compression and architecture search.
Created 2019-12-16
1,246 commits to develop branch, last one 17 days ago
231
1.5k
apache-2.0
22
OpenMMLab Model Compression Toolbox and Benchmark.
Created 2021-12-22
229 commits to main branch, last one about a year ago
324
1.5k
apache-2.0
120
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created 2018-10-31
834 commits to master branch, last one 5 days ago
100
1.4k
mit
23
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Created 2023-03-30
421 commits to master branch, last one 4 months ago
199
1.2k
other
34
Brevitas: neural network quantization in PyTorch
Created 2018-07-10
1,406 commits to master branch, last one 2 months ago
Efficient computing methods developed by Huawei Noah's Ark Lab
Created 2019-09-04
157 commits to master branch, last one about a month ago
60
1.2k
unknown
7
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Created 2023-09-12
72 commits to main branch, last one 17 days ago