Search Results - RepositoryStats

LLaMA-Factory hiyouga

5.6k

45.8k

apache-2.0

252

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Created 2023-05-28

2,738 commits to main branch, last one a day ago

Chinese-LLaMA-Alpaca ymcui

1.9k

18.8k

apache-2.0

183

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

llm nlp plm lora llama alpaca llama-2 alpaca-2 quantization large-language-models pre-trained-language-models

Created 2023-03-15

556 commits to main branch, last one 11 months ago

faster-whisper SYSTRAN

1.3k

15.1k

mit

134

Faster Whisper transcription with CTranslate2

openai whisper inference transformer quantization deep-learning speech-to-text speech-recognition

Created 2023-02-11

246 commits to master branch, last one 12 days ago

Qbot UFund-Me

1.6k

10.9k

mit

128

[🔥updating ...] AI 自动量化交易机器人(完全本地部署) AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

qlib funds bitcoin fintech pytrade trade-bot blockchain strategies trademarks quant-trade quant-trader quantization deep-learning machine-learning quantitative-finance quantitative-trading

Created 2022-11-23

143 commits to main branch, last one 4 months ago

bitsandbytes bitsandbytes-foundation

680

6.9k

mit

51

Accessible large language models via k-bit quantization for PyTorch.

llm qlora pytorch quantization machine-learning

Created 2021-06-04

849 commits to main branch, last one a day ago

pngquant kornelski

490

5.3k

other

129

Lossy PNG compressor — pngquant command based on libimagequant library

c png stdin palette quality smaller pngquant conversion quantization png-compression image-optimization

Created 2009-09-17

1,207 commits to main branch, last one 2 months ago

AutoGPTQ AutoGPTQ

512

4.8k

mit

30

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

nlp llms pytorch inference transformer quantization transformers deep-learning large-language-models

Created 2023-04-13

769 commits to main branch, last one 15 days ago

distiller IntelLabs

804

4.4k

apache-2.0

130

Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller

onnx pruning pytorch early-exit group-lasso distillation quantization truncated-svd regularization jupyter-notebook pruning-structures network-compression deep-neural-networks automl-for-compression

This repository has been archived (exclude archived)

Created 2018-04-24

643 commits to master branch, last one about a year ago

CTranslate2 OpenNMT

342

3.7k

mit

58

Fast inference engine for Transformer models

Created 2019-09-23

2,194 commits to master branch, last one 4 days ago

deepsparse neuralmagic

181

3.1k

other

55

Sparsity-aware deep learning inference runtime for CPUs

nlp cpus onnx pruning inference deepsparse performance quantization llm-inference sparsification computer-vision machinelearning object-detection pretrained-models

Created 2020-12-14

1,052 commits to main branch, last one 8 months ago

Pretrained-Language-Model huawei-noah

633

3.1k

unknown

57

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

quantization model-compression pretrained-models knowledge-distillation large-scale-distributed

Created 2019-12-02

162 commits to master branch, last one about a year ago

nlp-architect IntelLabs

448

2.9k

apache-2.0

162

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

nlp nlu bert dynet pytorch tensorflow deeplearning quantization transformers deep-learning

This repository has been archived (exclude archived)

Created 2018-05-17

957 commits to master branch, last one 2 years ago

optimum huggingface

516

2.8k

apache-2.0

54

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

onnx intel habana tflite pytorch training graphcore inference onnxruntime optimization quantization transformers

Created 2021-07-20

1,190 commits to main branch, last one 25 days ago

pytorch-playground aaron-xichen

614

2.7k

mit

52

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

pytorch quantization pytorch-tutorial pytorch-tutorials

Created 2017-04-28

17 commits to master branch, last one 4 years ago

xTuring stochasticai

206

2.6k

apache-2.0

34

Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJ...

llm lora peft gpt-2 gpt-j llama alpaca gen-ai adapter mistral finetuning fine-tuning quantization deep-learning generative-ai language-model mixed-precision

Created 2023-03-19

593 commits to main branch, last one 6 months ago

neural-compressor intel

263

2.4k

apache-2.0

32

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

awq fp4 gptq int4 int8 pruning mxformat sparsity sparsegpt auto-tuning smoothquant quantization low-precision large-language-models knowledge-distillation post-training-quantization quantization-aware-training

Created 2020-07-21

3,739 commits to master branch, last one 12 hours ago

mixtral-offloading dvmazur

233

2.3k

mit

28

Run Mixtral-8x7B models in Colab or consumer desktops

llm pytorch offloading google-colab quantization deep-learning colab-notebook language-model mixture-of-experts

Created 2023-12-15

86 commits to master branch, last one about a year ago

aimet quic

398

2.3k

other

49

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

auto-ml pruning opensource compression open-source quantization deep-learning machine-learning network-compression deep-neural-networks network-quantization

Created 2020-04-21

2,659 commits to develop branch, last one 23 hours ago

micronet 666DZY666

476

2.2k

mit

40

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...

Created 2019-12-04

295 commits to master branch, last one 3 years ago

Awesome-Model-Quantization Efficient-ML

220

2.0k

unknown

65

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...

awesome quantization deep-learning binary-network model-compression model-acceleration model-quantization efficient-deep-learning binarized-neural-networks lightweight-neural-network

Created 2018-10-18

301 commits to master branch, last one 28 days ago

ao pytorch

235

1.9k

bsd-3-clause

43

PyTorch native quantization and sparsity for training and inference

mx brrr cuda llama dtypes float8 pytorch sparsity training inference optimizer offloading transformer quantization

Created 2023-11-03

1,168 commits to main branch, last one 4 days ago

intel-extension-for-pytorch intel

264

1.8k

apache-2.0

35

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

intel pytorch quantization deep-learning neural-network machine-learning

Created 2020-04-15

2,493 commits to main branch, last one 5 days ago

ppq OpenPPL

249

1.7k

apache-2.0

16

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

cuda onnx caffe pytorch open-source quantization deep-learning neural-network

Created 2021-12-30

291 commits to master branch, last one about a year ago

PaddleSlim PaddlePaddle

351

1.6k

apache-2.0

90

PaddleSlim is an open-source library for deep model compression and architecture search.

nas bert ernie yolov5 yolov6 yolov7 pruning sparsity tensorrt detection compression transformer distillation quantization segmentation

Created 2019-12-16

1,246 commits to develop branch, last one 3 months ago

mmrazor open-mmlab

236

1.6k

apache-2.0

21

OpenMMLab Model Compression Toolbox and Benchmark.

nas spos darts pruning pytorch autoslim detection quantization segmentation classification knowledge-distillation

Created 2021-12-22

229 commits to main branch, last one about a year ago

model-optimization tensorflow

325

1.5k

apache-2.0

117

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

ml keras pruning sparsity tensorflow compression optimization quantization deep-learning machine-learning model-compression quantized-networks quantized-training quantized-neural-networks

Created 2018-10-31

837 commits to master branch, last one about a month ago