11 results found Sort:
- Filter by Primary Language:
- Python (5)
- Jupyter Notebook (2)
- C++ (1)
- +
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...
Created
2018-10-18
299 commits to master branch, last one 19 days ago
A curated list for Efficient Large Language Models
Created
2023-05-22
508 commits to main branch, last one 3 days ago
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Created
2023-12-26
116 commits to main branch, last one 8 months ago
模型压缩的小白入门教程
Created
2023-12-28
136 commits to main branch, last one a day ago
This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.
Created
2020-04-29
143 commits to master branch, last one about a year ago
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including language and vision, we are continuously improving the project. Welcom...
Created
2023-05-22
49 commits to main branch, last one 19 days ago
[WINNER! 🏆] Psychopathology FER Assistant. Because mental health matters. My project submission for #TFWorld TF 2.0 Challenge at Devpost.
Created
2019-11-27
43 commits to master branch, last one about a year ago
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.
Created
2023-05-01
10 commits to main branch, last one 8 months ago
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.
Created
2023-10-19
3 commits to main branch, last one 6 months ago
Chat to LLaMa 2 that also provides responses with reference documents over vector database. Locally available model using GPTQ 4bit quantization.
Created
2023-06-02
78 commits to main branch, last one 12 months ago
The official implementation of the ICML 2023 paper OFQ-ViT
Created
2023-05-09
22 commits to main branch, last one about a year ago