11 results found

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (pape...
Created 2018-10-18
299 commits to master branch, last one 19 days ago
A curated list for Efficient Large Language Models
Created 2023-05-22
508 commits to main branch, last one 3 days ago
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Created 2023-12-26
116 commits to main branch, last one 8 months ago
A beginner's introductory tutorial on model compression
Created 2023-12-28
136 commits to main branch, last one a day ago
This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.
Created 2020-04-29
143 commits to master branch, last one about a year ago
A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, including language and vision, and we are continuously improving the project. Welcom...
Created 2023-05-22
49 commits to main branch, last one 19 days ago
[WINNER! 🏆] Psychopathology FER Assistant. Because mental health matters. My project submission for #TFWorld TF 2.0 Challenge at Devpost.
Created 2019-11-27
43 commits to master branch, last one about a year ago
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binarization.
Created 2023-05-01
10 commits to main branch, last one 8 months ago
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low-bit Quantization for Efficient Image Super-Resolution.
Created 2023-10-19
3 commits to main branch, last one 6 months ago
Chat with LLaMa 2 and get responses backed by reference documents retrieved from a vector database. Uses a locally available model with GPTQ 4-bit quantization.
Created 2023-06-02
78 commits to main branch, last one 12 months ago
The official implementation of the ICML 2023 paper OFQ-ViT
Created 2023-05-09
22 commits to main branch, last one about a year ago