2 results found Sort:

29
373
apache-2.0
12
Advanced Quantization Algorithm for LLMs/VLMs.
Created 2024-01-04
404 commits to main branch, last one a day ago
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Created 2023-04-26
713 commits to main branch, last one 22 days ago