2 results found Sort:

🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Created 2023-04-26
713 commits to main branch, last one about a month ago
13
80
apache-2.0
5
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
Created 2023-11-20
357 commits to main branch, last one 6 days ago