2 results found Sort:
A pytorch quantization backend for optimum
Created
2023-09-19
705 commits to main branch, last one 26 days ago
Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.
Created
2022-03-16
154 commits to master branch, last one 2 years ago