2 results found

A PyTorch quantization backend for Optimum
Created 2023-09-19
503 commits to main branch, last one 5 hours ago

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.
Created 2022-03-16
154 commits to master branch, last one 2 years ago