A high-throughput and memory-efficient inference and serving engine for LLMs
Created: 2023-02-09
1,568 commits to main branch, last one 22 hours ago