1 result found Sort:
A high-throughput and memory-efficient inference and serving engine for LLMs
Created
2023-02-09
1,748 commits to main branch, last one 15 hours ago