1 result found Sort:

2.9k
20.9k
apache-2.0
197
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
1,568 commits to main branch, last one 22 hours ago