1 result found Sort:

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Created 2023-12-15
1,579 commits to main branch, last one 12 days ago