2 results found Sort:
SGLang is a fast serving framework for large language models and vision language models.
Created
2024-01-08
1,662 commits to main branch, last one about an hour ago
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Created
2023-08-27
437 commits to main branch, last one 2 hours ago