2 results found Sort:

665
7.2k
apache-2.0
63
SGLang is a fast serving framework for large language models and vision language models.
Created 2024-01-08
1,662 commits to main branch, last one about an hour ago
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Created 2023-08-27
437 commits to main branch, last one 2 hours ago