2 results found Sort:

📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
Created 2023-08-27
428 commits to main branch, last one 13 days ago
📖A curated list of Awesome Diffusion Inference Papers with codes, such as Sampling, Caching, Multi-GPUs, etc. 🎉🎉
Created 2024-01-14
47 commits to main branch, last one 9 days ago