2 results found
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
3,290 commits to main branch, last one 8 hours ago
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
Created 2020-12-22
1,074 commits to main branch, last one 6 days ago