2 results found
A high-throughput and memory-efficient inference and serving engine for LLMs
Created 2023-02-09
1,748 commits to main branch, last one 15 hours ago
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
Created 2020-12-22
888 commits to main branch, last one 14 days ago