1 result found Sort:
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including x86 and ARMv9.
Created
2024-04-01
32 commits to main branch, last one 2 months ago