5 results found Sort:

480
1.7k
gpl-3.0
91
MySQL Proxy using Java NIO based on Sharding SQL,Calcite ,simple and fast
Created 2017-08-04
4,199 commits to main branch, last one about a year ago
E-Mail Header Analyzer
Created 2016-04-25
59 commits to master branch, last one 4 years ago
141
589
gpl-3.0
20
Medical image processing in Python
Created 2012-05-11
413 commits to master branch, last one 8 months ago
Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.
Created 2024-08-14
2 commits to master branch, last one 16 days ago
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
Created 2023-08-16
1 commits to master branch, last one 26 days ago