5 results found Sort:

479
1.7k
gpl-3.0
93
MySQL Proxy using Java NIO based on Sharding SQL,Calcite ,simple and fast
Created 2017-08-04
4,199 commits to main branch, last one about a year ago
E-Mail Header Analyzer
Created 2016-04-25
59 commits to master branch, last one 4 years ago
141
583
gpl-3.0
21
Medical image processing in Python
Created 2012-05-11
413 commits to master branch, last one 6 months ago
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
Created 2023-08-16
1 commits to master branch, last one 5 months ago
Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the decoding stage of LLM inference.
Created 2024-08-14
1 commits to master branch, last one 3 months ago