5 results found Sort:
- Filter by Primary Language:
- C++ (2)
- HTML (1)
- Java (1)
- Jupyter Notebook (1)
- +
MySQL Proxy using Java NIO based on Sharding SQL,Calcite ,simple and fast
Created
2017-08-04
4,199 commits to main branch, last one about a year ago
E-Mail Header Analyzer
Created
2016-04-25
59 commits to master branch, last one 4 years ago
Medical image processing in Python
Created
2012-05-11
413 commits to master branch, last one 6 months ago
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
Created
2023-08-16
1 commits to master branch, last one 5 months ago
Decoding Attention is specially optimized for multi head attention (MHA) using CUDA core for the decoding stage of LLM inference.
Created
2024-08-14
1 commits to master branch, last one 3 months ago