Bruce-Lee-LY / flash_attention_inference

Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.

Date Created 2023-08-16 (about a year ago)
Commits 1 (last one 5 months ago)
Stargazers 35 (0 this week)
Watchers 1 (0 this week)
Forks 3
License bsd-3-clause
Ranking

RepositoryStats indexes 616,225 repositories, of these Bruce-Lee-LY/flash_attention_inference is ranked #561,530 (9th percentile) for total stargazers, and #556,869 for total watchers. Github reports the primary language for this repository as C++, for repositories using this language it is ranked #30,548/32,944.

Bruce-Lee-LY/flash_attention_inference is also tagged with popular topics, for these it's ranked: llm (#2,797/3251),  gpu (#901/944),  cuda (#636/676),  nvidia (#310/325),  inference (#301/319)

Other Information

Bruce-Lee-LY/flash_attention_inference has Github issues enabled, there are 2 open issues and 2 closed issues.

Star History

Github stargazers over time

3535303025252020151510105500Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25

Watcher History

Github watchers over time, collection started in '23

2222111111000015 Nov15 NovDec '24Dec '2415 Dec15 DecJan '25Jan '2515 Jan15 JanFeb '25Feb '2515 Feb15 Feb

Recent Commit History

1 commits on the default branch (master) since jan '22

1111110.50.500000015 Sep15 SepOct '24Oct '2415 Oct15 OctNov '24Nov '2415 Nov15 NovDec '24Dec '2415 Dec15 DecJan '25Jan '2515 Jan15 JanFeb '25Feb '2515 Feb15 Feb

Yearly Commits

Commits to the default branch (master) per year

1111110.50.500000020242024

Issue History

Total Issues
Open Issues
Closed Issues
443.53.5332.52.5221.51.5110.50.500Oct '23Oct '23Nov '23Nov '23Dec '23Dec '2320242024Feb '24Feb '24Mar '24Mar '24Apr '24Apr '24May '24May '24Jun '24Jun '24Jul '24Jul '24Aug '24Aug '24Sep '24Sep '24Oct '24Oct '24Nov '24Nov '24Dec '24Dec '2420252025Feb '25Feb '25

Languages

The primary language is C++ but there's also others...

C++C++ShellShellCCPythonPythonCudaCudaCMakeCMake

updated: 2025-02-16 @ 08:29pm, id: 679281575 / R_kgDOKH0Dpw