usyd-fsalab / fp6_llm

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Date Created 2024-03-04 (8 months ago)
Commits 6 (last one 10 days ago)
Stargazers 195 (0 this week)
Watchers 6 (0 this week)
Forks 15
License apache-2.0
Ranking

RepositoryStats indexes 579,555 repositories, of these usyd-fsalab/fp6_llm is ranked #180,991 (69th percentile) for total stargazers, and #296,754 for total watchers. Github reports the primary language for this repository as Cuda, for repositories using this language it is ranked #107/335.

Other Information

usyd-fsalab/fp6_llm has Github issues enabled, there are 2 open issues and 9 closed issues.

Homepage URL: https://www.usenix.org/system/files/atc24-xia.pdf

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

6 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Cuda but there's also others...

updated: 2024-11-07 @ 02:14pm, id: 766704459 / R_kgDOLbL7Sw