usyd-fsalab / fp6_llm

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).

Date Created 2024-03-04 (9 months ago)
Commits 6 (last one about a month ago)
Stargazers 220 (0 this week)
Watchers 7 (0 this week)
Forks 16
License apache-2.0
Ranking

RepositoryStats indexes 595,856 repositories, of these usyd-fsalab/fp6_llm is ranked #168,979 (72nd percentile) for total stargazers, and #271,583 for total watchers. Github reports the primary language for this repository as Cuda, for repositories using this language it is ranked #101/355.

Other Information

usyd-fsalab/fp6_llm has Github issues enabled, there are 3 open issues and 9 closed issues.

Homepage URL: https://www.usenix.org/system/files/atc24-xia.pdf

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

6 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Cuda but there's also others...

updated: 2024-12-18 @ 10:09pm, id: 766704459 / R_kgDOLbL7Sw