b4rtaz / distributed-llama

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

Date Created 2023-12-04 (about a year ago)
Commits 292 (last one 3 days ago)
Stargazers 1,576 (14 this week)
Watchers 32 (0 this week)
Forks 111
License MIT
Ranking

RepositoryStats indexes 595,856 repositories. Of these, b4rtaz/distributed-llama is ranked #33,280 (94th percentile) for total stargazers and #68,406 for total watchers. GitHub reports the primary language for this repository as C++; among repositories using this language, it is ranked #1,767/31,836.

b4rtaz/distributed-llama is also tagged with popular topics. For these it is ranked: llm (#343/2913), neural-network (#160/1108), llms (#66/537), llama2 (#37/238), distributed-computing (#20/195), llm-inference (#25/171), llama3 (#29/169).

Other Information

b4rtaz/distributed-llama has GitHub issues enabled. There are 33 open issues and 32 closed issues.

There have been 29 releases. The latest one, named 0.11.1, was published on 2024-12-09 (12 days ago).

Star History

GitHub stargazers over time

Watcher History

GitHub watchers over time (collection started in '23)

Recent Commit History

292 commits on the default branch (main) since Jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is C++, but there are also others...


updated: 2024-12-21 @ 03:19pm, id: 727470807 / R_kgDOK1xS1w