somewheresystems / dataclysm

Pull high-quality, efficient embeddings for PubMed, arXiv, and Wikipedia from Hugging Face and use them for local LLM inference and Retrieval-Augmented Generation (RAG)

Date Created 2023-12-28 (11 months ago)
Commits 22 (last one 10 months ago)
Stargazers 40 (0 this week)
Watchers 2 (0 this week)
Forks 2
License apache-2.0

Ranking

RepositoryStats indexes 595,856 repositories; of these, somewheresystems/dataclysm is ranked #520,769 (13th percentile) for total stargazers and #485,301 for total watchers. GitHub reports the primary language for this repository as Jupyter Notebook; among repositories using this language, it is ranked #14,340/17,543.

somewheresystems/dataclysm is also tagged with popular topics; for these it is ranked: llm (#2,383/2,913), data (#895/999), transformers (#753/849), rag (#441/532), huggingface (#302/383)

Star History

GitHub stargazers over time

Watcher History

GitHub watchers over time (collection started in 2023)

Recent Commit History

22 commits on the default branch (main) since Jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Jupyter Notebook, but there are also others...

updated: 2024-12-21 @ 10:27am, id: 736766145 / R_kgDOK-oowQ