Statistics for topic llama
RepositoryStats tracks 595,858 GitHub repositories; of these, 545 are tagged with the llama topic. The most common primary language for repositories using this topic is Python (283). Other languages include Jupyter Notebook (48), TypeScript (35), C++ (24), JavaScript (20), Go (19), and Rust (16).
Stargazers over time for topic llama
Most starred repositories for topic llama
Trending repositories for topic llama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Zotero chat PDF with GPT, ChatGPT, Claude, Gemini
The first AI Agent Server: Eidolon is a pluggable Agent SDK and enterprise-ready deployment server for agentic applications
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models f...
Zotero chat PDF with GPT, ChatGPT, Claude, Gemini
A highly optimized LLM inference acceleration engine for Llama and its variants.
🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Simple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation
This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models f...
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
SGLang is a fast serving framework for large language models and vision language models.
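Llama3 and Llama3.1 Chinese repository (companion book in progress... interesting fine-tuned and heavily modified weights from community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)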
🧑‍🚀 Summary of the world's best LLM resources.
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
Private & local AI personal knowledge management app for high entropy people.
Phase 3 of the Chinese Llama/Alpaca LLM project (Chinese Llama-3 LLMs), developed from Meta Llama 3
SGLang is a fast serving framework for large language models and vision language models.
Enchanted is an iOS and macOS app for chatting with private, self-hosted language models such as Llama2, Mistral, or Vicuna using Ollama.
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization