Trending repositories for topic llama

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

ollama/ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

98,623 (+429)

mit

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

30,534 (+232)

apache-2.0

hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

34,623 (+191)

apache-2.0

mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...

26,007 (+167)

mit

ggerganov/llama.cpp

LLM inference in C/C++

68,107 (+149)

mit

meta-llama/llama-recipes

Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...

15,261 (+108)

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318 (+107)

apache-2.0

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349 (+106)

apache-2.0

fishaudio/fish-speech

Brand new TTS solution

14,578 (+82)

HqWu-HITCS/Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

16,079 (+73)

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142 (+62)

apache-2.0

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+57)

apache-2.0

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

20,312 (+53)

apache-2.0

chatchat-space/Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Ll...

32,109 (+49)

apache-2.0

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

5,301 (+42)

apache-2.0

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

3,472 (+38)

bsd-2-clause

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...

5,437 (+35)

apache-2.0

TheR1D/shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.

9,706 (+34)

mit

datawhalechina/llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

1,566 (+32)

edwko/OuteTTS

Interface for OuteTTS models.

406 (+30)

apache-2.0

Last 3 days (relative gain)

electricpipelines/barq

Dabarqus is a stand alone application that implements a complete RAG solution.

46 (+15%)

e2b-dev/ai-analyst

Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts.

43 (+13%)

apache-2.0

Denis2054/RAG-Driven-Generative-AI

This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face models f...

78 (+13%)

mit

vinhnx/VT.ai

VT.ai - Minimal multimodal AI chat app with dynamic conversation routing

45 (+13%)

mit

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+10%)

apache-2.0

edwko/OuteTTS

Interface for OuteTTS models.

406 (+8%)

apache-2.0

zhanshijinwat/Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

335 (+6%)

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

621 (+6%)

ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

96 (+5%)

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349 (+5%)

apache-2.0

AdrianBZG/llama-multimodal-vqa

Multimodal Instruction Tuning for Llama 3

41 (+3%)

mit

MetaGLM/langchain-glm

基于 Langchain，快速集成GLM-4 AllTools 功能的插件

41 (+3%)

mit

premAI-io/premsql

End-to-End Local-First Text-to-SQL Pipelines

166 (+2%)

heshengtao/comfyui_LLM_party

LLM Agent Framework in ComfyUI includes Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai/gemini interfaces, such as o1,ol...

1,038 (+2%)

agpl-3.0

jmont-dev/ollama-hpp

Modern, Header-only C++ bindings for the Ollama API.

48 (+2%)

mit

datawhalechina/llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

1,566 (+2%)

Simatwa/python-tgpt

AI Chat in Terminal + Package + REST-API

117 (+2%)

mit

ArdaGnsrn/ollama-php

This is a PHP library for Ollama. Ollama is an open-source project that serves as a powerful and user-friendly platform for running LLMs on your local machine. It acts as a bridge between the complexi...

59 (+2%)

mit

openshieldai/openshield

OpenShield is a new generation security layer for AI models

61 (+2%)

apache-2.0

jakobdylanc/llmcord

A Discord LLM chat bot that supports any OpenAI compatible API (OpenAI, xAI, Mistral, Groq, OpenRouter, Ollama, LM Studio and more)

370 (+1%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

ollama/ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

98,623 (+990)

mit

mudler/LocalAI

26,007 (+652)

mit

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

30,534 (+425)

apache-2.0

hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

34,623 (+407)

apache-2.0

ggerganov/llama.cpp

LLM inference in C/C++

68,107 (+339)

mit

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318 (+244)

apache-2.0

fishaudio/fish-speech

Brand new TTS solution

14,578 (+181)

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349 (+171)

apache-2.0

meta-llama/llama-recipes

15,261 (+169)

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+144)

apache-2.0

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142 (+121)

apache-2.0

HqWu-HITCS/Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

16,079 (+121)

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

621 (+105)

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

20,312 (+103)

apache-2.0

chatchat-space/Langchain-Chatchat

32,109 (+103)

apache-2.0

zhanshijinwat/Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

335 (+102)

mlc-ai/web-llm-chat

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

466 (+93)

apache-2.0

modelscope/ms-swift

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vis...

4,300 (+92)

apache-2.0

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

5,301 (+83)

apache-2.0

xorbitsai/inference

5,437 (+62)

apache-2.0

Last week (relative gain)

zhanshijinwat/Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

335 (+44%)

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+30%)

apache-2.0

gpustack/llama-box

LLM inference server implementation based on llama.cpp.

34 (+26%)

mit

mlc-ai/web-llm-chat

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

466 (+25%)

apache-2.0

e2b-dev/ai-analyst

Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts.

43 (+23%)

apache-2.0

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

621 (+20%)

Denis2054/RAG-Driven-Generative-AI

78 (+18%)

mit

electricpipelines/barq

Dabarqus is a stand alone application that implements a complete RAG solution.

46 (+15%)

ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

96 (+13%)

premAI-io/premsql

End-to-End Local-First Text-to-SQL Pipelines

166 (+13%)

vinhnx/VT.ai

VT.ai - Minimal multimodal AI chat app with dynamic conversation routing

45 (+13%)

mit

guoriyue/LangCommand

LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.

111 (+12%)

mit

edwko/OuteTTS

Interface for OuteTTS models.

406 (+12%)

apache-2.0

bolna-ai/bolna

Full stack tools for building voice agents

78 (+10%)

mit

johnmai-dev/NotebookMLX

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

182 (+8%)

mit

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349 (+8%)

apache-2.0

tak-bro/aicommit2

A Reactive CLI that generates git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral and other AI

154 (+7%)

mit

balisujohn/localwriter

A LibreOffice Writer extension that adds local-inference generative AI features.

35 (+6%)

awaescher/OllamaSharp

The easiest way to use the Ollama API in .NET

576 (+5%)

mit

sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs

63 (+5%)

mit

Last month (new repositories)

edwko/OuteTTS

Interface for OuteTTS models.

406

apache-2.0

johnmai-dev/NotebookMLX

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

182

mit

ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

fofsinx/echoOLlama

🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect f...

e2b-dev/ai-analyst

Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts.

apache-2.0

Last month (absolute gain)

ollama/ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

98,623 (+4,391)

mit

meta-llama/llama-recipes

15,261 (+3,088)

mudler/LocalAI

26,007 (+1,924)

mit

hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

34,623 (+1,820)

apache-2.0

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

30,534 (+1,652)

apache-2.0

ggerganov/llama.cpp

LLM inference in C/C++

68,107 (+1,454)

mit

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318 (+1,154)

apache-2.0

fishaudio/fish-speech

Brand new TTS solution

14,578 (+1,096)

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

5,301 (+693)

apache-2.0

HqWu-HITCS/Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

16,079 (+526)

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

20,312 (+472)

apache-2.0

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142 (+460)

apache-2.0

chatchat-space/Langchain-Chatchat

32,109 (+448)

apache-2.0

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+410)

apache-2.0

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349 (+398)

apache-2.0

modelscope/ms-swift

4,300 (+380)

apache-2.0

edwko/OuteTTS

Interface for OuteTTS models.

406 (+346)

apache-2.0

xorbitsai/inference

5,437 (+301)

apache-2.0

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

621 (+282)

Hexastack/Hexabot

Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease.

496 (+258)

agpl-3.0

Last month (relative gain)

fofsinx/echoOLlama

59 (+1,375%)

guoriyue/LangCommand

LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.

111 (+1,010%)

mit

johnmai-dev/NotebookMLX

📋 NotebookMLX - An Open Source version of NotebookLM (Ported NotebookLlama)

182 (+727%)

mit

edwko/OuteTTS

Interface for OuteTTS models.

406 (+577%)

apache-2.0

tensorzero/tensorzero

TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.

620 (+195%)

apache-2.0

gpustack/llama-box

LLM inference server implementation based on llama.cpp.

34 (+162%)

mit

ggml-org/llama.vim

Vim plugin for LLM-assisted code/text completion

96 (+153%)

zhanshijinwat/Steel-LLM

Train a 1B LLM with 1T tokens from scratch by personal

335 (+109%)

Hexastack/Hexabot

Hexabot is an open-source AI chatbot / agent builder. It allows you to create and manage multi-channel and multilingual chatbots / agents with ease.

496 (+108%)

agpl-3.0

QuantiusBenignus/BlahST

Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline. Speak with local LLMs.

41 (+105%)

bsd-3-clause

cheahjs/free-llm-api-resources

A list of free LLM inference resources accessible via API.

621 (+83%)

Denis2054/RAG-Driven-Generative-AI

78 (+77%)

mit

mlc-ai/web-llm-chat

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

466 (+59%)

apache-2.0

bolna-ai/bolna

Full stack tools for building voice agents

78 (+47%)

mit

johnbean393/Sidekick

A native macOS app that allows users to chat with a local LLM with context of your files, folders and websites on your Mac without installing any other software.

33 (+43%)

mit

xiaoachen98/Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

395 (+43%)

ArdaGnsrn/ollama-php

59 (+40%)

mit

premAI-io/premsql

End-to-End Local-First Text-to-SQL Pipelines

166 (+38%)

sammcj/ingest

Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs

63 (+37%)

mit

balisujohn/localwriter

A LibreOffice Writer extension that adds local-inference generative AI features.

35 (+35%)

Last 12-months (new repositories)

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318

apache-2.0

SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

7,967

mit

reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

7,172

agpl-3.0

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142

apache-2.0

CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

4,041

AugustDev/enchanted

Enchanted is iOS and macOS app for chatting with private self hosted language models such as Llama2, Mistral or Vicuna using Ollama.

3,712

apache-2.0

linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

3,472

bsd-2-clause

mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

2,394

mit

WangRongsheng/awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,349

apache-2.0

SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

2,208

mit

datawhalechina/llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

1,566

SqueezeAILab/LLMCompiler

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

1,530

mit

FoundationVision/LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

1,324

mit

time-series-foundation-models/lag-llama

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

1,258

apache-2.0

fatwang2/search2ai

Help your LLMs online

1,068

mit

heshengtao/comfyui_LLM_party

1,038

agpl-3.0

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

857

agpl-3.0

undreamai/LLMUnity

Create characters in Unity with LLMs!

698

mit

TinyLLaVA/TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

657

apache-2.0

johnmai-dev/ChatMLX

🤖✨ChatMLX is a modern, open-source, high-performance chat application for MacOS based on large language models.

633

apache-2.0

Last 12-months (absolute gain)

ollama/ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

98,623 (+80,404)

mit

hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

34,623 (+27,834)

apache-2.0

ggerganov/llama.cpp

LLM inference in C/C++

68,107 (+23,657)

mit

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

30,534 (+20,565)

apache-2.0

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318 (+18,312)

apache-2.0

fishaudio/fish-speech

Brand new TTS solution

14,578 (+14,566)

chatchat-space/Langchain-Chatchat

32,109 (+13,519)

apache-2.0

mudler/LocalAI

26,007 (+12,556)

mit

HqWu-HITCS/Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

16,079 (+11,887)

meta-llama/llama-recipes

15,261 (+10,065)

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

20,312 (+9,786)

apache-2.0

SJTU-IPADS/PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

7,967 (+7,965)

mit

reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

7,172 (+7,171)

agpl-3.0

LlamaFamily/Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

14,026 (+7,043)

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF...

8,908 (+6,355)

apache-2.0

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142 (+6,119)

apache-2.0

arcee-ai/mergekit

Tools for merging pretrained large language models.

4,828 (+4,584)

lgpl-3.0

xorbitsai/inference

5,437 (+4,233)

apache-2.0

CrazyBoyM/llama3-Chinese-chat

Llama3、Llama3.1 中文仓库（随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档）

4,041 (+4,040)

lyogavin/airllm

AirLLM 70B inference with single 4GB GPU

5,301 (+3,998)

apache-2.0

Last 12-months (relative gain)

unslothai/unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

18,318 (+305,200%)

apache-2.0

fishaudio/fish-speech

Brand new TTS solution

14,578 (+121,383%)

ymcui/Chinese-LLaMA-Alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

1,701 (+28,250%)

apache-2.0

sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

6,142 (+26,604%)

apache-2.0

FoundationVision/Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

563 (+13,975%)

apache-2.0

SqueezeAILab/LLMCompiler

[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

1,530 (+13,809%)

mit

lenML/Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

857 (+12,143%)

agpl-3.0

JHubi1/ollama-app

A modern and easy-to-use client for Ollama

592 (+11,740%)

apache-2.0

UbiquitousLearning/mllm

Fast Multimodal LLM on Mobile Devices

534 (+10,580%)

mit

pytorch/ao

PyTorch native quantization and sparsity for training and inference

1,587 (+8,717%)

bsd-3-clause

tenstorrent/tt-metal

:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.

476 (+7,833%)

apache-2.0

cztomsik/ava

All-in-one desktop app for running LLMs locally.

421 (+6,917%)

Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

1,487 (+6,659%)

mit

apocas/restai

RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of LlamaIndex & Langchain. Supports any public LLM supported by LlamaIndex and any local LLM suported by Ollama/vLLM/etc. Precis...

386 (+6,333%)

apache-2.0

azkadev/llama

LLaMA (Language Learning for Machine Translation) adalah proyek riset yang diprakarsai oleh Facebook AI Research (FAIR) yang bertujuan untuk meningkatkan kualitas terjemahan mesin menggunakan pendekat...

358 (+5,867%)

mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

2,394 (+5,341%)

mit

SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

2,208 (+5,157%)

mit

shikiw/OPERA

[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation

288 (+4,700%)

mit

Picovoice/picollm

On-device LLM Inference Powered by X-Bit Quantization

189 (+4,625%)

apache-2.0

reid41/QA-Pilot

QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.

184 (+4,500%)

apache-2.0