16 results found

553
6.3k
mit
46
Use your locally running AI models to assist you in your web browsing
Created 2023-04-09
708 commits to main branch, last one 4 days ago
115
1.2k
other
21
A generalized information-seeking agent system with Large Language Models (LLMs).
Created 2023-11-13
27 commits to main branch, last one 10 months ago
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
Created 2025-01-08
253 commits to main branch, last one 8 days ago
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Created 2023-06-12
50 commits to main branch, last one about a year ago
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Created 2024-10-04
170 commits to main branch, last one 23 hours ago
30
341
unknown
10
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Created 2024-01-31
12 commits to main branch, last one 9 months ago
Run MemGPT-AutoGEN-Local LLM Together
Created 2023-10-31
5 commits to main branch, last one about a year ago
The .NET library to consume 100+ APIs: OpenAI, Anthropic, Google, DeepSeek, Cohere, Mistral, Azure, xAI, Perplexity, Groq, Ollama, LocalAi, and many more!
Created 2023-10-08
486 commits to master branch, last one 17 hours ago
6
108
apache-2.0
3
A nifty little library for working with Ollama in Elixir.
Created 2024-01-13
67 commits to main branch, last one 3 months ago
10
101
apache-2.0
5
The PyVisionAI Official Repo
Created 2024-11-26
142 commits to main branch, last one 2 months ago
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
Created 2024-02-29
1,164 commits to main branch, last one 28 days ago
MVP of an idea using multiple local LLM models to simulate and play D&D
Created 2024-05-16
17 commits to main branch, last one a day ago
OpenLocalUI: Native desktop app for Windows, MacOS and Linux. Easily run Large Language Models locally, no complex setups required. Inspired by OpenWebUI's simplicity for LLM use.
Created 2024-04-17
2 commits to main branch, last one 2 months ago
Run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM/other resources by exposing them on different ports and loading/unloading them on demand
Created 2024-07-21
126 commits to main branch, last one 2 months ago
Chat with your PDF using your local LLM, Ollama client. (incomplete)
Created 2024-10-07
41 commits to main branch, last one 6 months ago
The client for the Symmetry peer-to-peer inference network. Enabling users to connect with each other, share computational resources, and collect valuable machine learning data.
Created 2024-07-10
120 commits to master branch, last one about a month ago