40 results found
- Results by primary language:
- C++ (18)
- Python (10)
- JavaScript (3)
- TypeScript (2)
- HTML (1)
- Kotlin (1)
- Go (1)
- Rust (1)
- Shell (1)
- Swift (1)
- Dart (1)
LLM inference in C/C++
Created
2023-03-10
4,374 commits to master branch, last one 10 hours ago
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models
This repository has been archived
Created
2023-03-13
1,093 commits to main branch, last one 6 months ago
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Created
2023-06-14
1,029 commits to main branch, last one a day ago
Stable Diffusion and Flux in pure C/C++
Created
2023-08-13
173 commits to master branch, last one 21 days ago
Llama and other large language models offline on iOS and macOS, using the GGML library.
Created
2023-06-14
309 commits to main branch, last one 4 days ago
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Created
2023-03-30
421 commits to master branch, last one 4 months ago
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Created
2023-09-12
72 commits to main branch, last one 17 days ago
Suno AI's Bark model in C/C++ for fast text-to-speech generation
Created
2023-07-01
109 commits to main branch, last one about a month ago
Run inference on MPT-30B using CPU
Created
2023-06-26
22 commits to main branch, last one about a year ago
Whisper Dart is a cross-platform library for Dart and Flutter that converts audio to text (speech-to-text) using inference from OpenAI models
Created
2022-10-05
11 commits to main branch, last one 8 days ago
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
Created
2023-07-15
15 commits to master branch, last one about a year ago
Self-evaluating interview for AI coders
Created
2023-05-27
746 commits to main branch, last one 3 days ago
CLIP inference in plain C/C++ with no extra dependencies
Created
2023-04-28
87 commits to main branch, last one 4 months ago
Large Language Models for All, 🦙 Cult and More. Stay in touch!
Created
2023-03-30
27 commits to main branch, last one about a year ago
WIP text-to-speech library based on Suno AI's Bark, in C/C++ for fast inference
Created
2023-07-16
23 commits to main branch, last one 8 months ago
General AI library for Dart & Flutter
Created
2024-02-24
19 commits to main branch, last one 8 months ago
Vision Transformer (ViT) inference in plain C/C++ with ggml
Created
2023-11-02
110 commits to main branch, last one 10 months ago
Chat with your data privately using MPT-30B
Created
2023-06-28
3 commits to main branch, last one about a year ago
A ggml (C++) re-implementation of tortoise-tts
Created
2023-11-25
539 commits to master branch, last one 4 months ago
llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2
Created
2023-04-01
731 commits to main branch, last one 4 days ago
Run inference on replit-3B code instruct model using CPU
Created
2023-06-27
22 commits to main branch, last one about a year ago
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
Created
2023-05-04
491 commits to main branch, last one 4 months ago
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
This repository has been archived
Created
2023-05-23
154 commits to main branch, last one 10 months ago
Workbench for learning and practising AI tech in real scenarios on Android devices, powered by GGML (Georgi Gerganov Machine Learning), NCNN (Tencent NCNN), and FFmpeg
Created
2021-05-27
2,986 commits to master branch, last one 6 months ago
Obsidian Local LLM is a plugin for Obsidian that provides access to a powerful neural network, allowing users to generate text in a wide range of styles and formats using a local LLM.
Created
2023-04-30
14 commits to main branch, last one about a year ago
LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.
Created
2024-10-03
3,950 commits to main branch, last one about a month ago
Running any GGUF SLMs/LLMs locally, on-device in Android
Created
2024-11-10
33 commits to main branch, last one 4 days ago
C++17 port of Demucs v3 (hybrid) and v4 (hybrid transformer) models with ggml and Eigen3
Created
2023-11-26
32 commits to main branch, last one 20 days ago
Your customized AI assistant - Personal assistants on any hardware! With llama.cpp, whisper.cpp, ggml, LLaMA-v2.
Created
2023-07-21
92 commits to master branch, last one about a year ago