Trending repositories for topic llm
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ...
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Champion at BUET CSE FEST 2024 Hackathon - EasyTrip is an AI-powered platform that simplifies travel planning by generating smart itineraries, integrating interactive maps and weather updates, and aut...
Nosia is a platform that allows you to run an AI model on your own data. It is designed to be easy to install and use.
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
RAGGENIE: An open-source, low-code platform to build custom Retrieval-Augmented Generation (RAG) Copilets with your own data. Simplify AI development with ease!
Python and TypeScript library for integrating the Stripe API into agentic workflows
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案
Dynamiq is an orchestration framework for agentic AI and LLM applications
High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included
Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. It supp...
Dabarqus is a stand alone application that implements a complete RAG solution.
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ...
A high-throughput and memory-efficient inference and serving engine for LLMs
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A cloud-native vector database, storage for next generation AI applications
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or ...
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Codai is an AI code assistant that helps developers through a session-based CLI, providing intelligent code suggestions, refactoring, and code reviews based on the full context of the project. It supp...
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
Python and TypeScript library for integrating the Stripe API into agentic workflows
Human-friendly framework to test and evaluate LLMs, RAGs and ChatBots.
Nosia is a platform that allows you to run an AI model on your own data. It is designed to be easy to install and use.
Dynamiq is an orchestration framework for agentic AI and LLM applications
Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案
SwarmZero's SDK for building AI agents, swarms of agents and much more.
RAGGENIE: An open-source, low-code platform to build custom Retrieval-Augmented Generation (RAG) Copilets with your own data. Simplify AI development with ease!
A sleek and user-friendly interface for interacting with Ollama models, built with Python and Gradio.
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Instant AI Git Commit message, Git changes summary from the CLI (no API key required)
Flexpilot - Open-Source, Native and a True GitHub Copilot Alternative for VS Code
Python and TypeScript library for integrating the Stripe API into agentic workflows
A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life.
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect f...
Open source AI analyst powered by E2B. Analyze your CSV files with Llama 3.1 and create interactive charts.
Champion at BUET CSE FEST 2024 Hackathon - EasyTrip is an AI-powered platform that simplifies travel planning by generating smart itineraries, integrating interactive maps and weather updates, and aut...
A sleek and user-friendly interface for interacting with Ollama models, built with Python and Gradio.
Automate browser-based workflows with LLMs and Computer Vision
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a ...
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or ...
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, tr...
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
An AI memory layer with short- and long-term storage, semantic clustering, and optional memory decay for context-aware applications.
Flexpilot - Open-Source, Native and a True GitHub Copilot Alternative for VS Code
Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
🦙 echoOLlama: A real-time voice AI platform powered by local LLMs. Features WebSocket streaming, voice interactions, and OpenAI API compatibility. Built with FastAPI, Redis, and PostgreSQL. Perfect f...
SwarmZero's SDK for building AI agents, swarms of agents and much more.
LangCommand is a local inference command-line tool that transforms natural language descriptions into shell commands.
Python and TypeScript library for integrating the Stripe API into agentic workflows
[EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges...
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
rewind.ai x cursor.com = your AI assistant that has all the context. 24/7 screen & voice recording for the age of super intelligence. get your data ready or be left behind
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Building a quick conversation-based search demo with Lepton AI.
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
The open source platform for AI-native application development.
SGLang is a fast serving framework for large language models and vision language models.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A high-throughput and memory-efficient inference and serving engine for LLMs
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Ll...
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
Start building LLM-empowered multi-agent applications in an easier way.
Open-source Next.js template for building apps that are fully generated by AI. By E2B.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
基于大模型的智能对话客服工具,支持微信、拼多多、千牛、哔哩哔哩、抖音企业号、抖音、抖店、微博聊天、小红书专业号运营、小红书、知乎等平台接入,可选择 GPT3.5/GPT4.0/ 懒人百宝箱 (后续会支持更多平台),能处理文本、语音和图片,通过插件访问操作系统和互联网等外部资源,支持基于自有知识库定制企业 AI 应用。
SDG is a specialized framework designed to generate high-quality structured tabular data.
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
The open source Tines / Splunk SOAR alternative for security engineers.
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
SGLang is a fast serving framework for large language models and vision language models.
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥