Trending repositories for topic llm
Video Processing Service is an automated video processing service that supports extracting audio from videos, generating subtitles, and embedding subtitles into the video.
Talk with your AWS using Claude. Model Context Protocol (MCP) server for AWS. Better Amazon Q alternative.
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Bringing BERT into modernity via both architecture changes and scaling
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, a...
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A high-throughput and memory-efficient inference and serving engine for LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
A modular graph-based Retrieval-Augmented Generation (RAG) system
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
AI as Workspace - 新一代 AI (LLM) 客户端,全功能、轻量级、可拓展。支持多服务商、文档解析、视频解析、多工作区、插件系统、跨平台、本地优先+实时云同步、动态提示词、自部署
An agent benchmark with tasks in a simulated software company.
Bringing BERT into modernity via both architecture changes and scaling
Video Processing Service is an automated video processing service that supports extracting audio from videos, generating subtitles, and embedding subtitles into the video.
Talk with your AWS using Claude. Model Context Protocol (MCP) server for AWS. Better Amazon Q alternative.
Build realtime voice agents with Google's new Gemini 2.0 (API is free for now)
A Model Context Protocol (MCP) server that helps LLMs manage reasoning, task management, and organization
Talk with your notes in Claude. RAG over your Apple Notes using Model Context Protocol.
A structured approach to building and guiding customer-facing AI agents
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
multispy is a lsp client library in Python intended to be used to build applications around language servers.
Instant AI Git Commit message, Git changes summary from the CLI (no API key required)
Video Processing Service is an automated video processing service that supports extracting audio from videos, generating subtitles, and embedding subtitles into the video.
Build realtime voice agents with Google's new Gemini 2.0 (API is free for now)
Talk with your notes in Claude. RAG over your Apple Notes using Model Context Protocol.
Talk with your AWS using Claude. Model Context Protocol (MCP) server for AWS. Better Amazon Q alternative.
A Model Context Protocol (MCP) server that helps LLMs manage reasoning, task management, and organization
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, a...
A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A high-throughput and memory-efficient inference and serving engine for LLMs
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
A modular graph-based Retrieval-Augmented Generation (RAG) system
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
AI as Workspace - 新一代 AI (LLM) 客户端,全功能、轻量级、可拓展。支持多服务商、文档解析、视频解析、多工作区、插件系统、跨平台、本地优先+实时云同步、动态提示词、自部署
Talk with your notes in Claude. RAG over your Apple Notes using Model Context Protocol.
An agent benchmark with tasks in a simulated software company.
Build realtime voice agents with Google's new Gemini 2.0 (API is free for now)
Bringing BERT into modernity via both architecture changes and scaling
A Model Context Protocol (MCP) server that helps LLMs manage reasoning, task management, and organization
Video Processing Service is an automated video processing service that supports extracting audio from videos, generating subtitles, and embedding subtitles into the video.
Talk with your AWS using Claude. Model Context Protocol (MCP) server for AWS. Better Amazon Q alternative.
A structured approach to building and guiding customer-facing AI agents
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Kheish: A no-code multi-agent LLM platform enabling easy agent creation, flexible workflows, external modules, and RAG-based large codebase analysis
This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.
A highly optimized LLM inference acceleration engine for Llama and its variants.
A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.
A lightweight, functional, and composable framework for building AI agents. No PhD required.
Video Processing Service is an automated video processing service that supports extracting audio from videos, generating subtitles, and embedding subtitles into the video.
Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models
Build realtime voice agents with Google's new Gemini 2.0 (API is free for now)
A smarter web fuzzing tool that combines local LLM models and ffuf to optimize directory and file discovery
Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding
A C++ implementation of Open Interpreter, based on llama.cpp. / Open Interpreter 的 C++ 实现,基于 llama.cpp
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, a...
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Open-source observability for your LLM application, based on OpenTelemetry
A high-throughput and memory-efficient inference and serving engine for LLMs
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
Unified framework for building enterprise RAG pipelines with small, specialized models
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
CEO (ceo-py) is an intuitive and modular AI agent framework for task automation.
AI as Workspace - 新一代 AI (LLM) 客户端,全功能、轻量级、可拓展。支持多服务商、文档解析、视频解析、多工作区、插件系统、跨平台、本地优先+实时云同步、动态提示词、自部署
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite
Awesome-RAG: Collect typical RAG papers and systems.
AgentKit is a framework for creating and orchestrating AI Agents, from single model inference calls to multi-agent systems which use tools.
MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler and analysis tools for LLMs.
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
Talk with your notes in Claude. RAG over your Apple Notes using Model Context Protocol.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
Build realtime voice agents with Google's new Gemini 2.0 (API is free for now)
A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024...
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
build ai agents that have the full context, open source, runs locally, developer friendly. 24/7 screen, mic, keyboard recording and control
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and context...
Building a quick conversation-based search demo with Lepton AI.
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
SGLang is a fast serving framework for large language models and vision language models.
The open source platform for AI-native application development.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress o...
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you q...
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
A modular graph-based Retrieval-Augmented Generation (RAG) system
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
A high-throughput and memory-efficient inference and serving engine for LLMs
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Start building LLM-empowered multi-agent applications in an easier way.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Open-source Next.js template for building apps that are fully generated by AI. By E2B.
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
基于大模型的智能对话客服工具,支持微信、拼多多、千牛、哔哩哔哩、抖音企业号、抖音、抖店、微博聊天、小红书专业号运营、小红书、知乎等平台接入,可选择 GPT3.5/GPT4.0/ 懒人百宝箱 (后续会支持更多平台),能处理文本、语音和图片,通过插件访问操作系统和互联网等外部资源,支持基于自有知识库定制企业 AI 应用。
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
The open source Tines / Splunk SOAR alternative for security engineers.
SDG is a specialized framework designed to generate high-quality structured tabular data.
TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compatib...
Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
SGLang is a fast serving framework for large language models and vision language models.