Trending repositories for topic large-language-models
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss...
AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Containerized, state of the art Retrieval-Augmented Generation (RAG) system with a RESTful API
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
A lightweight Python tool to generate LLM-optimized knowledge graphs for your Python project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Mo...
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
Source code for <Large language models surpass human experts in predicting neuroscience results>
RAG-QA-Generator 是一个用于检索增强生成(RAG)系统的自动化知识库构建与管理工具。该工具通过读取文档数据,利用大规模语言模型生成高质量的问答对(QA对),并将这些数据插入数据库中,实现RAG系统知识库的自动化构建和管理。
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
This repository collects awesome survey, resource, and paper for Lifelong Learning with Large Language Models. (Updated Regularly)
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
OpenAI o1 advanced reasoning powered vulnerable web page generator for testing and educational purposes
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024
App-Controller: Allow users to manipulate your App with natural language
A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information
Automatic Ontology and Knowledge Graph construction with LLM
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
A Curated Collection of LLM resources (work in progress).
[NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
A lightweight Python tool to generate LLM-optimized knowledge graphs for your Python project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Mo...
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss...
AI Search & RAG Without Moving Your Data. Get instant answers from your company's knowledge across 100+ apps while keeping data secure. Deploy in minutes, not months.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
🧊 Open source LLM-Observability Platform for Developers. One-line integration for monitoring, metrics, evals, agent tracing, prompt management, playground, etc. Supports OpenAI SDK, Vercel AI SDK, An...
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
A lightweight Python tool to generate LLM-optimized knowledge graphs for your Python project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Mo...
Source code for <Large language models surpass human experts in predicting neuroscience results>
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents
App-Controller: Allow users to manipulate your App with natural language
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
SmartVscode: Controlling anything of Vscode by natural language
📰 Must-read papers on KV Cache Compression (constantly updating 🤗).
AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, da...
OpenAI o1 advanced reasoning powered vulnerable web page generator for testing and educational purposes
A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information
[EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning
Building LLM-Enabled Multi Agent Applications with AutoGen
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
A lightweight Python tool to generate LLM-optimized knowledge graphs for your Python project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Mo...
A Curated Collection of LLM resources (work in progress).
A Completely Modular LLM Reverse Engineering, Red Teaming, and Vulnerability Research Framework.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss...
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
The official implementation of MARS: Unleashing the Power of Variance Reduction for Training Large Models
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
App-Controller: Allow users to manipulate your App with natural language
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024
SmartVscode: Controlling anything of Vscode by natural language
Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."
Source code for <Large language models surpass human experts in predicting neuroscience results>
A lightweight Python tool to generate LLM-optimized knowledge graphs for your Python project | Code structure visualization | LLM Context Window Efficiency | Static analysis for AI | Large Language Mo...
[EMNLP 2024 Industry Track & KDD UrbComp 2024 Best Paper Award] ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
动手学Ollama,CPU玩转大模型部署,在线阅读地址:https://datawhalechina.github.io/handy-ollama/
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incor...
Repository demonstrating best practices and patterns for implementing agentic workflows in Python, featuring modular, scalable, and reusable design patterns for intelligent automation.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Start building LLM-empowered multi-agent applications in an easier way.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
Containerized, state of the art Retrieval-Augmented Generation (RAG) system with a RESTful API
Reverse Engineering: Decompiling Binary Code with Large Language Models
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 ...
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
🧑🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, ...
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss...
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Start building LLM-empowered multi-agent applications in an easier way.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simp...
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Start building LLM-empowered multi-agent applications in an easier way.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
[TMLR 2024] Efficient Large Language Models: A Survey
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization
Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
A generalized information-seeking agent system with Large Language Models (LLMs).