Trending repositories for topic language-model
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your ...
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
A framework for few-shot evaluation of language models.
The official gpt4free repository | a collection of various powerful language models
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
Benchmarks, environments, and toolkits for general computer agents
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
Chinese-English edition of Andrew Ng's "ChatGPT Prompt Engineering for Developers" course
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
A Survey on Data Selection for Language Models
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
The official gpt4free repository | a collection of various powerful language models
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your ...
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
A framework for few-shot evaluation of language models.
A curriculum for learning about foundation models, from scratch to the frontier
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
CommonLit - Evaluate Student Summaries competition submission
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
Benchmarks, environments, and toolkits for general computer agents
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
A curriculum for learning about foundation models, from scratch to the frontier
Official Code for "Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction (CVPR 2024)"
Chinese-English edition of Andrew Ng's "ChatGPT Prompt Engineering for Developers" course
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
CommonLit - Evaluate Student Summaries competition submission
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
The official gpt4free repository | a collection of various powerful language models
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your ...
A framework for few-shot evaluation of language models.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?
An official implementation of ShareGPT4V: Improving Large Multi-modal Models with Better Captions
CommonLit - Evaluate Student Summaries competition submission
A web app and Python API for multi-modal RAG framework to ground LLMs on high-fidelity materials informatics. An agentic materials scientist powered by @materialsproject, @langchain-ai, and @openai
[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models
Official Code for "Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction (CVPR 2024)"
Benchmarks, environments, and toolkits for general computer agents
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Sparse and discrete interpretability tool for neural networks
A Survey on Data Selection for Language Models
🎯 A free LLM evaluation toolkit that assesses factual accuracy, context understanding, tone, and more, so you can see how well your LLM applications perform.
Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
Chinese-English edition of Andrew Ng's "ChatGPT Prompt Engineering for Developers" course
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
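[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
A small 0.2B-parameter Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.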
WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
A curriculum for learning about foundation models, from scratch to the frontier
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
List of language agents based on paper "Cognitive Architectures for Language Agents"
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
The official gpt4free repository | a collection of various powerful language models
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
A state-of-the-art-level open visual language model | multimodal pre-trained model
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your ...
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
GPT 3.5/4 with a Chat Web UI. No API key required.
A framework for few-shot evaluation of language models.
OpenAgents: An Open Platform for Language Agents in the Wild
Code and documentation to train Stanford's Alpaca models, and generate the data.
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
A state-of-the-art-level open visual language model | multimodal pre-trained model
InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.
OpenAgents: An Open Platform for Language Agents in the Wild
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
List of language agents based on paper "Cognitive Architectures for Language Agents"
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world GitHub Issues?
A program synthesis agent that autonomously fixes its output by running tests!
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
[ICLR 2024] Lemur: Open Foundation Models for Language Agents