Trending repositories for topic language-model
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), automatic speech recognition (ASR), and text-to-speech ...
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. W...
The official gpt4free repository | a varied collection of powerful language models
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
A framework for few-shot evaluation of language models.
Chatbot for documentation that lets you chat with your data. Privately deployable; provides AI knowledge sharing and integrates knowledge into your AI workflow
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa...
The Scene Language: Representing Scenes with Programs, Words, and Embeddings (arXiv preprint)
[ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View
The original transformer implementation from scratch. It contains informative comments on each block
SELFormer: Molecular Representation Learning via SELFIES Language Models
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200 LLMs.
A complete guide to start and improve your LLM skills in 2024 with little background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
List of language agents based on the paper "Cognitive Architectures for Language Agents"
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world GitHub Issues?
Speech To Speech: an effort for an open-sourced and modular GPT-4o
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks"
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
[AI-Assistant] “通慧智教”: a large-model-empowered intelligent teaching website. Users interact with the site in natural language, by voice or text, through the AI teaching assistant 小慧 (Xiaohui).
A simple example implementation of the VoiceRAG pattern to power interactive voice generative AI experiences using RAG with Azure AI Search and Azure OpenAI's gpt-4o-realtime-preview model.
Unified Go interface for Language Model (LLM) providers. Simplifies LLM integration with flexible prompt management and common task functions.
Self-Repairing Autonomous Agent for Digital Consciousness Backup Using Large Language Models (LLM) and powerful code generation capability, self-editing source code and self-debugging its own source ...
ProAgent: Building Proactive Cooperative Agents with Large Language Models
This repository provides studies on the security of language models for code (CodeLMs).
vnc-lm is a Discord bot that integrates leading large language model APIs.
Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)
[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
A curriculum for learning about foundation models, from scratch to the frontier
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
Phi2-Chinese-0.2B: train your own small Chinese Phi2 model from scratch, with support for connecting to langchain to load a local knowledge base for retrieval-augmented generation (RAG).
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
A state-of-the-art open visual language model | multimodal pretrained model
Code and documentation to train Stanford's Alpaca models, and generate the data.
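A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks and includes a fine-tuning example for triple information extraction.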
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model, a unified and user-friendly shape-language model
Official Code for "Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction (CVPR 2024)"
A Survey on Data Selection for Language Models
GPT4V-level open-source multi-modal model based on Llama3-8B
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)