66 results found
- Filter by Primary Language:
- Python (44)
- Jupyter Notebook (11)
- Go (2)
- JavaScript (1)
- HTML (1)
- Scala (1)
- Shell (1)
- TypeScript (1)
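This project aims to share the technical principles and hands-on experience behind large models (large-model engineering and deploying large-model applications in production)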
Created
2023-05-23
901 commits to main branch, last one 4 days ago
Low-code framework for building custom LLMs, neural networks, and other AI models
Created
2018-12-27
3,861 commits to master branch, last one 2 months ago
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created
2021-08-11
2,187 commits to master branch, last one 13 hours ago
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created
2023-07-11
330 commits to main branch, last one about a month ago
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created
2023-04-17
450 commits to main branch, last one about a month ago
Efficient Triton Kernels for LLM Training
Created
2024-08-06
356 commits to main branch, last one a day ago
Code examples and resources for DBRX, a large language model developed by Databricks
Created
2024-03-26
9 commits to main branch, last one 8 months ago
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
Created
2022-01-04
2,359 commits to master branch, last one a day ago
DLRover: An Automatic Distributed Deep Learning System
Created
2022-06-24
2,864 commits to master branch, last one 4 days ago
NVIDIA GPU exporter for Prometheus, using the nvidia-smi binary
Created
2021-06-08
625 commits to master branch, last one 2 days ago
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created
2022-09-01
29 commits to main branch, last one 5 months ago
A PyTorch Native LLM Training Framework
Created
2024-02-26
36 commits to main branch, last one 3 months ago
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created
2023-05-28
668 commits to main branch, last one 14 days ago
irresponsible innovation. Try now at https://chat.dev/
Created
2023-05-19
503 commits to main branch, last one 7 months ago
LLM (Large Language Model) FineTuning
Created
2023-10-22
165 commits to main branch, last one 7 months ago
Repo for fine-tuning Causal LLMs
Created
2021-08-07
166 commits to main branch, last one 8 months ago
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created
2023-09-26
152 commits to main branch, last one 2 months ago
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created
2023-10-23
137 commits to main branch, last one about a year ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created
2024-03-27
225 commits to main branch, last one 4 days ago
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Created
2024-01-16
493 commits to develop branch, last one 4 days ago
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created
2024-01-11
485 commits to main branch, last one 2 days ago
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
Created
2023-04-28
79 commits to main branch, last one 11 months ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created
2023-09-30
1,104 commits to main branch, last one 3 days ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived
Created
2023-06-19
138 commits to main branch, last one 3 months ago
Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
Created
2024-06-19
91 commits to main branch, last one 4 months ago
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Created
2024-06-18
1,075 commits to main branch, last one 6 days ago
collection of text2cypher datasets, evaluations, and finetuning instructions
Created
2024-02-05
27 commits to main branch, last one 6 months ago
A collection of ICLR 2024 papers and open-source projects
Created
2024-05-14
20 commits to main branch, last one 7 months ago
seqax = sequence modeling + JAX
Created
2024-04-26
6 commits to main branch, last one 5 months ago
An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
Created
2023-12-03
217 commits to main branch, last one 5 months ago