69 results found

1.5k
13.5k
apache-2.0
116
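This project aims to share the technical principles of large models and hands-on experience with them (large-model engineering and deploying large-model applications)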
Created 2023-05-23
904 commits to main branch, last one 18 days ago
1.2k
11.3k
apache-2.0
193
Low-code framework for building custom LLMs, neural networks, and other AI models
Created 2018-12-27
3,861 commits to master branch, last one 3 months ago
555
7.1k
apache-2.0
69
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,270 commits to master branch, last one 2 days ago
252
4.3k
bsd-2-clause
44
Efficient Triton Kernels for LLM Training
Created 2024-08-06
387 commits to main branch, last one 3 days ago
329
4.2k
apache-2.0
36
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created 2023-07-11
334 commits to main branch, last one 11 days ago
428
4.1k
apache-2.0
80
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created 2023-04-17
451 commits to main branch, last one 16 days ago
241
2.5k
other
42
Code examples and resources for DBRX, a large language model developed by Databricks
Created 2024-03-26
9 commits to main branch, last one 9 months ago
164
1.7k
mpl-2.0
11
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel acce...
Created 2022-01-04
2,438 commits to master branch, last one 2 days ago
DLRover: An Automatic Distributed Deep Learning System
Created 2022-06-24
2,909 commits to master branch, last one a day ago
Nvidia GPU exporter for prometheus using nvidia-smi binary
Created 2021-06-08
637 commits to master branch, last one 9 days ago
64
772
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 7 months ago
37
701
apache-2.0
32
A PyTorch Native LLM Training Framework
Created 2024-02-26
37 commits to main branch, last one about a month ago
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created 2023-05-28
674 commits to main branch, last one 14 days ago
LLM (Large Language Model) FineTuning
Created 2023-10-22
165 commits to main branch, last one 8 months ago
144
486
mit
12
irresponsible innovation. Try now at https://chat.dev/
Created 2023-05-19
503 commits to main branch, last one 8 months ago
84
454
agpl-3.0
11
Repo for fine-tuning Causal LLMs
Created 2021-08-07
166 commits to main branch, last one 10 months ago
30
440
unknown
5
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created 2023-09-26
152 commits to main branch, last one 3 months ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created 2024-03-27
232 commits to main branch, last one about a month ago
25
389
apache-2.0
11
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created 2023-10-23
137 commits to main branch, last one about a year ago
57
339
apache-2.0
10
InternEvo is an open-sourced lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Created 2024-01-16
500 commits to develop branch, last one 12 days ago
24
248
mit
8
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created 2024-01-11
496 commits to main branch, last one 4 days ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created 2023-09-30
1,113 commits to main branch, last one 2 days ago
15
237
apache-2.0
4
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
Created 2023-04-28
79 commits to main branch, last one about a year ago
11
208
apache-2.0
4
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Created 2024-06-18
1,078 commits to main branch, last one 20 days ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived
Created 2023-06-19
138 commits to main branch, last one 5 months ago
5
167
unknown
2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
Created 2024-06-19
91 commits to main branch, last one 6 months ago
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Created 2025-01-05
10 commits to master branch, last one 28 days ago
collection of text2cypher datasets, evaluations, and finetuning instructions
Created 2024-02-05
27 commits to main branch, last one 7 months ago
10
136
bsd-3-clause
7
seqax = sequence modeling + JAX
Created 2024-04-26
6 commits to main branch, last one 6 months ago