66 results found

1.3k
12.0k
apache-2.0
99
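This project aims to share the technical principles and hands-on experience related to large models (LLM engineering and deploying LLM applications in practice)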
Created 2023-05-23
901 commits to main branch, last one 4 days ago
1.2k
11.2k
apache-2.0
194
Low-code framework for building custom LLMs, neural networks, and other AI models
Created 2018-12-27
3,861 commits to master branch, last one 2 months ago
531
6.9k
apache-2.0
70
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,187 commits to master branch, last one 13 hours ago
322
4.1k
apache-2.0
36
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created 2023-07-11
330 commits to main branch, last one about a month ago
418
4.1k
apache-2.0
80
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created 2023-04-17
450 commits to main branch, last one about a month ago
230
3.9k
bsd-2-clause
40
Efficient Triton Kernels for LLM Training
Created 2024-08-06
356 commits to main branch, last one a day ago
238
2.5k
other
41
Code examples and resources for DBRX, a large language model developed by Databricks
Created 2024-03-26
9 commits to main branch, last one 8 months ago
161
1.6k
mpl-2.0
12
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
Created 2022-01-04
2,359 commits to master branch, last one a day ago
DLRover: An Automatic Distributed Deep Learning System
Created 2022-06-24
2,864 commits to master branch, last one 4 days ago
NVIDIA GPU exporter for Prometheus using the nvidia-smi binary
Created 2021-06-08
625 commits to master branch, last one 2 days ago
64
771
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 5 months ago
34
680
apache-2.0
34
A PyTorch Native LLM Training Framework
Created 2024-02-26
36 commits to main branch, last one 3 months ago
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created 2023-05-28
668 commits to main branch, last one 14 days ago
144
483
mit
12
irresponsible innovation. Try now at https://chat.dev/
Created 2023-05-19
503 commits to main branch, last one 7 months ago
LLM (Large Language Model) FineTuning
Created 2023-10-22
165 commits to main branch, last one 7 months ago
84
451
agpl-3.0
10
Repo for fine-tuning Causal LLMs
Created 2021-08-07
166 commits to main branch, last one 8 months ago
30
439
unknown
5
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created 2023-09-26
152 commits to main branch, last one 2 months ago
26
390
apache-2.0
11
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created 2023-10-23
137 commits to main branch, last one about a year ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created 2024-03-27
225 commits to main branch, last one 4 days ago
54
317
apache-2.0
10
InternEvo is an open-source, lightweight training framework that aims to support model pre-training without the need for extensive dependencies.
Created 2024-01-16
493 commits to develop branch, last one 4 days ago
23
234
mit
8
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created 2024-01-11
485 commits to main branch, last one 2 days ago
15
232
apache-2.0
4
Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text)
Created 2023-04-28
79 commits to main branch, last one 11 months ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created 2023-09-30
1,104 commits to main branch, last one 3 days ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived
Created 2023-06-19
138 commits to main branch, last one 3 months ago
5
166
unknown
2
Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
Created 2024-06-19
91 commits to main branch, last one 4 months ago
8
153
apache-2.0
4
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Created 2024-06-18
1,075 commits to main branch, last one 6 days ago
Collection of text2cypher datasets, evaluations, and fine-tuning instructions
Created 2024-02-05
27 commits to main branch, last one 6 months ago
10
136
bsd-3-clause
7
seqax = sequence modeling + JAX
Created 2024-04-26
6 commits to main branch, last one 5 months ago
An LLM cookbook for building your own from scratch, all the way from gathering data to training a model
Created 2023-12-03
217 commits to main branch, last one 5 months ago