64 results found Sort:

1.2k
11.2k
apache-2.0
194
Low-code framework for building custom LLMs, neural networks, and other AI models
Created 2018-12-27
3,861 commits to master branch, last one about a month ago
1.1k
10.8k
apache-2.0
95
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Created 2023-05-23
898 commits to main branch, last one 17 days ago
513
6.8k
apache-2.0
70
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Created 2021-08-11
2,135 commits to master branch, last one a day ago
418
4.0k
apache-2.0
81
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Created 2023-04-17
450 commits to main branch, last one 29 days ago
311
4.0k
apache-2.0
34
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
Created 2023-07-11
330 commits to main branch, last one 13 days ago
206
3.5k
bsd-2-clause
40
Efficient Triton Kernels for LLM Training
Created 2024-08-06
299 commits to main branch, last one a day ago
237
2.5k
other
41
Code examples and resources for DBRX, a large language model developed by Databricks
Created 2024-03-26
9 commits to main branch, last one 7 months ago
153
1.6k
mpl-2.0
11
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, & TPU.
Created 2022-01-04
2,285 commits to master branch, last one 20 hours ago
DLRover: An Automatic Distributed Deep Learning System
Created 2022-06-24
2,824 commits to master branch, last one a day ago
Nvidia GPU exporter for prometheus using nvidia-smi binary
Created 2021-06-08
613 commits to master branch, last one a day ago
64
761
apache-2.0
7
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Created 2022-09-01
29 commits to main branch, last one 4 months ago
34
669
apache-2.0
34
A PyTorch Native LLM Training Framework
Created 2024-02-26
36 commits to main branch, last one 2 months ago
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
Created 2023-05-28
667 commits to main branch, last one 2 days ago
145
482
mit
12
irresponsible innovation. Try now at https://chat.dev/
Created 2023-05-19
503 commits to main branch, last one 6 months ago
LLM (Large Language Model) FineTuning
Created 2023-10-22
165 commits to main branch, last one 6 months ago
84
449
agpl-3.0
10
Repo for fine-tuning Casual LLMs
Created 2021-08-07
166 commits to main branch, last one 7 months ago
30
438
unknown
5
The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.
Created 2023-09-26
152 commits to main branch, last one about a month ago
26
390
apache-2.0
11
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
Created 2023-10-23
137 commits to main branch, last one 11 months ago
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
Created 2024-03-27
214 commits to main branch, last one 2 days ago
52
310
apache-2.0
10
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
Created 2024-01-16
479 commits to develop branch, last one 3 days ago
15
232
apache-2.0
4
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
Created 2023-04-28
79 commits to main branch, last one 10 months ago
22
227
mit
8
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
Created 2024-01-11
459 commits to main branch, last one 19 hours ago
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
Created 2023-09-30
1,091 commits to main branch, last one a day ago
Finetune LLMs on K8s by using Runbooks
This repository has been archived (exclude archived)
Created 2023-06-19
138 commits to main branch, last one 2 months ago
5
167
unknown
2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
Created 2024-06-19
91 commits to main branch, last one 3 months ago
collection of text2cypher datasets, evaluations, and finetuning instructions
Created 2024-02-05
27 commits to main branch, last one 5 months ago
10
133
bsd-3-clause
7
seqax = sequence modeling + JAX
Created 2024-04-26
6 commits to main branch, last one 4 months ago
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
Created 2023-12-03
217 commits to main branch, last one 4 months ago
Generate ideal question-answers for testing RAG
Created 2023-07-04
59 commits to master branch, last one 4 months ago