15 results found Sort:

4.5k
40.8k
apache-2.0
393
Making large AI models cheaper, faster and more accessible
Created 2021-10-28
3,809 commits to main branch, last one 8 hours ago
4.3k
38.0k
apache-2.0
351
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Created 2020-01-23
2,754 commits to master branch, last one 17 hours ago
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Created 2022-06-12
522 commits to main branch, last one 7 months ago
99
836
bsd-3-clause
32
A GPipe implementation in PyTorch
Created 2019-05-10
350 commits to master branch, last one 4 years ago
164
465
apache-2.0
22
飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。
Created 2018-12-12
440 commits to develop branch, last one about a year ago
23
437
unknown
6
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...
Created 2023-10-24
135 commits to master branch, last one about a month ago
56
401
apache-2.0
41
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Created 2021-10-25
358 commits to main branch, last one 5 months ago
64
379
apache-2.0
10
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
Created 2024-01-16
511 commits to develop branch, last one about a month ago
49
267
apache-2.0
12
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
Created 2022-02-23
21 commits to main branch, last one 2 years ago
A curated list of awesome projects and papers for distributed training or inference
Created 2022-04-22
33 commits to main branch, last one 6 months ago
13
160
apache-2.0
5
Serving Inside Pytorch
Created 2023-10-24
348 commits to v0 branch, last one about a month ago
13
88
apache-2.0
10
Decentralized LLMs fine-tuning and inference with offloading
Created 2025-01-23
49 commits to main branch, last one about a month ago
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Created 2023-06-14
554 commits to main branch, last one about a year ago
8
62
gpl-3.0
1
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
Created 2021-05-30
19 commits to main branch, last one 29 days ago
7
41
unknown
3
FTPipe and related pipeline model parallelism research.
Created 2020-06-29
2,372 commits to master branch, last one about a year ago