13 results found Sort:

4.5k
40.8k
apache-2.0
392
Making large AI models cheaper, faster and more accessible
Created 2021-10-28
3,809 commits to main branch, last one 5 days ago
4.3k
38.1k
apache-2.0
352
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Created 2020-01-23
2,759 commits to master branch, last one 2 days ago
99
836
bsd-3-clause
32
A GPipe implementation in PyTorch
Created 2019-05-10
350 commits to master branch, last one 4 years ago
164
465
apache-2.0
22
飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。
Created 2018-12-12
440 commits to develop branch, last one about a year ago
56
402
apache-2.0
41
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Created 2021-10-25
358 commits to main branch, last one 5 months ago
15
298
apache-2.0
10
Slicing a PyTorch Tensor Into Parallel Shards
Created 2021-04-27
9 commits to main branch, last one 3 years ago
49
267
apache-2.0
12
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
Created 2022-02-23
21 commits to main branch, last one 2 years ago
A curated list of awesome projects and papers for distributed training or inference
Created 2022-04-22
33 commits to main branch, last one 6 months ago
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Created 2023-06-14
554 commits to main branch, last one about a year ago
8
65
apache-2.0
4
NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
Created 2022-12-19
674 commits to master branch, last one 4 months ago
Distributed training (multi-node) of a Transformer model
Created 2023-12-08
87 commits to main branch, last one about a year ago
SC23 Deep Learning at Scale Tutorial Material
Created 2023-08-28
85 commits to main branch, last one about a year ago
Distributed training of DNNs • C++/MPI Proxies (GPT-2, GPT-3, CosmoFlow, DLRM)
Created 2024-02-22
17 commits to main branch, last one 2 years ago