10 results found Sort:

4.3k
38.1k
apache-2.0
379
Making large AI models cheaper, faster and more accessible
Created 2021-10-28
3,357 commits to main branch, last one 22 hours ago
3.9k
33.2k
apache-2.0
335
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Created 2020-01-23
2,316 commits to master branch, last one 13 hours ago
22
531
other
20
A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead
Created 2019-07-20
460 commits to master branch, last one 5 months ago
158
425
apache-2.0
23
飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。
Created 2018-12-12
440 commits to develop branch, last one 9 months ago
55
376
apache-2.0
43
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Created 2021-10-25
348 commits to main branch, last one about a month ago
49
253
apache-2.0
13
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
Created 2022-02-23
21 commits to main branch, last one about a year ago
Orkhon: ML Inference Framework and Server Runtime
Created 2019-05-18
102 commits to master branch, last one 3 years ago
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
Created 2023-06-14
554 commits to main branch, last one 5 months ago
Distributed training (multi-node) of a Transformer model
Created 2023-12-08
87 commits to main branch, last one about a month ago
SC23 Deep Learning at Scale Tutorial Material
Created 2023-08-28
85 commits to main branch, last one 6 months ago