52 results found

5.8k
36.1k
mit
1.2k
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Created 2018-11-05
18 commits to main branch, last one 5 months ago
4.6k
30.2k
apache-2.0
310
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
Created 2019-02-02
2,269 commits to main branch, last one 4 days ago
5.5k
21.7k
apache-2.0
721
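PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training, plus cross-platform deployment, for deep learning and machine learning)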
Created 2016-08-15
48,655 commits to develop branch, last one 18 hours ago
2.8k
11.6k
unknown
103
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Sear...
Created 2021-02-05
4,884 commits to develop branch, last one a day ago
404
5.8k
apache-2.0
67
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Created 2021-08-11
1,798 commits to master branch, last one 14 hours ago
769
4.1k
apache-2.0
114
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on a...
Created 2020-07-21
12,120 commits to master branch, last one 18 days ago
365
3.9k
apache-2.0
56
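Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.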
Created 2021-10-28
711 commits to main branch, last one 11 months ago
484
3.6k
other
85
A high performance and generic framework for distributed DNN training
Created 2019-06-25
432 commits to master branch, last one 2 years ago
531
3.5k
apache-2.0
174
Fast and flexible AutoML with learning guarantees.
Created 2018-06-28
440 commits to master branch, last one 2 years ago
344
3.0k
apache-2.0
45
Training and serving large-scale neural networks with auto parallelization.
Created 2021-02-22
668 commits to main branch, last one 5 months ago
346
2.9k
apache-2.0
81
Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
Created 2020-04-07
7,807 commits to main branch, last one 15 hours ago
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
Created 2020-02-27
571 commits to master branch, last one 5 months ago
275
1.2k
unknown
58
Library for Fast and Flexible Human Pose Estimation
Created 2018-08-25
538 commits to master branch, last one 2 years ago
DLRover: An Automatic Distributed Deep Learning System
Created 2022-06-24
2,358 commits to master branch, last one a day ago
340
980
apache-2.0
33
DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.
Created 2021-12-24
65,622 commits to main branch, last one 9 days ago
Efficient Deep Learning Systems course materials (HSE, YSDA)
Created 2021-12-06
149 commits to main branch, last one 2 months ago
34
409
apache-2.0
8
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Created 2023-04-27
217 commits to main branch, last one 4 months ago
75
407
apache-2.0
11
Resource-adaptive cluster scheduler for deep learning training.
Created 2020-08-23
123 commits to master branch, last one about a year ago
55
376
apache-2.0
43
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
Created 2021-10-25
348 commits to main branch, last one about a month ago
97
304
other
19
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Created 2021-05-04
710 commits to main branch, last one a day ago
58
289
apache-2.0
23
Fast and Adaptive Distributed Machine Learning for TensorFlow, PyTorch and MindSpore.
Created 2018-12-29
384 commits to main branch, last one 3 months ago
41
282
mit
13
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
Created 2020-06-03
799 commits to master branch, last one about a month ago
11
264
mit
8
A Jax-based library for designing and training transformer models from scratch.
Created 2023-08-22
155 commits to main branch, last one 5 days ago
49
253
apache-2.0
13
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
Created 2022-02-23
21 commits to main branch, last one about a year ago
[ICLR 2018] Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Created 2020-06-22
1 commit to master branch, last one 2 years ago
15
201
unknown
9
universal visual model trained on LAION-400M
Created 2023-02-15
11 commits to main branch, last one 9 months ago
67
155
apache-2.0
4
OpenKS - a domain-generalizable knowledge learning and computing engine
Created 2020-05-25
403 commits to master branch, last one 11 months ago
33
144
apache-2.0
21
Paddle Large Scale Classification Tools, supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Models include ResNet, ViT, Swin, DeiT, CaiT, FaceViT, MoCo, MAE, ConvMAE, CAE.
Created 2019-12-13
172 commits to master branch, last one 12 months ago
9
134
apache-2.0
11
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
Created 2023-06-06
36 commits to main branch, last one about a month ago