52 results found Sort:

771
12.6k
cc-by-sa-4.0
117
Machine Learning Engineering Open Book
Created 2020-09-02
915 commits to master branch, last one 2 days ago
655
2.8k
apache-2.0
90
A DSL for data-driven computational pipelines
Created 2013-03-27
6,830 commits to master branch, last one 12 hours ago
677
2.8k
other
130
Slurm: A Highly Scalable Workload Manager
Created 2011-06-20
66,067 commits to master branch, last one 2 days ago
Python 3.8+ toolbox for submitting jobs to Slurm
Created 2020-04-24
147 commits to main branch, last one 4 months ago
240
907
apache-2.0
55
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
Created 2015-03-30
6,169 commits to master branch, last one 3 days ago
120
502
gpl-2.0
22
Python Interface to Slurm
Created 2011-11-20
756 commits to main branch, last one 22 hours ago
100
365
gpl-3.0
37
Open source web interface for Slurm HPC clusters
Created 2015-02-27
475 commits to main branch, last one 3 days ago
Best practices & guides on how to write distributed pytorch training code
Created 2024-07-31
270 commits to main branch, last one 10 days ago
118
341
other
20
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Created 2021-05-04
768 commits to main branch, last one 2 days ago
A Slurm cluster using docker-compose
Created 2017-09-11
45 commits to main branch, last one 4 months ago
Create clusters of VMs on the cloud and configure them with Ansible.
Created 2013-03-20
2,187 commits to master branch, last one about a year ago
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
Created 2023-07-16
657 commits to main branch, last one 2 days ago
A scheduler for GPU/CPU tasks
Created 2020-10-29
233 commits to master branch, last one about a year ago
Simplify HPC and Batch workloads on Azure
This repository has been archived (exclude archived)
Created 2016-08-26
923 commits to master branch, last one about a year ago
A Cross-Platform, Multi-Cloud High-Performance Computing Platform
Created 2023-10-15
2,444 commits to master branch, last one 28 days ago
Prometheus exporter for performance metrics from Slurm.
Created 2017-04-18
148 commits to master branch, last one 2 years ago
124
231
apache-2.0
19
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
Created 2020-02-18
6,749 commits to main branch, last one 6 days ago
31
177
other
4
SEML: Slurm Experiment Management Library
Created 2019-10-13
634 commits to master branch, last one 2 months ago
51
174
lgpl-3.0
10
Tools for computation on batch systems
Created 2015-10-26
930 commits to master branch, last one about a year ago
16
156
apache-2.0
6
Run Slurm in Kubernetes
Created 2024-06-04
892 commits to dev branch, last one 2 days ago
A simple Snakemake profile for Slurm without --cluster-config
Created 2021-05-01
59 commits to main branch, last one 7 months ago
28
148
apache-2.0
8
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Created 2016-06-18
1,233 commits to master branch, last one 14 days ago
TUI for the Slurm Workload Manager
Created 2023-01-29
68 commits to main branch, last one 5 months ago
Funnel is a toolkit for distributed task execution via a simple, standard API.
Created 2017-02-03
1,431 commits to master branch, last one 3 days ago
67
120
other
18
A collection of various resources, examples, and executables for the general NREL HPC user community's benefit. Use the following website for accessing documentation.
Created 2019-01-07
510 commits to master branch, last one 12 days ago
39
101
gpl-3.0
4
Slurm-Mail is a drop in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slurm e-mails.
Created 2018-02-11
892 commits to main branch, last one 6 days ago
Slurm Docker Container on CentOS 7
Created 2016-12-11
139 commits to main branch, last one 10 months ago
:rocket: R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
Created 2016-04-17
484 commits to develop branch, last one about a year ago
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust
Created 2022-10-15
71 commits to main branch, last one 17 days ago
11
81
mit
0
A Slurm dashboard for the terminal.
Created 2020-05-28
197 commits to main branch, last one 3 years ago