52 results found Sort:

732
12.0k
cc-by-sa-4.0
116
Machine Learning Engineering Open Book
Created 2020-09-02
793 commits to master branch, last one a day ago
642
2.8k
apache-2.0
89
A DSL for data-driven computational pipelines
Created 2013-03-27
6,771 commits to master branch, last one 2 days ago
673
2.8k
other
130
Slurm: A Highly Scalable Workload Manager
Created 2011-06-20
65,652 commits to master branch, last one 23 hours ago
Python 3.8+ toolbox for submitting jobs to Slurm
Created 2020-04-24
147 commits to main branch, last one 3 months ago
240
902
apache-2.0
56
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
Created 2015-03-30
6,159 commits to master branch, last one 8 days ago
119
500
gpl-2.0
22
Python Interface to Slurm
Created 2011-11-20
742 commits to main branch, last one 3 days ago
100
356
gpl-3.0
37
Open source web interface for Slurm HPC clusters
Created 2015-02-27
410 commits to main branch, last one 23 days ago
Create clusters of VMs on the cloud and configure them with Ansible.
Created 2013-03-20
2,187 commits to master branch, last one about a year ago
113
335
other
21
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
Created 2021-05-04
763 commits to main branch, last one 3 days ago
A Slurm cluster using docker-compose
Created 2017-09-11
45 commits to main branch, last one 2 months ago
Best practices & guides on how to write distributed pytorch training code
Created 2024-07-31
238 commits to main branch, last one 5 days ago
A scheduler for GPU/CPU tasks
Created 2020-10-29
233 commits to master branch, last one 11 months ago
Simplify HPC and Batch workloads on Azure
This repository has been archived (exclude archived)
Created 2016-08-26
923 commits to master branch, last one about a year ago
A Cross-Platform, Multi-Cloud High-Performance Computing Platform
Created 2023-10-15
2,443 commits to master branch, last one 2 months ago
Prometheus exporter for performance metrics from Slurm.
Created 2017-04-18
148 commits to master branch, last one 2 years ago
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
Created 2023-07-16
544 commits to main branch, last one 13 hours ago
119
228
apache-2.0
20
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
Created 2020-02-18
6,738 commits to main branch, last one 15 days ago
51
173
lgpl-3.0
10
Tools for computation on batch systems
Created 2015-10-26
930 commits to master branch, last one about a year ago
30
173
other
4
SEML: Slurm Experiment Management Library
Created 2019-10-13
634 commits to master branch, last one about a month ago
27
147
apache-2.0
8
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Created 2016-06-18
1,212 commits to master branch, last one 4 months ago
A simple Snakemake profile for Slurm without --cluster-config
Created 2021-05-01
59 commits to main branch, last one 6 months ago
14
137
apache-2.0
6
Run Slurm in Kubernetes
Created 2024-06-04
681 commits to main branch, last one 4 days ago
TUI for the Slurm Workload Manager
Created 2023-01-29
68 commits to main branch, last one 3 months ago
Funnel is a toolkit for distributed task execution via a simple, standard API.
Created 2017-02-03
1,429 commits to master branch, last one a day ago
66
117
other
18
A collection of various resources, examples, and executables for the general NREL HPC user community's benefit. Use the following website for accessing documentation.
Created 2019-01-07
506 commits to master branch, last one about a month ago
Slurm-Mail is a drop in replacement for Slurm's e-mails to give users much more information about their jobs compared to the standard Slurm e-mails.
Created 2018-02-11
890 commits to main branch, last one 21 days ago
Slurm Docker Container on CentOS 7
Created 2016-12-11
139 commits to main branch, last one 8 months ago
:rocket: R package future.batchtools: A Future API for Parallel and Distributed Processing using batchtools
Created 2016-04-17
484 commits to develop branch, last one about a year ago
11
81
mit
0
A Slurm dashboard for the terminal.
Created 2020-05-28
197 commits to main branch, last one 3 years ago
A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/CLAIRE-Labo/no-representation-no-trust
Created 2022-10-15
63 commits to main branch, last one 2 months ago