Statistics for topic pyspark

RepositoryStats tracks 630,459 Github repositories, of these 116 are tagged with the pyspark topic. The most common primary language for repositories using this topic is Python (54). Other languages include: Jupyter Notebook (30)

Stargazers over time for topic pyspark

90908080707060605050404030302020101000202020202021202120222022202320232024202420252025

Most starred repositories for topic pyspark (view more)

623
5.6k
apache-2.0
83
the portable Python dataframe library
Created 2015-04-17
9,512 commits to main branch, last one 17 hours ago
843
5.1k
mit
141
Simple and Distributed Machine Learning
Created 2017-06-05
1,663 commits to master branch, last one 16 days ago
1.2k
3.3k
apache-2.0
263
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Created 2019-07-23
4,233 commits to master branch, last one 11 days ago
Implementing best practices for PySpark ETL jobs and applications.
Created 2017-12-28
36 commits to master branch, last one 3 years ago
282
1.8k
apache-2.0
37
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
Created 2018-06-15
691 commits to master branch, last one about a year ago

Trending repositories for topic pyspark (view more)