Statistics for topic pyspark

RepositoryStats tracks 584,792 Github repositories, of these 107 are tagged with the pyspark topic. The most common primary language for repositories using this topic is Python (47). Other languages include: Jupyter Notebook (28)

Stargazers over time for topic pyspark

Most starred repositories for topic pyspark (view more)

598
5.3k
apache-2.0
84
the portable Python dataframe library
Created 2015-04-17
9,021 commits to main branch, last one 18 hours ago
831
5.1k
mit
146
Simple and Distributed Machine Learning
Created 2017-06-05
1,647 commits to master branch, last one 2 days ago
1.2k
3.3k
apache-2.0
264
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Created 2019-07-23
4,216 commits to master branch, last one 9 days ago
284
1.8k
apache-2.0
40
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, a...
Created 2018-06-15
691 commits to master branch, last one 11 months ago
A curated list of awesome Apache Spark packages and resources.
Created 2016-02-01
263 commits to main branch, last one 28 days ago

Trending repositories for topic pyspark (view more)