Statistics for topic apache-spark

RepositoryStats tracks 641,701 GitHub repositories, of which 115 are tagged with the apache-spark topic. The most common primary language for repositories using this topic is Python (29). Other languages include Scala (28) and Jupyter Notebook (11).

Stargazers over time for topic apache-spark

Most starred repositories for topic apache-spark

mlflow mlflow

4.5k

20.2k

apache-2.0

307

Open source platform for the machine learning lifecycle

ai ml mlflow apache-spark machine-learning model-management

Created 2018-06-05

7,521 commits to master branch, last one 20 hours ago

SynapseML microsoft

842

5.1k

mit

142

Simple and Distributed Machine Learning

Created 2017-06-05

1,676 commits to master branch, last one 2 days ago

lakeFS treeverse

371

4.6k

apache-2.0

lakeFS - Data version control for your data lake | Git for data

go aws-s3 golang lakefs datalake data-lake datalakes apache-spark data-quality git-for-data azure-storage object-storage apache-sparksql data-versioning data-engineering hadoop-filesystem azure-blob-storage data-version-control google-cloud-storage

Created 2019-09-12

5,771 commits to master branch, last one 10 hours ago