Trending repositories for topic big-data-analytics

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+5)

gpl-3.0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+1)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+1)

mit

Last 3 days (relative gain)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+1.0%)

gpl-3.0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+0.1%)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+0.0%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+27)

gpl-3.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+5)

mit

caioricciuti/ch-ui

Use CH-UI to work with your data from Click House self-hosted with a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platfor...

147 (+3)

mit

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+1)

apache-2.0

Last week (relative gain)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+6%)

gpl-3.0

caioricciuti/ch-ui

147 (+2%)

mit

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+0.1%)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+0.0%)

mit

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+53)

gpl-3.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+45)

mit

caioricciuti/ch-ui

147 (+21)

mit

rouyang2017/SISSO

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

259 (+7)

apache-2.0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+6)

apache-2.0

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

139 (+1)

apache-2.0

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀

321 (+1)

apache-2.0

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

1,189 (+1)

Last month (relative gain)

caioricciuti/ch-ui

147 (+17%)

mit

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+11%)

gpl-3.0

rouyang2017/SISSO

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

259 (+3%)

apache-2.0

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

139 (+0.7%)

apache-2.0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+0.7%)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+0.4%)

mit

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀

321 (+0.3%)

apache-2.0

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

1,189 (+0.1%)

Last 12-months (new repositories)

caioricciuti/ch-ui

147

mit

Last 12-months (absolute gain)

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+997)

mit

caioricciuti/ch-ui

147 (+145)

mit

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

1,189 (+119)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+78)

gpl-3.0

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+64)

apache-2.0

rouyang2017/SISSO

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

259 (+63)

apache-2.0

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀

321 (+30)

apache-2.0

metatron-app/metatron-discovery

Powerful & Easy way for big data discovery

442 (+20)

apache-2.0

dongsuo/vue-data-board

A Data Analysis Board in Vue.

1,319 (+20)

trieu/leo-cdp-free-edition

The binary build of LEO CDP Free Edition for training purposes

35 (+14)

apache-2.0

panstacks/pandata

The Pandata scalable open-source analysis stack

69 (+13)

bsd-3-clause

Thomas-George-T/Movies-Analytics-in-Spark-and-Scala

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

94 (+11)

apache-2.0

drshahizan/BDM

Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.

56 (+10)

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

139 (+7)

apache-2.0

ICT-BDA/EasyML

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

1,973 (+6)

apache-2.0

XuanyouLiu/US-Real-Estate-Analysis

US Real Estate Rental Price Analysis

28 (+3)

Last 12-months (relative gain)

trieu/leo-cdp-free-edition

The binary build of LEO CDP Free Edition for training purposes

35 (+67%)

apache-2.0

rouyang2017/SISSO

A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.

259 (+32%)

apache-2.0

panstacks/pandata

The Pandata scalable open-source analysis stack

69 (+23%)

bsd-3-clause

drshahizan/BDM

Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.

56 (+22%)

MrXujiang/v6.dooring.public

可视化大屏解决方案, 提供一套可视化编辑引擎, 助力个人或企业轻松定制自己的可视化大屏应用.

515 (+18%)

gpl-3.0

Thomas-George-T/Movies-Analytics-in-Spark-and-Scala

Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.

94 (+13%)

apache-2.0

XuanyouLiu/US-Real-Estate-Analysis

US Real Estate Rental Price Analysis

28 (+12%)

mahmoudparsian/pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

1,189 (+11%)

lithops-cloud/lithops

A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀

321 (+10%)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,617 (+9%)

mit

v6d-io/v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

844 (+8%)

apache-2.0

archivesunleashed/aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

139 (+5%)

apache-2.0

metatron-app/metatron-discovery

Powerful & Easy way for big data discovery

442 (+5%)

apache-2.0

dongsuo/vue-data-board

A Data Analysis Board in Vue.

1,319 (+2%)

ICT-BDA/EasyML

Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.

1,973 (+0.3%)

apache-2.0