Trending repositories for topic big-data-analytics
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Use CH-UI to work with your data from a self-hosted ClickHouse instance through a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platfor...
PySpark-Tutorial provides basic algorithms using PySpark
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
The binary build of LEO CDP Free Edition for training purposes
Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real-world tasks.