Trending repositories for topic big-data-analytics
One-line-of-code data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Use CH-UI to work with your data in a self-hosted ClickHouse instance through a user-friendly interface. CH-UI is a modern and feature-rich user interface for ClickHouse databases. It offers an intuitive platfor...
The binary build of LEO CDP Free Edition for training purposes
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
PySpark-Tutorial provides basic algorithms using PySpark
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real-world tasks.
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.