Trending repositories for topic big-data-analytics
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
PySpark-Tutorial provides basic algorithms using PySpark
Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks.
Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.
A data-driven method combining symbolic regression and compressed sensing for accurate & interpretable models.
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.