Trending repositories for topic exploratory-data-analysis
Business intelligence as code: build fast, interactive data visualizations in SQL and Markdown
Always know what to expect from your data.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Collection of must-read papers for data scientists and machine learning / deep learning engineers
Beautiful visualizations of how language differs among document types.
Automatically visualize your pandas dataframe via a single print! 📊 💡
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Visualize and compare datasets, target values and associations, with one line of code.
skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Automatically find issues in image datasets and practice data-centric computer vision.
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
This repository consists of assignments and projects from the iNeuron Full Stack Data Science Course
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algor...
🌝 Lottery prediction: besides following the "law of probability" and "Probability: Independent Events", there are still sayings such as "a Tail is due" or "just one more go, my luck is due to ch...
This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...
Interactively explore unstructured datasets from your dataframe.
Complete-Life-Cycle-of-a-Data-Science-Project
breadroll 🥟 is a simple, lightweight library for data processing operations written in TypeScript and powered by Bun.
In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data available on Kaggle. The main aim of the project is to uncover insi...
In this project, an RFM model is implemented to relate to customers in each segment. The project assesses data quality, performs EDA using Python, and creates a dashboard using Tableau.
An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal ...
A library for detecting problematic data segments in structured and unstructured data with a few lines of code.
Lead scoring is a powerful metric for quantifying leads, and it is nowadays used by every CRM. In this repository, we are going to take a look at the UpGrad lead scoring case study a...
A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data...
Exploratory Data Analysis on Bellabeat fitness tracker app using Python. Capstone project from Google Data Analytics Professional Certification.
Perform a survival analysis based on the time-to-event (death event) for the subjects. Compare machine learning models to assess the likelihood of death from heart failure. This can be used ...