Trending repositories for topic exploratory-data-analysis
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Always know what to expect from your data.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Visualize and compare datasets, target values and associations, with one line of code.
Beautiful visualizations of how language differs among document types.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Visualize and compare datasets, target values and associations, with one line of code.
Beautiful visualizations of how language differs among document types.
Always know what to expect from your data.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Always know what to expect from your data.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Beautiful visualizations of how language differs among document types.
Visualize and compare datasets, target values and associations, with one line of code.
Automatically visualize your pandas dataframe via a single print! 📊 💡
A day to day plan for this challenge. Covers both theoritical and practical aspects
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
A day to day plan for this challenge. Covers both theoritical and practical aspects
Beautiful visualizations of how language differs among document types.
Always know what to expect from your data.
Visualize and compare datasets, target values and associations, with one line of code.
Compilation of R and Python programming codes on the Data Professor YouTube channel.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Automatically visualize your pandas dataframe via a single print! 📊 💡
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Always know what to expect from your data.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Automatically visualize your pandas dataframe via a single print! 📊 💡
Beautiful visualizations of how language differs among document types.
Visualize and compare datasets, target values and associations, with one line of code.
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Classification of Breast Cancer diagnosis Using Support Vector Machines
:full_moon_with_face: Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to ch...
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data available on Kaggle. The main aim of the project is to uncover insi...
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...
An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal ...
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
In this project, a RFM model is implemented to relate to customers in each segment. Assessed the Data Quality, performed EDA using Python and created Dashboard using Tableau.
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data...
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Classification of Breast Cancer diagnosis Using Support Vector Machines
Solution of the Titanic Kaggle competition
:full_moon_with_face: Lottery prediction besides of following "law of proability","Probability: Independent Events", there are still "Saying "a Tail is due", or "just one more go, my luck is due to ch...
This topic explains about the implementation of exploratory data analysis (EDA). A total of 21 EDA case studies have been implemented using the Malaysian dataset.
Always know what to expect from your data.
Advancing QGrid, an interactive grid for exploring DataFrames in JupyterLab/Notebook
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Always know what to expect from your data.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
Automatically visualize your pandas dataframe via a single print! 📊 💡
Interactively explore unstructured datasets from your dataframe.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
Visualize and compare datasets, target values and associations, with one line of code.
Automatically find issues in image datasets and practice data-centric computer vision.
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Beautiful visualizations of how language differs among document types.
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Complete-Life-Cycle-of-a-Data-Science-Project
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data available on Kaggle. The main aim of the project is to uncover insi...
In this project, a RFM model is implemented to relate to customers in each segment. Assessed the Data Quality, performed EDA using Python and created Dashboard using Tableau.
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal ...
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Interactively explore unstructured datasets from your dataframe.
Lead Scoring is such a powerful metric when it comes to quantifying the lead & it is nowadays used by every CRM. In this repository, we are going to take a look at the UpGrad lead scoring case study a...
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
this repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Perform a survival analysis based on the time-to-event (death event) for the subjects. Compare machine learning models to assess the likelihood of a death by heart failure condition. This can be used ...