Statistics for topic exploratory-data-analysis
RepositoryStats tracks 518,986 Github repositories, of these 63 are tagged with the exploratory-data-analysis topic. The most common primary language for repositories using this topic is Jupyter Notebook (27). Other languages include: Python (18)
Stargazers over time for topic exploratory-data-analysis
Most starred repositories for topic exploratory-data-analysis (view more)
Trending repositories for topic exploratory-data-analysis (view more)
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Automatically visualize your pandas dataframe via a single print! 📊 💡
Always know what to expect from your data.
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Automatically visualize your pandas dataframe via a single print! 📊 💡
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
Always know what to expect from your data.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
Automatically visualize your pandas dataframe via a single print! 📊 💡
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
Always know what to expect from your data.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
A library for detecting problematic data segments in structured and unstructured data with few lines of code.
The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...
An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal ...
Business intelligence as code: build fast, interactive data visualizations in pure SQL and markdown
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Always know what to expect from your data.
Interactively explore unstructured datasets from your dataframe.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...
Interactively explore unstructured datasets from your dataframe.
A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data...
In this project, a RFM model is implemented to relate to customers in each segment. Assessed the Data Quality, performed EDA using Python and created Dashboard using Tableau.