Trending repositories for topic exploratory-data-analysis

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+99)

agpl-3.0

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+11)

apache-2.0

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+8)

mit

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+2)

mit

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+1)

mit

Last 3 days (relative gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+1.0%)

agpl-3.0

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+0.2%)

mit

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+0.1%)

apache-2.0

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+0.1%)

mit

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+0.0%)

mit

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+104)

agpl-3.0

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+23)

apache-2.0

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+17)

mit

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+8)

mit

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+8)

mit

fbdesignpro/sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

2,999 (+3)

mit

Jean-njoroge/Breast-cancer-risk-prediction

Classification of Breast Cancer diagnosis Using Support Vector Machines

245 (+1)

mit

dataprofessor/streamlit_freecodecamp

Build 12 Data Apps in Python with Streamlit

625 (+1)

Renumics/spotlight

Interactively explore unstructured datasets from your dataframe.

1,157 (+1)

mit

JasonKessler/scattertext

Beautiful visualizations of how language differs among document types.

2,288 (+1)

apache-2.0

lux-org/lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

5,263 (+1)

apache-2.0

Last week (relative gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+1%)

agpl-3.0

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+0.7%)

mit

Jean-njoroge/Breast-cancer-risk-prediction

Classification of Breast Cancer diagnosis Using Support Vector Machines

245 (+0.4%)

mit

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+0.3%)

mit

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+0.2%)

apache-2.0

dataprofessor/streamlit_freecodecamp

Build 12 Data Apps in Python with Streamlit

625 (+0.2%)

fbdesignpro/sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

2,999 (+0.1%)

mit

Renumics/spotlight

Interactively explore unstructured datasets from your dataframe.

1,157 (+0.1%)

mit

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+0.1%)

mit

JasonKessler/scattertext

Beautiful visualizations of how language differs among document types.

2,288 (+0.0%)

apache-2.0

lux-org/lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

5,263 (+0.0%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+150)

agpl-3.0

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+124)

mit

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+79)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+64)

mit

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+28)

mit

fbdesignpro/sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

2,999 (+14)

mit

lux-org/lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

5,263 (+14)

apache-2.0

aeturrell/skimpy

skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.

443 (+12)

mit

cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

1,062 (+11)

agpl-3.0

sfu-db/dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

2,137 (+10)

mit

dataprofessor/code

Compilation of R and Python programming codes on the Data Professor YouTube channel.

960 (+8)

dataprofessor/streamlit_freecodecamp

Build 12 Data Apps in Python with Streamlit

625 (+7)

JasonKessler/scattertext

Beautiful visualizations of how language differs among document types.

2,288 (+5)

apache-2.0

drshahizan/Python_EDA

This topic explains the implementation of exploratory data analysis (EDA). A total of 21 EDA case studies have been implemented using the Malaysian dataset.

183 (+4)

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

601 (+4)

mit

tommyod/KDEpy

Kernel Density Estimation in Python

603 (+4)

bsd-3-clause

latitude-dev/latitude

Developer-first embedded analytics

901 (+4)

lgpl-3.0

Renumics/spotlight

Interactively explore unstructured datasets from your dataframe.

1,157 (+4)

mit

abbas99-hub/Job-Recommendation-System

This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...

52 (+2)

Jean-njoroge/Breast-cancer-risk-prediction

Classification of Breast Cancer diagnosis Using Support Vector Machines

245 (+2)

mit

Last month (relative gain)

abbas99-hub/Job-Recommendation-System

This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...

52 (+4%)

Michel-Nguegang/PROJECT-PORTFOLIO--Superstore-Sales-SQL-Data-Analysis

In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data available on Kaggle. The main aim of the project is to uncover insi...

31 (+3%)

zhihanyue/qgridnext

Advancing QGrid, an interactive grid for exploring DataFrames in JupyterLab/Notebook

31 (+3%)

apache-2.0

aeturrell/skimpy

skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.

443 (+3%)

mit

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+3%)

mit

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+3%)

mit

drshahizan/Python_EDA

This topic explains the implementation of exploratory data analysis (EDA). A total of 21 EDA case studies have been implemented using the Malaysian dataset.

183 (+2%)

PacktWorkshops/The-Data-Analysis-Workshop

A New Interactive Approach to Learning Data Analysis

69 (+1%)

mit

PetrKorab/Arabica

Python package for text mining of time-series data

71 (+1%)

apache-2.0

tusharnankani/whatsapp-chat-data-analysis

An Exhaustive WhatsApp Chat Data Analysis.

72 (+1%)

mit

Data-Centric-AI-Community/awesome-python-for-data-science

A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data...

83 (+1%)

sankeshyadav98/Google-Advanced-Data-Analytics-Professional-Certificate

The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...

83 (+1%)

dataprofessor/streamlit_freecodecamp

Build 12 Data Apps in Python with Streamlit

625 (+1%)

cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

1,062 (+1%)

agpl-3.0

dataprofessor/code

Compilation of R and Python programming codes on the Data Professor YouTube channel.

960 (+0.8%)

Jean-njoroge/Breast-cancer-risk-prediction

Classification of Breast Cancer diagnosis Using Support Vector Machines

245 (+0.8%)

mit

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+0.8%)

apache-2.0

amanovishnu/ineuron-full-stack-data-science-assignments

This repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.

283 (+0.7%)

mit

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

601 (+0.7%)

mit

tommyod/KDEpy

Kernel Density Estimation in Python

603 (+0.7%)

bsd-3-clause

Last 12 months (new repositories)

no newly created repositories trending in the last 12 months

Last 12 months (absolute gain)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+2,213)

agpl-3.0

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+1,823)

mit

great-expectations/great_expectations

Always know what to expect from your data.

10,285 (+922)

apache-2.0

ydataai/ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

12,805 (+876)

mit

latitude-dev/latitude

Developer-first embedded analytics

901 (+786)

lgpl-3.0

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+459)

mit

lux-org/lux

Automatically visualize your pandas dataframe via a single print! 📊 💡

5,263 (+362)

apache-2.0

Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...

396 (+336)

agpl-3.0

sfu-db/dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

2,137 (+239)

mit

Renumics/spotlight

Interactively explore unstructured datasets from your dataframe.

1,157 (+178)

mit

fbdesignpro/sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

2,999 (+174)

mit

cleanlab/cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

1,062 (+153)

agpl-3.0

dataprofessor/code

Compilation of R and Python programming codes on the Data Professor YouTube channel.

960 (+102)

JasonKessler/scattertext

Beautiful visualizations of how language differs among document types.

2,288 (+94)

apache-2.0

aeturrell/skimpy

skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.

443 (+91)

mit

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

601 (+80)

mit

dataprofessor/streamlit_freecodecamp

Build 12 Data Apps in Python with Streamlit

625 (+79)

tommyod/KDEpy

Kernel Density Estimation in Python

603 (+55)

bsd-3-clause

amanovishnu/ineuron-full-stack-data-science-assignments

This repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.

283 (+54)

mit

sankeshyadav98/Google-Advanced-Data-Analytics-Professional-Certificate

The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...

83 (+39)

Last 12 months (relative gain)

latitude-dev/latitude

Developer-first embedded analytics

901 (+683%)

lgpl-3.0

Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algor...

396 (+560%)

agpl-3.0

abbas99-hub/Job-Recommendation-System

This repository contains the code and instructions to build a job recommendation system using machine learning. The system is designed to provide personalized job recommendations based on user prefere...

52 (+160%)

zhihanyue/qgridnext

Advancing QGrid, an interactive grid for exploring DataFrames in JupyterLab/Notebook

31 (+138%)

apache-2.0

PetrKorab/Arabica

Python package for text mining of time-series data

71 (+103%)

apache-2.0

sankeshyadav98/Google-Advanced-Data-Analytics-Professional-Certificate

The Google Advanced Data Analytics Certificate contains information on how to use machine learning, predictive modeling, and experimental design to collect and analyze large amounts of data, and prepa...

83 (+89%)

Michel-Nguegang/PROJECT-PORTFOLIO--Superstore-Sales-SQL-Data-Analysis

In this personal Superstore Sales SQL Data Analysis project, an exploratory data analysis was performed on the Superstore Sales Data available on Kaggle. The main aim of the project is to uncover insi...

31 (+72%)

AbhishekGit-hash/Data-Analytics-Customer-Segmentation

In this project, an RFM model is implemented to relate to customers in each segment. Data quality was assessed, EDA was performed using Python, and a dashboard was created using Tableau.

67 (+72%)

hurshd0/must-read-papers-for-ml

Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer

1,123 (+69%)

mit

devsgnr/breadroll

breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.

75 (+63%)

mit

evidence-dev/evidence

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

4,982 (+58%)

mit

pachterlab/voyager

From geospatial to spatial -omics

88 (+47%)

artistic-2.0

datamole-ai/edvart

An open-source Python library for Data Scientists & Data Analysts designed to simplify the exploratory data analysis process. Using Edvart, you can explore data sets and generate reports with minimal ...

52 (+41%)

mit

aatmunbaxi/orgroamtools

Helper library for data analysis of org-roam collections

28 (+40%)

mit

mukulsinghal001/lead-scoring-model-python

Lead Scoring is such a powerful metric when it comes to quantifying the lead & it is nowadays used by every CRM. In this repository, we are going to take a look at the UpGrad lead scoring case study a...

54 (+35%)

cleanlab/cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

10,343 (+27%)

agpl-3.0

aeturrell/skimpy

skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.

443 (+26%)

mit

Data-Centric-AI-Community/awesome-python-for-data-science

A curated list of awesome resources such as books, tutorials, courses, open-source libraries, exercises, and other materials that support Pythonistas in the making, and Pythonistas migrating into Data...

83 (+26%)

drshahizan/Python_EDA

This topic explains the implementation of exploratory data analysis (EDA). A total of 21 EDA case studies have been implemented using the Malaysian dataset.

183 (+24%)

amanovishnu/ineuron-full-stack-data-science-assignments

This repository features assignments and projects from the iNeuron full stack data science course, providing valuable resources for learners to enhance their skills and apply their knowledge.

283 (+24%)

mit