hoangsonww / End-to-End-Data-Pipeline

📈 A scalable, production-ready data pipeline for real-time streaming & batch processing, integrating Kafka, Spark, Airflow, AWS, Kubernetes, and MLflow. Supports end-to-end data ingestion, transformation, storage, monitoring, and AI/ML serving with CI/CD automation using Terraform & GitHub Actions.

Date Created 2025-02-15 (2 months ago)
Commits 83 (last one about a month ago)
Stargazers 26 (-1 this week)
Watchers 18 (0 this week)
Forks 20
License mit
Ranking

RepositoryStats indexes 643,432 repositories, of these hoangsonww/End-to-End-Data-Pipeline is ranked #629,644 (2nd percentile) for total stargazers, and #120,982 for total watchers. Github reports the primary language for this repository as Python, for repositories using this language it is ranked #129,229/132,390.

hoangsonww/End-to-End-Data-Pipeline is also tagged with popular topics, for these it's ranked: python (#23,412/23716),  docker (#6,602/6696),  kubernetes (#4,095/4129),  postgresql (#2,038/2074),  sql (#1,900/1925),  terraform (#1,269/1290),  kafka (#862/882),  prometheus (#841/853),  elasticsearch (#773/778),  grafana (#565/575),  spark (#552/559),  apache (#279/281),  hadoop (#186/188),  flink (#157/157),  airflow (#154/157)

Star History

Github stargazers over time

30302525202015151010550020 Feb20 FebMar '25Mar '2510 Mar10 Mar20 Mar20 MarApr '25Apr '2510 Apr10 Apr20 Apr20 Apr

Watcher History

Github watchers over time, collection started in '23

19191919191918.518.518181818181803 Apr03 Apr04 Apr04 Apr05 Apr05 Apr06 Apr06 Apr07 Apr07 Apr08 Apr08 Apr09 Apr09 Apr10 Apr10 Apr11 Apr11 Apr12 Apr12 Apr13 Apr13 Apr14 Apr14 Apr15 Apr15 Apr16 Apr16 Apr17 Apr17 Apr18 Apr18 Apr19 Apr19 Apr20 Apr20 Apr21 Apr21 Apr22 Apr22 Apr23 Apr23 Apr24 Apr24 Apr

Recent Commit History

83 commits on the default branch (master) since jan '22

9090808070706060505040403030202010100020 Feb20 FebMar '25Mar '2510 Mar10 Mar20 Mar20 MarApr '25Apr '2510 Apr10 Apr20 Apr20 Apr

Yearly Commits

Commits to the default branch (master) per year

2222111111000020242024

Issue History

No issues have been posted

Languages

The primary language is Python but there's also others...

PythonPythonJupyter NotebookJupyter NotebookHCLHCLDockerfileDockerfileShellShell

updated: 2025-04-24 @ 06:33pm, id: 933085039 / R_kgDON52_bw