Trending repositories for topic flink

Last 3 days (new repositories)

no newly created repositories trending in the last 3 days

Last 3 days (absolute gain)

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+7)

apache-2.0

apache/flink

Apache Flink

24,651 (+5)

apache-2.0

wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

10,007 (+4)

birdLark/LarkMidTable

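LarkMidTable is a one-stop open-source data middle platform covering platform infrastructure, data governance, data development, monitoring and alerting, data services, and data visualization; it is a product that efficiently empowers data front ends and provides data services.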

1,905 (+4)

apache-2.0

Mrkuhuo/data-warehouse-learning

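[Latest 2025 edition] Design and hands-on code for building a real-time data warehouse, offline data warehouse, and data lake for a big data analytics e-commerce system. Components covered: #flink #paimon #doris #seatunnel #dolphinscheduler #datart #dinky #hudi #iceberg.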

750 (+4)

artistic-2.0

geekyouth/SZT-bigdata

Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

2,334 (+3)

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+3)

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+3)

apache-2.0

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+3)

apache-2.0

apache/zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

6,471 (+3)

apache-2.0

zhisheng17/flink-learning

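flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink getting started, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus large-scale project case studies of Flink in production (PV/UV, log storage, real-time deduplication of tens of billions of records, ...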

14,701 (+3)

apache-2.0

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+3)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+2)

collabH/bigdata-growth

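A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.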

1,569 (+2)

mit

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+2)

apache-2.0

SplitfireUptown/datalinkx

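🔥🔥DatalinkX is a data synchronization system for heterogeneous data sources. It supports incremental or full synchronization of massive data sets and data flow between sources such as HTTP, Oracle, MySQL, and ES; supports intermediate transform operators such as SQL operators and large-model operators; is built on the Flink and Seatunnel engines; and provides flow task management, task cascade configuration, task log collection, and other features🔥🔥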

223 (+1)

apache-2.0

fancyChuan/bigdata-hub

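A knowledge system for data infrastructure and big data technology, covering mainstream frameworks such as hadoop, hive, spark, and flink and their related frameworks, as well as data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more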

361 (+1)

DTStack/chunjun

A data integration framework

4,037 (+1)

apache-2.0

water8394/flink-recommandSystem-demo

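🚁🚀A real-time product recommendation system built on Flink. flink computes product popularity and caches it in redis, analyzes log data, and stores profile tags and real-time records in Hbase. When a user sends a recommendation request, the popularity ranking is re-sorted according to the user profile, related products are added to each product in the newly generated list using two recommendation modules (collaborative filtering and tags), and the new user list is returned.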

4,356 (+1)

zq2599/blog_demos

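The github of programmer Xinchen (欣宸), a CSDN blog expert: a detailed categorization and index of more than six hundred original articles with the corresponding source code, covering Java, Docker, Kubernetes, DevOPS, and more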

4,636 (+1)

apache-2.0

Last 3 days (relative gain)

Mrkuhuo/data-warehouse-learning

750 (+0.5%)

artistic-2.0

SplitfireUptown/datalinkx

223 (+0.5%)

apache-2.0

fancyChuan/bigdata-hub

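A knowledge system for data infrastructure and big data technology, covering mainstream frameworks such as hadoop, hive, spark, and flink and their related frameworks, as well as data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more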

361 (+0.3%)

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+0.3%)

apache-2.0

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+0.3%)

apache-2.0

birdLark/LarkMidTable

1,905 (+0.2%)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+0.2%)

geekyouth/SZT-bigdata

Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

2,334 (+0.1%)

collabH/bigdata-growth

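A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.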

1,569 (+0.1%)

mit

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+0.1%)

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+0.1%)

apache-2.0

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+0.1%)

apache-2.0

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+0.1%)

apache-2.0

apache/zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

6,471 (+0.0%)

apache-2.0

wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

10,007 (+0.0%)

DTStack/chunjun

A data integration framework

4,037 (+0.0%)

apache-2.0

water8394/flink-recommandSystem-demo

4,356 (+0.0%)

zq2599/blog_demos

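The github of programmer Xinchen (欣宸), a CSDN blog expert: a detailed categorization and index of more than six hundred original articles with the corresponding source code, covering Java, Docker, Kubernetes, DevOPS, and more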

4,636 (+0.0%)

apache-2.0

zhisheng17/flink-learning

14,701 (+0.0%)

apache-2.0

apache/flink

Apache Flink

24,651 (+0.0%)

apache-2.0

Last week (new repositories)

no newly created repositories trending in the last week

Last week (absolute gain)

apache/flink

Apache Flink

24,651 (+20)

apache-2.0

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+15)

apache-2.0

wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

10,007 (+12)

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+12)

apache-2.0

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+11)

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+10)

apache-2.0

zhisheng17/flink-learning

14,701 (+9)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+8)

Mrkuhuo/data-warehouse-learning

750 (+7)

artistic-2.0

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+7)

apache-2.0

birdLark/LarkMidTable

1,905 (+6)

apache-2.0

SplitfireUptown/datalinkx

223 (+5)

apache-2.0

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+5)

apache-2.0

DTStack/chunjun

A data integration framework

4,037 (+5)

apache-2.0

water8394/flink-recommandSystem-demo

4,356 (+5)

fancyChuan/bigdata-hub

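A knowledge system for data infrastructure and big data technology, covering mainstream frameworks such as hadoop, hive, spark, and flink and their related frameworks, as well as data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more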

361 (+4)

geekyouth/SZT-bigdata

Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

2,334 (+4)

zq2599/blog_demos

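The github of programmer Xinchen (欣宸), a CSDN blog expert: a detailed categorization and index of more than six hundred original articles with the corresponding source code, covering Java, Docker, Kubernetes, DevOPS, and more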

4,636 (+4)

apache-2.0

melin/superior-sql-parser

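An antlr4-based SQL parser for multiple databases that extracts metadata from SQL and can be used in many data platform scenarios: extracting metadata from ddl statements, sql permission checks, table-level lineage, sql syntax validation, and so on. Supports spark, flink, gauss, starrocks, Oracle, MYSQL, Postgresql, sqlserver, db2, and others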

332 (+3)

apache-2.0

ittqqzz/ECommerceRecommendSystem

Real-time big data product recommendation system. Front end: Vue + TypeScript + ElementUI; back end: Spring + Spark

463 (+3)

Last week (relative gain)

SplitfireUptown/datalinkx

223 (+2%)

apache-2.0

fancyChuan/bigdata-hub

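A knowledge system for data infrastructure and big data technology, covering mainstream frameworks such as hadoop, hive, spark, and flink and their related frameworks, as well as data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more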

361 (+1%)

Mrkuhuo/data-warehouse-learning

750 (+0.9%)

artistic-2.0

melin/superior-sql-parser

332 (+0.9%)

apache-2.0

ittqqzz/ECommerceRecommendSystem

Real-time big data product recommendation system. Front end: Vue + TypeScript + ElementUI; back end: Spring + Spark

463 (+0.7%)

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+0.6%)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+0.6%)

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+0.6%)

apache-2.0

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+0.4%)

apache-2.0

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+0.4%)

xl-xueling/xl-lighthouse

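A new-generation real-time computing foundation with computing performance exceeding flink/spark by 100x. XL-LightHouse is a general-purpose streaming big data statistics system that supports very large data volumes and very high concurrency [a standalone version is also supported]. Common use cases include PV and UV statistics; e-commerce sales and ordering-user statistics; log volume statistics; API call volume, error count, and latency statistics; and server operations monitoring. The system supports multidimensional statistics and all kinds of complex filtering conditions and logical checks, with one-click deployment and one-line integration, making it easy to achieve full-pipeline business...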

298 (+0.3%)

apache-2.0

kamu-data/kamu-cli

Next-generation decentralized data lakehouse and a multi-party stream processing network

308 (+0.3%)

birdLark/LarkMidTable

1,905 (+0.3%)

apache-2.0

chainbase-labs/manuscript-core

Manuscript is a revolutionary blockchain data streaming framework. With Manuscript, you can seamlessly integrate on-chain and off-chain data into target data storage for unrestricted querying and anal...

650 (+0.3%)

apache-2.0

collabH/bigdata-growth

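A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.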

1,569 (+0.3%)

mit

ververica/flink-sql-cookbook

The Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform a...

886 (+0.2%)

apache-2.0

WeBankFinTech/Exchangis

Exchangis is a lightweight, highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources

446 (+0.2%)

apache-2.0

geekyouth/SZT-bigdata

Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

2,334 (+0.2%)

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+0.2%)

apache-2.0

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+0.1%)

apache-2.0

Last month (new repositories)

no newly created repositories trending in the last month

Last month (absolute gain)

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+117)

apache-2.0

apache/flink

Apache Flink

24,651 (+107)

apache-2.0

wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

10,007 (+72)

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+51)

apache-2.0

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+47)

apache-2.0

zhisheng17/flink-learning

14,701 (+46)

apache-2.0

Mrkuhuo/data-warehouse-learning

750 (+43)

artistic-2.0

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+41)

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+37)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+30)

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+29)

apache-2.0

birdLark/LarkMidTable

1,905 (+23)

apache-2.0

chainbase-labs/manuscript-core

650 (+21)

apache-2.0

SplitfireUptown/datalinkx

223 (+19)

apache-2.0

water8394/flink-recommandSystem-demo

4,356 (+19)

collabH/bigdata-growth

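A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.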

1,569 (+18)

mit

alibaba/SREWorks

Cloud Native DataOps & AIOps Platform

1,859 (+15)

apache-2.0

zq2599/blog_demos

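The github of programmer Xinchen (欣宸), a CSDN blog expert: a detailed categorization and index of more than six hundred original articles with the corresponding source code, covering Java, Docker, Kubernetes, DevOPS, and more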

4,636 (+15)

apache-2.0

TuGraph-family/tugraph-analytics

GeaFlow: A Streaming Graph Compute Engine.

666 (+14)

apache-2.0

apache/flink-kubernetes-operator

Apache Flink Kubernetes Operator

854 (+14)

apache-2.0

Last month (relative gain)

hexnn/Stark

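An easy-to-use, ultra-high-performance big data governance engine built on Spark+SparkMLlib+Debezium, suited to unified batch and stream data integration and data analytics; supports machine learning models, CDC real-time data capture, data modeling, algorithm modeling, and OLAP data analysis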

27 (+29%)

jaehyeon-kim/flink-demos

Apache Flink (Pyflink) and Related Projects

35 (+13%)

Mrkuhuo/bigdata_learning

Learning code for big data components

55 (+12%)

SplitfireUptown/datalinkx

223 (+9%)

apache-2.0

aws-samples/amazon-managed-service-for-apache-flink-examples

Collection of code examples for Amazon Managed Service for Apache Flink

50 (+6%)

mit-0

pathwaycom/pathway-benchmarks

Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams

68 (+6%)

mit

Mrkuhuo/data-warehouse-learning

750 (+6%)

artistic-2.0

lhq-123/Spark-Flink-DataWarehouse

A full-pipeline data warehouse based on Flink+Kafka, covering both real-time and offline

37 (+6%)

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+5%)

apache-2.0

mikeroyal/Apache-Flink-Guide

Apache Flink Guide

57 (+4%)

chloro-pn/tunnel

Tunnel is a Pipeline Execution Engine based on C++20 coroutine

29 (+4%)

apache-2.0

fancyChuan/bigdata-hub

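A knowledge system for data infrastructure and big data technology, covering mainstream frameworks such as hadoop, hive, spark, and flink and their related frameworks, as well as data middle platforms, data lakes, data governance, data warehouse construction, digital transformation, and more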

361 (+3%)

chainbase-labs/manuscript-core

650 (+3%)

apache-2.0

IndustryFusion/DigitalTwin

This repository contains the ingredients for the Digital Twin Concept of Industry Fusion.

36 (+3%)

apache-2.0

justdoitMr/rzf.github.io

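✏️[Computer science fundamentals + java basics + big data basics and advanced topics + interview guide] A project covering most of the core knowledge in computer science fundamentals, java, big data, and interview preparation. Study, interview, and improve together!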

73 (+3%)

tlhhup/litemall-dw

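A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking; a data warehouse (five layers), real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.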

111 (+3%)

apache/flink-connector-elasticsearch

Apache Flink connector for ElasticSearch

76 (+3%)

apache-2.0

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+3%)

apache-2.0

LB-Yu/data-systems-learning

Learning summary and examples about data systems.

40 (+3%)

xl-xueling/xl-lighthouse

298 (+2%)

apache-2.0

Last 12-months (new repositories)

chainbase-labs/manuscript-core

650

apache-2.0

hexnn/Stark

Last 12-months (absolute gain)

apache/flink

Apache Flink

24,651 (+1,645)

apache-2.0

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

2,694 (+956)

apache-2.0

apache/flink-cdc

Flink CDC is a streaming data integration tool

6,003 (+884)

apache-2.0

wangzhiwubigdata/God-Of-BigData

Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...

10,007 (+828)

Mrkuhuo/data-warehouse-learning

750 (+746)

artistic-2.0

chainbase-labs/manuscript-core

650 (+649)

apache-2.0

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

3,355 (+598)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+538)

zhisheng17/flink-learning

14,701 (+517)

apache-2.0

MoRan1607/BigDataGuide

Big data learning: learn big data from scratch, with study videos for every stage of learning and interview materials

2,855 (+393)

lakesoul-io/LakeSoul

LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.

2,655 (+375)

apache-2.0

zq2599/blog_demos

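The github of programmer Xinchen (欣宸), a CSDN blog expert: a detailed categorization and index of more than six hundred original articles with the corresponding source code, covering Java, Docker, Kubernetes, DevOPS, and more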

4,636 (+367)

apache-2.0

collabH/bigdata-growth

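A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.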

1,569 (+342)

mit

birdLark/LarkMidTable

1,905 (+282)

apache-2.0

datavane/datavines

Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.

596 (+280)

apache-2.0

datavane/tis

Support agile DataOps Based on Flink, DataX and Flink-CDC, Chunjun with Web-UI

1,115 (+255)

apache-2.0

xl-xueling/xl-lighthouse

298 (+250)

apache-2.0

water8394/flink-recommandSystem-demo

4,356 (+245)

apache/zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

6,471 (+224)

apache-2.0

geekyouth/SZT-bigdata

Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟

2,334 (+223)

Last 12-months (relative gain)

Mrkuhuo/data-warehouse-learning

750 (+18,650%)

artistic-2.0

xl-xueling/xl-lighthouse

298 (+521%)

apache-2.0

jaehyeon-kim/flink-demos

Apache Flink (Pyflink) and Related Projects

35 (+250%)

SplitfireUptown/datalinkx

223 (+201%)

apache-2.0

aws-samples/amazon-managed-service-for-apache-flink-examples

Collection of code examples for Amazon Managed Service for Apache Flink

50 (+194%)

mit-0

Mrkuhuo/bigdata_learning

Learning code for big data components

55 (+189%)

pathwaycom/pathway-benchmarks

Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams

68 (+152%)

mit

1ambda/lakehouse

Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)

52 (+148%)

justdoitMr/rzf.github.io

73 (+115%)

decodableco/examples

🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Postgres.

67 (+97%)

apache-2.0

lhq-123/Spark-Flink-DataWarehouse

A full-pipeline data warehouse based on Flink+Kafka, covering both real-time and offline

37 (+95%)

wzqwtt/BigData

A beginner's big data study notes ⭐

42 (+91%)

datavane/datavines

Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.

596 (+89%)

apache-2.0

mikeroyal/Apache-Flink-Guide

Apache Flink Guide

57 (+84%)

apache/flink-connector-aws

Apache flink

64 (+73%)

apache-2.0

apache/flink-connector-jdbc

Apache flink

145 (+73%)

apache-2.0

OBenner/data-engineering-interview-questions

More than 2000+ Data engineer interview questions.

1,285 (+72%)

IndustryFusion/DigitalTwin

This repository contains the ingredients for the Digital Twin Concept of Industry Fusion.

36 (+71%)

apache-2.0

melin/superior-sql-parser

332 (+69%)

apache-2.0

apache/flink-connector-hbase

Apache flink

31 (+63%)

apache-2.0
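
How the relative gain relates to the absolute gain

The listing does not state how the relative-gain percentages are derived. The figures shown are consistent with dividing the absolute gain by the star count at the start of the period (current stars minus the gain). The sketch below assumes that formula; the function name relative_gain is hypothetical and is not taken from any tool behind this listing.

```python
def relative_gain(current_stars: int, absolute_gain: int) -> float:
    """Return the percentage growth over a period.

    Assumption: the baseline is the star count at the start of the
    period, i.e. current_stars - absolute_gain.
    """
    baseline = current_stars - absolute_gain
    if baseline <= 0:
        # A repository created during the period has no earlier baseline.
        raise ValueError("baseline star count must be positive")
    return 100.0 * absolute_gain / baseline


# Values copied from the 12-month sections of the listing:
# (repository, current stars, absolute gain)
samples = [
    ("Mrkuhuo/data-warehouse-learning", 750, 746),
    ("xl-xueling/xl-lighthouse", 298, 250),
    ("datavane/datavines", 596, 280),
]

for name, stars, gain in samples:
    print(f"{name}: {stars:,} (+{relative_gain(stars, gain):,.0f}%)")
```

Under this assumption a small repository can top the relative ranking with a modest absolute gain, which is why the relative and absolute lists are ordered so differently.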