11 results found
A beginner's guide to big data :star:
Created
2019-03-10
607 commits to master branch, last one about a year ago
Apache DolphinScheduler is a modern data orchestration platform for agile, low-code creation of high-performance workflows.
Created
2019-03-01
8,563 commits to dev branch, last one 9 hours ago
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Created
2019-02-14
405 commits to master branch, last one 2 years ago
Azkaban workflow manager.
Created
2012-10-18
2,976 commits to master branch, last one about a year ago
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualizati...
Created
2019-11-24
3,795 commits to master branch, last one 6 days ago
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Created
2021-03-02
1,465 commits to master branch, last one 3 months ago
Schedulis is a high-performance workflow task scheduling system that supports high availability and multi-tenant financial-level features, Linkis computing middleware, and has been integrated into dat...
Created
2020-05-07
4 commits to master branch, last one 6 days ago
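The best big data project. "Titan Data Operations System": this project is a full-stack, closed-loop system. It has a web system for data visualization, reads logs through flume-kafka-flume, designs the data warehouse in Hive, uses Spark code to transform data between warehouse tables and to migrate ADS-layer tables to MySQL, and uses Azkaban to schedule timed jobs. Technologies used: Java/Scala, Hadoop, Spark, Hive, Kafka, Flume, Azkab...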
Created
2020-07-29
33 commits to master branch, last one 3 years ago
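A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH 6.3.2 (scripted with vagrant+ansible), and Azkaban workflows are also included.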
Created
2021-05-10
285 commits to dev branch, last one 2 years ago
A book recommendation system based on big data
Created
2020-01-15
24 commits to master branch, last one 2 years ago
A general-purpose data synchronization microservice based on DataX; a single RESTful interface handles all general data synchronization
Created
2019-04-28
75 commits to develop branch, last one 3 years ago