14 results found

4.2k
15.4k
unknown
443
Big Data Getting Started Guide :star:
Created 2019-03-10
607 commits to master branch, last one about a year ago
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Created 2019-02-14
405 commits to master branch, last one about a year ago
844
2.5k
unknown
46
Big data learning: learn big data from scratch, with study videos and interview materials for each stage of learning
Created 2019-11-30
848 commits to master branch, last one 5 months ago
1.6k
2.5k
apache-2.0
227
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data
Created 2011-08-12
2,012 commits to trunk branch, last one 19 days ago
141
1.4k
mit
36
Extract logic from your apps with a user-friendly node editor powered by React.
Created 2020-03-01
322 commits to master branch, last one 8 months ago
More than 2000+ Data engineer interview questions.
Created 2021-08-08
16 commits to master branch, last one 12 days ago
47
193
unknown
5
Distributed real-time log analysis and intrusion detection system
Created 2018-12-09
11 commits to master branch, last one 2 years ago
This repository has no description...
Created 2017-01-20
139 commits to master branch, last one 2 days ago
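The best big data project. "Titan Data Operations System" is a full-stack, closed-loop system: a web system for data visualization, flume-kafka-flume for reading logs, a data warehouse designed in Hive, Spark code that transforms data between warehouse tables and migrates ADS-layer tables to MySQL, and Azkaban for scheduling timed jobs. Technologies used: Java/Scala, Hadoop, Spark, Hive, Kafka, Flume, Azkab...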
Created 2020-07-29
33 commits to master branch, last one 2 years ago
44
95
unknown
7
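A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.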
Created 2021-05-10
285 commits to dev branch, last one 2 years ago
Flume NG MongoDB source.
Created 2012-09-28
43 commits to master branch, last one about a year ago
21
67
unknown
5
A big data project that analyzes user behavior logs
Created 2019-09-26
22 commits to master branch, last one 4 years ago
A book recommendation system based on big data
Created 2020-01-15
24 commits to master branch, last one 2 years ago
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center ...
Created 2022-11-30
16 commits to main branch, last one about a year ago