34 results found

1.6k
6.7k
other
449
A Flexible and Powerful Parameter Server for large-scale machine learning
Created 2017-04-25
2,963 commits to master branch, last one about a year ago
524
6.4k
apache-2.0
79
SQL stream processing, analytics, and management. We decouple storage and compute to offer instant failover, dynamic scaling, speedy bootstrapping, and efficient joins.
Created 2022-01-28
10,598 commits to main branch, last one 16 hours ago
1.4k
3.4k
unknown
443
Coolplay Spark: Spark source code walkthroughs, Spark libraries, and more
Created 2015-12-03
138 commits to master branch, last one 2 years ago
A Spark-based movie recommendation system, including a crawler project, a web site, an admin back end, and the Spark recommendation system
Created 2018-04-18
27 commits to master branch, last one 5 years ago
310
2.0k
mit
92
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Created 2019-04-22
372 commits to main branch, last one about a year ago
437
1.1k
unknown
130
A collection of test cases and related reference material gathered while using Scala and Spark
Created 2015-09-24
226 commits to master branch, last one 5 years ago
481
977
apache-2.0
103
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Created 2017-09-05
6,922 commits to master branch, last one 3 years ago
212
940
mit
145
C# and F# language binding and extensions to Apache Spark
Created 2015-10-27
1,083 commits to master branch, last one 4 months ago
338
743
other
97
An open source framework for building data analytic applications.
Created 2014-08-02
52,744 commits to develop branch, last one 16 days ago
Papers and reading material related to streaming systems
Created 2017-01-24
3 commits to master branch, last one 7 years ago
175
403
apache-2.0
46
Stream computing platform for big data
Created 2018-06-19
625 commits to main branch, last one 2 years ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
942 commits to master branch, last one 21 days ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created 2019-07-23
290 commits to master branch, last one 9 hours ago
97
243
apache-2.0
36
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Created 2018-04-04
167 commits to master branch, last one about a year ago
173
232
apache-2.0
48
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Created 2015-09-04
377 commits to master branch, last one about a year ago
97
163
apache-2.0
684
StreamLine - Streaming Analytics
Created 2015-05-14
2,413 commits to master branch, last one 4 years ago
125
157
unknown
19
Updated repository
Created 2017-03-15
23 commits to master branch, last one 5 years ago
79
137
apache-2.0
13
Kinesis Connector for Structured Streaming
Created 2018-03-01
90 commits to master branch, last one about a year ago
Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Created 2016-08-18
14 commits to master branch, last one 7 years ago
Apache Spark 3 - Structured Streaming Course Material
Created 2020-07-21
29 commits to master branch, last one 3 years ago
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a ...
Created 2017-01-13
245 commits to master branch, last one about a year ago
Code examples for Apache Spark using Python
Created 2017-08-09
138 commits to master branch, last one about a year ago
26
93
apache-2.0
8
Custom state store providers for Apache Spark
Created 2018-08-13
43 commits to master branch, last one 2 years ago
This repository will help you learn Databricks concepts through examples. It will cover the important topics a data engineer needs in real-world work. We wil...
Created 2022-05-10
45 commits to master branch, last one about a year ago
A data engineering project (Twitter monitor app)
Created 2022-04-14
83 commits to main branch, last one about a year ago
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
Created 2021-08-10
88 commits to main branch, last one 2 years ago
Repository used for Spark Trainings
Created 2015-12-28
358 commits to master branch, last one 2 years ago
Flash sale (seckill) and music store hands-on projects, Redis source code, recommendation system
Created 2019-05-22
583 commits to master branch, last one 2 years ago
Create streaming data, send it to Kafka, transform it with PySpark, and write it to Elasticsearch and MinIO
Created 2022-11-12
27 commits to main branch, last one 10 months ago
This repository contains the code base for the Open Stream Processing Benchmark.
Created 2019-07-26
27 commits to master branch, last one 2 years ago