37 results found

1.6k
6.7k
other
443
A Flexible and Powerful Parameter Server for large-scale machine learning
Created 2017-04-25
2,963 commits to master branch, last one 2 years ago
1.4k
3.5k
unknown
440
Coolplay Spark: Spark source code analysis, Spark libraries, and more
Created 2015-12-03
138 commits to master branch, last one 2 years ago
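A Spark-based movie recommendation system, including a web crawler project, a website, an admin back-end system, and the Spark recommendation system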
Created 2018-04-18
27 commits to master branch, last one 6 years ago
325
2.1k
mit
84
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Created 2019-04-22
384 commits to main branch, last one 19 days ago
433
1.1k
unknown
128
A collection of test cases and related reference material gathered while using Scala and Spark
Created 2015-09-24
226 commits to master branch, last one 6 years ago
478
978
apache-2.0
102
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Created 2017-09-05
6,922 commits to master branch, last one 4 years ago
211
940
mit
141
C# and F# language binding and extensions to Apache Spark
Created 2015-10-27
1,083 commits to master branch, last one about a year ago
345
771
other
95
An open source framework for building data analytic applications.
Created 2014-08-02
53,044 commits to develop branch, last one 9 days ago
Papers and reading material related to streaming systems
Created 2017-01-24
3 commits to master branch, last one 8 years ago
174
402
apache-2.0
45
Stream computing platform for bigdata
Created 2018-06-19
625 commits to main branch, last one 3 years ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created 2019-07-23
303 commits to master branch, last one about a month ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created 2019-03-14
961 commits to master branch, last one 3 months ago
35
252
other
7
Databricks framework to validate Data Quality of pySpark DataFrames
Created 2024-04-23
102 commits to main branch, last one 17 days ago
81
245
apache-2.0
35
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Created 2018-04-04
171 commits to master branch, last one 4 months ago
177
235
apache-2.0
45
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Created 2015-09-04
377 commits to master branch, last one 2 years ago
96
164
apache-2.0
682
StreamLine - Streaming Analytics
Created 2015-05-14
2,413 commits to master branch, last one 5 years ago
126
157
unknown
18
Updated repository
Created 2017-03-15
23 commits to master branch, last one 6 years ago
Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Created 2016-08-18
14 commits to master branch, last one 7 years ago
80
136
apache-2.0
12
Kinesis Connector for Structured Streaming
Created 2018-03-01
90 commits to master branch, last one 2 years ago
Apache Spark 3 - Structured Streaming Course Material
Created 2020-07-21
29 commits to master branch, last one 4 years ago
Code examples on Apache Spark using python
Created 2017-08-09
138 commits to master branch, last one 2 years ago
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a ...
Created 2017-01-13
245 commits to master branch, last one 2 years ago
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We wil...
Created 2022-05-10
45 commits to master branch, last one 2 years ago
26
92
apache-2.0
7
Custom state store providers for Apache Spark
Created 2018-08-13
45 commits to master branch, last one 2 months ago
A data engineering project (Twitter monitor app)
Created 2022-04-14
83 commits to main branch, last one 2 years ago
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Created 2023-04-17
104 commits to main branch, last one about a month ago
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
Created 2021-08-10
88 commits to main branch, last one 3 years ago
Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO
Created 2022-11-12
27 commits to main branch, last one about a year ago
Flash sale (seckill), hands-on music store project, Redis source code, recommendation system
Created 2019-05-22
583 commits to master branch, last one 2 years ago
Repository used for Spark Trainings
Created 2015-12-28
358 commits to master branch, last one 3 years ago