35 results found
- Filter by Primary Language:
- Python (8)
- Scala (7)
- Jupyter Notebook (7)
- Java (5)
- C# (3)
- TypeScript (1)
- JavaScript (1)
- Rust (1)
- C (1)
Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming ...
Created
2022-01-28
11,973 commits to main branch, last one a day ago
A Flexible and Powerful Parameter Server for large-scale machine learning
Created
2017-04-25
2,963 commits to master branch, last one 2 years ago
Coolplay Spark: Spark source code analysis, Spark libraries, and more
Created
2015-12-03
138 commits to master branch, last one 2 years ago
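A Spark-based movie recommendation system, including a web crawler project, a website, an admin back-end system, and a Spark recommendation system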
Created
2018-04-18
27 commits to master branch, last one 5 years ago
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Created
2019-04-22
377 commits to main branch, last one 21 hours ago
A collection of test cases and related materials compiled while using Scala and Spark
Created
2015-09-24
226 commits to master branch, last one 5 years ago
Wormhole is a SPaaS (Stream Processing as a Service) Platform
Created
2017-09-05
6,922 commits to master branch, last one 3 years ago
C# and F# language binding and extensions to Apache Spark
Created
2015-10-27
1,083 commits to master branch, last one 10 months ago
An open source framework for building data analytic applications.
Created
2014-08-02
52,872 commits to develop branch, last one 23 hours ago
Papers and reading materials related to streaming systems
Created
2017-01-24
3 commits to master branch, last one 7 years ago
Stream computing platform for big data
Created
2018-06-19
625 commits to main branch, last one 2 years ago
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Created
2019-07-23
300 commits to master branch, last one a day ago
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsigh...
Created
2019-03-14
960 commits to master branch, last one 2 months ago
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Created
2018-04-04
171 commits to master branch, last one 2 days ago
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Created
2015-09-04
377 commits to master branch, last one about a year ago
StreamLine - Streaming Analytics
Created
2015-05-14
2,413 commits to master branch, last one 5 years ago
Updated repository
Created
2017-03-15
23 commits to master branch, last one 6 years ago
Analyze and visualize Twitter sentiment on a world map using Spark MLlib
Created
2016-08-18
14 commits to master branch, last one 7 years ago
Kinesis Connector for Structured Streaming
Created
2018-03-01
90 commits to master branch, last one 2 years ago
Apache Spark 3 - Structured Streaming Course Material
Created
2020-07-21
29 commits to master branch, last one 4 years ago
Code examples for Apache Spark using Python
Created
2017-08-09
138 commits to master branch, last one 2 years ago
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a ...
Created
2017-01-13
245 commits to master branch, last one 2 years ago
This repository will help you learn Databricks concepts through examples. It will include all the important topics we need in our real-life experience as data engineers. We wil...
Created
2022-05-10
45 commits to master branch, last one 2 years ago
Custom state store providers for Apache Spark
Created
2018-08-13
43 commits to master branch, last one 2 years ago
A data engineering project (Twitter monitor app)
Created
2022-04-14
83 commits to main branch, last one 2 years ago
For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR
Created
2021-08-10
88 commits to main branch, last one 2 years ago
Create streaming data, send it to Kafka, transform it with PySpark, and load it into Elasticsearch and MinIO
Created
2022-11-12
27 commits to main branch, last one about a year ago
Repository used for Spark Trainings
Created
2015-12-28
358 commits to master branch, last one 3 years ago
Flash sale (seckill), hands-on music store project, Redis source code, recommendation system
Created
2019-05-22
583 commits to master branch, last one 2 years ago
Benchmarks for data processing systems: Pathway, Spark, Flink, Kafka Streams
Created
2023-04-17
102 commits to main branch, last one 5 months ago