Trending repositories for topic databricks
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
A native Rust library for Delta Lake, with bindings into Python
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
A native Rust library for Delta Lake, with bindings into Python
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
A native Rust library for Delta Lake, with bindings into Python
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can...
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
Examples of using Terraform to deploy Databricks resources
Accelerates migrations to Databricks by automating code conversion and migration validation
Using U-Net Model to Detect Wildfire from Satellite Imagery
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
Accelerates migrations to Databricks by automating code conversion and migration validation
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
Using U-Net Model to Detect Wildfire from Satellite Imagery
Examples of using Terraform to deploy Databricks resources
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
A native Rust library for Delta Lake, with bindings into Python
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can...
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
A native Rust library for Delta Lake, with bindings into Python
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can...
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
An open-source Python library for simplifying local testing of Databricks workflows that use PySpark and Delta tables.
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
Notebooks and Code about Generative Ai, LLMs, MLOPS, NLP , CV and Graph databases
This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.
Code examples and resources for DBRX, a large language model developed by Databricks
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
Examples of using Terraform to deploy Databricks resources
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
Databricks framework to validate Data Quality of pySpark DataFrames
An open-source Python library for simplifying local testing of Databricks workflows that use PySpark and Delta tables.
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
Notebooks and Code about Generative Ai, LLMs, MLOPS, NLP , CV and Graph databases
This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
A native Rust library for Delta Lake, with bindings into Python
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...
Examples of using Terraform to deploy Databricks resources
Accelerates migrations to Databricks by automating code conversion and migration validation
Using U-Net Model to Detect Wildfire from Satellite Imagery
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Code examples and resources for DBRX, a large language model developed by Databricks
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
An open-source Python library for simplifying local testing of Databricks workflows that use PySpark and Delta tables.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Code examples and resources for DBRX, a large language model developed by Databricks
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Data Activation
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
🏆 实时 零代码、全功能、强安全 ORM 库 🚀 后端接口和文档零代码,前端(客户端) 定制返回 JSON 的数据和结构 🏆 Real-Time coding-free, powerful and secure ORM 🚀 providing APIs and Docs without coding by Backend, and the returned JSON of API can...
A native Rust library for Delta Lake, with bindings into Python
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
DataOps for Microsoft Data Platform technologies. https://aka.ms/dataops-repo
Examples of using Terraform to deploy Databricks resources
Code examples and resources for DBRX, a large language model developed by Databricks
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
In this solution, we offer a novel approach to sustainable finance by combining NLP techniques and news analytics to extract key strategic ESG initiatives and learn companies' commitments to corporate...
Grafana Databricks integration allowing direct connection to Databricks to query and visualize Databricks data in Grafana.
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POC...
Examples of using Terraform to deploy Databricks resources
A native Rust library for Delta Lake, with bindings into Python
Demonstration of using Files in Repos with Databricks Delta Live Tables
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Prod...
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you to programatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or ...