lakehouse
Here are 54 public repositories matching this topic...
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
-
Updated
Nov 28, 2023 - Java
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
-
Updated
Nov 16, 2023 - Java
YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
Nov 28, 2023 - C++
Use SQL to build ELT pipelines on a data lakehouse.
-
Updated
May 25, 2022 - JavaScript
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
-
Updated
Nov 28, 2023 - Python
Examples of using Terraform to deploy Databricks resources
-
Updated
Nov 21, 2023 - HCL
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
-
Updated
Oct 18, 2023 - Python
Lakehouse storage system benchmark
-
Updated
Feb 22, 2023 - Scala
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
-
Updated
Sep 2, 2023 - Dockerfile
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
-
Updated
Nov 9, 2023 - Python
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/
-
Updated
Nov 27, 2023 - Scala
Open source stack lakehouse
-
Updated
Mar 27, 2023 - Python
Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
-
Updated
Dec 6, 2021 - Jupyter Notebook
Unlocking the Power of Health Data With a Modern Data Lakehouse
-
Updated
Jun 16, 2023 - Python
Improve this page
Add a description, image, and links to the lakehouse topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the lakehouse topic, visit your repo's landing page and select "manage topics."

