Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
Updated Aug 12, 2023 - Python
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Concurrent and multi-stage data ingestion and data processing with Elixir
Pravega - Streaming as a new software defined storage primitive
Open-Source Hybrid Search for Postgres
Use SQL to build ELT pipelines on a data lakehouse.
A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way
The Data Engineering Book - a data engineering book by Thai people, for Thai people
Squirrel dataset hub
OpenKit Java Reference Implementation
Enables custom tracing of Java applications in Dynatrace
Download and warehouse historical trading data
Enables custom tracing of Python applications in Dynatrace
Sample code for the AWS Big Data Blog post "Building a scalable streaming data processor with Amazon Kinesis Data Streams on AWS Fargate"
The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.
Enables custom tracing of native applications in Dynatrace
Describes technical concepts of Dynatrace OneAgent SDK