Logstash - transport and process your logs, events, or other data
-
Updated
Feb 13, 2023 - Java
Logstash - transport and process your logs, events, or other data
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
Flow-based programming for JavaScript
The open source high performance data integration platform built for developers.
This repository is a getting started guide to Singer.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A scalable general purpose micro-framework for defining dataflows. You can use it to build dataframes, numpy matrices, python objects, ML models, etc. Embed Hamilton anywhere python runs, e.g. spark, airflow, jupyter, fastapi, python scripts, etc.
Open source SQL engine in Python
Extract, Transform, Load: Any SQL Database in 4 lines of Code.
A simplified, lightweight ETL Framework based on Apache Spark
Knowledge Graph Toolkit
A tool for building feature stores.
A lightweight ETL (extract, transform, load) library and data integration toolbox for .NET.
Bender - Serverless ETL Framework
A visual ETL development and debugging tool for big data
Configurable Extract, Transform, and Load
一款基于kettle的数据处理web调度控制平台,支持文档资源库和数据库资源库,通过web平台控制kettle数据转换,可作为中间件集成到现有系统中
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
Add a description, image, and links to the etl-framework topic page so that developers can more easily learn about it.
To associate your repository with the etl-framework topic, visit your repo's landing page and select "manage topics."