The Wayback Machine - https://web.archive.org/web/20220621105948/https://github.com/odpf
Skip to content
@odpf

Open DataOps Foundation

Modern data platform that empowers organizations to discover, transform, analyse and secure data faster and efficiently.

Pinned

  1. optimus Public

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Go 620 147

  2. dagger Public

    Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.

    Java 182 26

  3. firehose Public

    Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.

    Java 224 34

  4. raccoon Public

    Raccoon is a high-throughput, low-latency service to collect events in real-time from your web, mobile apps, and services using multiple network protocols.

    Go 134 16

  5. guardian Public

    Guardian is a tool for extensible and universal data access with automated access workflows and security controls across data stores, analytical systems, and cloud products.

    Go 116 8

  6. stencil Public

    Stencil is a schema registry that provides schema management and validation dynamically, efficiently, and reliably to ensure data compatibility across applications.

    Go 122 28

Repositories

  • proton Public

    This repository is home to the original protobuf interface definitions which are used throughout the open data platform ecosystem.

    41 Apache-2.0 13 3 2 Updated Jun 21, 2022
  • optimus Public

    Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

    Go 620 Apache-2.0 147 94 (2 issues need help) 9 Updated Jun 21, 2022
  • dagger Public

    Dagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.

    Java 182 Apache-2.0 26 27 (3 issues need help) 3 Updated Jun 21, 2022
  • raccoon Public

    Raccoon is a high-throughput, low-latency service to collect events in real-time from your web, mobile apps, and services using multiple network protocols.

    Go 134 Apache-2.0 16 2 1 Updated Jun 21, 2022
  • entropy Public

    Entropy is a framework to safely and predictably create, change, and improve modern cloud applications and infrastructure using familiar languages, tools, and engineering practices.

    Go 8 Apache-2.0 1 5 1 Updated Jun 21, 2022
  • siren Public

    Siren provides an easy-to-use universal alert, notification, channels management framework for the entire observability infrastructure.

    Go 54 Apache-2.0 6 16 (3 issues need help) 2 Updated Jun 21, 2022
  • meteor Public

    Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.

    Go 98 Apache-2.0 24 31 (15 issues need help) 5 Updated Jun 21, 2022
  • charts Public

    This repository is home to the original helm charts for products throughout the open data platform ecosystem.

    Smarty 39 Apache-2.0 6 5 (1 issue needs help) 1 Updated Jun 20, 2022
  • firehose Public

    Firehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.

    Java 224 Apache-2.0 34 10 2 Updated Jun 20, 2022
  • apsara Public

    Apsara is a UI design system for react written on top of ant design to power the projects for the open data platform.

    TypeScript 37 Apache-2.0 5 6 1 Updated Jun 20, 2022