The Metadata Platform for the Modern Data Stack
Updated Feb 28, 2023 - Java
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
Intake is a lightweight package for finding, investigating, loading and disseminating data.
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
Recap is a metadata toolkit written in Python.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and DataHub.
Meteor is an easy-to-use, plugin-driven metadata collection framework that extracts data from different sources and sinks it to any data catalog.
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.
Reference Architectures for Data Lakes on AWS
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Sample code with integration between Data Catalog and RDBMS data sources.
This documentation repository is part of the Corporate Linked Data Catalog (COLID) application.
End-to-end DataOps platform deployed by Terraform.
National Data Archive (NADA) is an open source data cataloging system that serves as a portal for researchers to browse, search, compare, apply for access, and download relevant census or survey information. It was originally developed to support the establishment of national survey data archives.
Sample code with integration between Data Catalog and BI data sources.
Data catalog for everything in your company