The Wayback Machine - https://web.archive.org/web/20220812015204/https://github.com/topics/data-discovery
Here are 36 public repositories matching this topic...
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
The Metadata Platform for the Modern Data Stack
Updated Aug 12, 2022 · Java
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
Updated Aug 12, 2022 · Python
Collect, aggregate, and visualize a data ecosystem's metadata
Updated Aug 11, 2022 · Java
🐳 The stupidly simple CLI workspace for your data warehouse.
Updated Jun 22, 2022 · Python
Earthdata Search is a web application developed by NASA EOSDIS to enable data discovery, search, comparison, visualization, and access across EOSDIS' Earth Science data holdings.
Updated Aug 1, 2022 · JavaScript
SQL Lineage Analysis Tool powered by Python
Updated Aug 11, 2022 · Python
First open-source data discovery and observability platform. ODD Platform is based on ODD Specification.
Updated Aug 11, 2022 · Java
Object Oriented Internet - C# deliverables supporting a new Machine To Machine (M2M) communication architecture
📙 Awesome Data Catalogs and Observability Platforms.
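A curated collection of experience write-ups from various AI competitions. [There is always a lot to learn from other people's competition experience.]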
Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.
Updated Aug 3, 2022 · Python
ODD Specification is a universal open standard for collecting metadata.
WG3 Metadata Specification
Updated Nov 5, 2018 · Python
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
Updated Jul 7, 2022 · Python
Toolkit for discovering and aggregating data for whole-cell modeling
Updated Jan 19, 2022 · Python
articat: data artifact catalog
Updated May 20, 2022 · Python
Open-source metadata collector based on ODD Specification
Updated Aug 10, 2022 · Python
A system for managing files and file replicas across many diverse sites
Updated Jun 20, 2022 · Java