The Wayback Machine - https://web.archive.org/web/20200911184101/https://github.com/topics/data-curation
Here are 14 public repositories matching this topic...
Metamapper is a self-updating data catalog with full-text search
Updated Sep 9, 2020
Python
general purpose curation of exposure science data
Updated Sep 9, 2020
JavaScript
Updated Oct 17, 2017
Mathematica
tranSMART Arborist ETL toolkit
Updated Jul 31, 2020
Python
TranSMART Arborist: Graphical tool for reshaping your data for the tranSMART data warehouse.
Updated Jul 16, 2020
JavaScript
Code for data linkage (curation of research database).
Updated Aug 6, 2019
Jupyter Notebook
Repository for the collection, management, and versioning of the GCIS data management conventions.
Some analysis on public datasets [WIP]
Updated Jun 25, 2020
Jupyter Notebook
Code I wrote for the paper "Global determinants of freshwater and marine fish genetic diversity", Nature Communications, 2020
University Class Schedule XML & DTD Schema Design
Updated Mar 8, 2018
Jupyter Notebook
Data curation - Relational Schema Design
Canonicalizing data and implementing strategies for ensuring equivalence
Data curation: Importance of data representation and the importance of DTD in data curation