-
Updated
Sep 9, 2020 - Python
#
data-integration
Here are 190 public repositories matching this topic...
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
python
bioinformatics
analysis
clustering
gene-expression
data-visualization
dimensionality-reduction
awesome-list
data-integration
atac-seq
single-cell
rna-seq-data
scrna-seq-data
cell-cycle
cell-differentiation
gene-expression-profiles
analysis-pipeline
cell-populations
rna-seq-experiments
cell-clusters
-
Updated
Sep 8, 2020
Upserts, Deletes And Incremental Processing on Big Data.
bigdata
stream-processing
data-integration
datalake
apachespark
hudi
apachehudi
incremental-processing
-
Updated
Sep 11, 2020 - Java
Fast, sensitive and accurate integration of single-cell data with Harmony
-
Updated
Aug 14, 2020 - R
An example mini data warehouse for Python project stats; a template for new projects
-
Updated
Jul 21, 2020 - Python
Hetionet: an integrative network of disease
-
Updated
Jun 21, 2020 - HTML
scikit-fusion: Data fusion via collective latent factor models
-
Updated
Feb 14, 2020 - Python
NicheNet: predict active ligand-target links between interacting cells
rna-seq
gene-expression
network-inference
data-integration
single-cell-rna-seq
single-cell-omics
intercellular-communication
ligand-receptor
ligand-target
-
Updated
Aug 17, 2020 - R
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing, schema matching, identity resolution, data fusion, and result evaluation.
-
Updated
Jul 1, 2020 - Java
The Common Core Ontology Repository holds the current released version of the Common Core Ontology suite.
semantics
interoperability
data-integration
ontologies
owl-ontology
bfo
cco
applied-ontology
semantic-consistency
ontology-suite
-
Updated
Sep 3, 2020
Business Intelligence and Data Warehousing
mysql
bi
etl
data-warehouse
business-intelligence
pentaho
pdi
data-integration
pentaho-data-integration
tableau
data-modeling
datalake
mysql-workbench
dremio
data-governance
-
Updated
Aug 10, 2020 - TSQL
A data-centric integration platform
-
Updated
Jul 1, 2020 - Java
-
Updated
Aug 11, 2020 - Python
Utilities for creating ETL pipelines with mara
-
Updated
Jul 7, 2020 - PLpgSQL
Toolbox for including enzyme constraints on a genome-scale model.
-
Updated
Sep 1, 2020 - MATLAB
A .NET class library that allows you to import data from different sources into a unified destination
mysql
html
json
csv
sql-server
csharp
xml
sqlite
excel
tabular-data
data-import
oracle
databases
powerpoint
vcard
data-integration
schema-matching
sqlce
schema-mapping
msaccess
-
Updated
Apr 14, 2020 - C#
Scripts and resources to create Hetionet v1.0, a heterogeneous network for drug repurposing
-
Updated
Sep 22, 2017 - Jupyter Notebook
Generation and Applications of Knowledge Graphs in Systems and Networks Biology
simulation
systems-biology
computational-biology
data-integration
curation
biological-expression-language
biocuration
knowledge-graph-embeddings
networks-biology
knowledge-graphs
-
Updated
Dec 16, 2019 - TeX
Research data management in biomedical and machine learning applications
python
workflow
machine-learning
automation
datastructures
neuroscience
pandas
datascience
data-structures
data-integration
neuroimaging
neuroinformatics
biomedical
userfriendly
medical-data
machine-learning-workflows
-
Updated
Jun 2, 2020 - Python
Installer for Thymeflow, a personal knowledge management system.
-
Updated
Apr 17, 2018
Some of the projects I made when starting to learn R for data science at university
-
Updated
Jul 9, 2019 - R
Development of the Gellish Communicator reference application and tools for universal data exchange and data integration supporting Formal English and other Gellish formalized natural languages.
nlp
taxonomies
natural-language-processing
databases
ontology
interoperability
classification
data-integration
family
data-management
reference-implementation
query-language
data-modeling
formal-languages
data-exchange
knowledge-representation
universal-interface
knowledge-modeling
interoperability-of-systems
gellish
formalized-natural-language
-
Updated
Nov 12, 2018 - Python
A collection of Apache Hudi-related resources
bigdata
apache
stream-processing
data-integration
datalake
hudi
apachehudi
incremental-processing
hudi-resources
-
Updated
Sep 4, 2020
R package for high-dimensional data analysis and integration with O2PLS
bioinformatics
biostatistics
pca
data-integration
omics
principal-component-analysis
latent-variable-models
partial-least-squares-regression
pls
multi-omics
-
Updated
Aug 1, 2019 - R
An Efficient RML-Compliant Engine for Knowledge Graph Construction
-
Updated
Sep 11, 2020 - Python
Repository for Data Warehouse Concepts, Design, and Data Integration by the University of Colorado System (Coursera): notes, assignments, quizzes, and research papers
-
Updated
Jun 2, 2018
josvanroosmalen
commented
Apr 10, 2020
Once the compiler is ready (to be honest, it may already be possible), we could, if there is demand, create graphs for designing REST extensions that serve data, and then generate the REST extensions from those graphs. Investigate whether there is demand for this.
Some ideas:
An Input block with GET/POST/PUT/DELETE output node.
An output node with Content/StatusCode/StatusMessage.
Match schema attributes of relational databases by value similarity. This was a study assignment, so it isn't well documented, but you can contact me with questions, and I may add docs if I sense enough interest.
-
Updated
Oct 18, 2019 - Python
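To illustrate what matching schema attributes by value similarity can look like, here is a minimal sketch in Python. It is not taken from the repository above: the Jaccard measure, the function names `jaccard` and `match_attributes`, the 0.5 threshold, and the sample tables are all assumptions chosen for illustration.

```python
from itertools import product


def jaccard(a, b):
    """Jaccard similarity of two collections, compared as sets of values."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def match_attributes(left, right, threshold=0.5):
    """Pair columns of two tables whose value sets overlap enough.

    `left` and `right` map a column name to its list of values.
    Returns (left_column, right_column, score) tuples, best score first.
    The threshold is an arbitrary illustrative default.
    """
    matches = []
    for (l_name, l_values), (r_name, r_values) in product(left.items(), right.items()):
        score = jaccard(l_values, r_values)
        if score >= threshold:
            matches.append((l_name, r_name, score))
    return sorted(matches, key=lambda m: m[2], reverse=True)


# Hypothetical sample tables, invented for this example.
customers = {"id": [1, 2, 3, 4], "city": ["Oslo", "Rome", "Lima", "Oslo"]}
clients = {"client_no": [2, 3, 4, 5], "town": ["Rome", "Lima", "Oslo", "Kyiv"]}

matches = match_attributes(customers, clients)
for left_col, right_col, score in matches:
    print(f"{left_col} <-> {right_col}: {score:.2f}")
```

Comparing value sets ignores column names entirely, so it can pair `city` with `town` even though the names differ. A real matcher would likely also need to handle data types, value formatting, and large tables.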

