COLLECTED BY
Organization:
Internet Archive
Focused crawls are collections of frequently-updated webcrawl data from narrow (as opposed to broad or wide) web crawls, often focused on a single domain or subdomain.
The Wayback Machine - https://web.archive.org/web/20200822232838/https://github.com/topics/entity-resolution
Here are
62 public repositories
matching this topic...
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Updated
Aug 13, 2020
Python
A toolkit for record linkage and duplicate detection in Python
Updated
Jun 4, 2020
Python
🆔 Command line tool for deduplicating CSV files
Updated
Mar 31, 2020
Python
🆔 Examples for using the dedupe library
Updated
May 6, 2020
Python
A list of free data matching and record linkage software.
Recent trends of Entity Linking, Disambiguation, and Representation.
Entity resolution for Elasticsearch.
An open source, high scalability toolkit in Java for Entity Resolution.
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Updated
May 26, 2020
Java
Link Wikidata items to large catalogs
Updated
Apr 1, 2020
Python
Record Linkage ToolKit (Find and link entities)
Updated
Jun 4, 2020
Python
Resources for tackling record linkage / deduplication / data matching problems
Python implementation of anonymous linkage using cryptographic linkage keys
Updated
Aug 21, 2020
Python
Distributed Bayesian Entity Resolution in Apache Spark
Updated
Apr 26, 2020
Scala
Learning String Alignments for Entity Aliases
Updated
Mar 21, 2019
Python
Implementation in Apache Spark of the EM algorithm to estimate parameters of Fellegi-Sunter's canonical model of record linkage.
Updated
Aug 21, 2020
Python
SparkER: an Entity Resolution framework for Apache Spark
Updated
Mar 23, 2020
Scala
Merge Dirty Data with Clean Reference Tables
Updated
Jun 20, 2019
Python
ReCiter: an enterprise open source author disambiguation system for academic institutions
Updated
Aug 17, 2020
Java
Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.
Updated
Jul 16, 2020
Python
WhatIs.this: simple entity resolution through Wikipedia
Learned string similarity for entity names using optimal transport.
Updated
Nov 21, 2019
Python
A browser user interface for manual labeling of record pairs.
Updated
Dec 26, 2019
JavaScript
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
Updated
Jul 16, 2020
Python
Identifies and validates financial security ids such as Sedol, Cusip, Isin numbers.
Updated
Sep 20, 2017
Python
Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity resolution aims to identify descriptions that refer to the same entity within or across knowledge bases.
Updated
Apr 25, 2019
Java
Fork of the Freely Extensible Biomedical Record Linkage program
Updated
Nov 4, 2016
Python
Updated
Jun 29, 2020
Java
A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).
Updated
Jul 28, 2017
Python
Rust crate for entity parsing
Updated
Jan 24, 2020
Rust
Improve this page
Add a description, image, and links to the
entity-resolution
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
entity-resolution
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.