The Wayback Machine - https://web.archive.org/web/20220506064706/https://github.com/fgregg

Organizations

@open-city @datamade @dssg

fgregg/README.md

"using my custom formula, I would get a prediction"

Pinned

  1. 🆔 A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

    Python · 3.4k stars · 475 forks

  2. 🇺🇸 A Python library for parsing unstructured United States address strings into address components.

    Python · 1.3k stars · 264 forks

  3. 🔖 A toolkit for making domain-specific probabilistic parsers.

    Python · 739 stars · 83 forks

  4. 🆔 Command-line tool for deduplicating CSV files.

    Python · 349 stars · 78 forks

  5. Estimating Markov random field models with pseudolikelihood.

    Python · 1 star · 1 fork

  6. 👪 A Python library for parsing unstructured Western names into name components.

    Python · 495 stars · 63 forks

2,523 contributions in the last year


Contribution activity

May 2022

Reviewed 4 pull requests in 2 repositories
dedupeio/dedupe 2 pull requests
datamade/civic-scraper 2 pull requests

Created an issue in dedupeio/dedupe that received 3 comments

benchmark runs with training separately than runs that use settings file

@NickCrews, if i understand the benchmarking code correctly, the first time a benchmark runs it will create a settings file, and then in all subseq…

Opened 2 other issues in 1 repository
12 contributions in private repositories May 1 – May 6