There are no reviews yet. Be the first to send feedback to the community and the maintainers!
dedupe
π A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.csvdedupe
π Command line tool for deduplicating CSV filesdedupe-examples
π Examples for using the dedupe libraryaddress-matching
Python script for matching a list of messy addresses against a gazetteer using dedupe.affinegap
π A Cython implementation of the affine gap string distancehcluster
Hierarchical Clustering Algorithmsdedupe-geocoder
π Demonstration of how dedupe might be used as geocoderdoublemetaphone
π Python wrapper for a C++ Double Metaphonefuzzycategory
π Fuzzy Categorical Distancesrlr
Regularized Logistic Regressiondedupe-variable-address
Address Variable Type for dedupededupe-variable-person
Dedupe variable for person names. just people. no companies.dedupe-variable-name
name variable type for dedupesoft-tfidf
Mispelling tolerant tf-idf similarity metrichighered
CRF Edit Distancededupeio-web-api-docs
Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.dedupe-variable-employer
dedupe-variable-datetime
DateTime variable for dedupededupe-variable-fuzzycategory
Dedupe Variable for Fuzzy Categoriescategorical-distance
π Compare categorical variablesparseratorvariable
Base class for dedupe variables for parsed fieldssimplecosine
π simple cosine distancededupe-variable-number
Try to cast strings to numbers, then comparedatetime-distance
Β π Compare dates and timesdedupe-variable-ilcs
Dedupe variable for Illinois Compiled Statute (ILCS) codesLove Open Source and this site? Check out how you can help us