There are no reviews yet. Be the first to send feedback to the community and the maintainers!
dedupe
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.csvdedupe
🆔 Command line tool for deduplicating CSV filesdedupe-examples
🆔 Examples for using the dedupe libraryaddress-matching
Python script for matching a list of messy addresses against a gazetteer using dedupe.affinegap
📐 A Cython implementation of the affine gap string distancehcluster
Hierarchical Clustering Algorithmsdedupe-geocoder
📍 Demonstration of how dedupe might be used as geocoderdoublemetaphone
🔉 Python wrapper for a C++ Double Metaphonefuzzycategory
📐 Fuzzy Categorical Distancesdedupe-variable-address
Address Variable Type for dedupededupe-variable-person
Dedupe variable for person names. just people. no companies.dedupe-variable-name
name variable type for dedupesoft-tfidf
Mispelling tolerant tf-idf similarity metrichighered
CRF Edit Distancededupeio-web-api-docs
Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.dedupe-variable-employer
dedupe-vowpal
Vowpal Wabbit Active Labeler for Dedupededupe-variable-datetime
DateTime variable for dedupededupe-variable-fuzzycategory
Dedupe Variable for Fuzzy Categoriescategorical-distance
📐 Compare categorical variablesparseratorvariable
Base class for dedupe variables for parsed fieldssimplecosine
📐 simple cosine distancededupe-variable-number
Try to cast strings to numbers, then comparedatetime-distance
📐 Compare dates and timesdedupe-variable-ilcs
Dedupe variable for Illinois Compiled Statute (ILCS) codesLove Open Source and this site? Check out how you can help us