• Stars
    star
    3
  • Rank 3,963,521 (Top 79 %)
  • Language
    Python
  • Created about 5 years ago
  • Updated over 4 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Try to cast strings to numbers, then compare

More Repositories

1

dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Python
4,080
star
2

csvdedupe

🆔 Command line tool for deduplicating CSV files
Python
409
star
3

dedupe-examples

🆔 Examples for using the dedupe library
Python
403
star
4

address-matching

Python script for matching a list of messy addresses against a gazetteer using dedupe.
Python
60
star
5

affinegap

📐 A Cython implementation of the affine gap string distance
Cython
58
star
6

hcluster

Hierarchical Clustering Algorithms
Python
35
star
7

dedupe-geocoder

📍 Demonstration of how dedupe might be used as geocoder
Python
17
star
8

doublemetaphone

🔉 Python wrapper for a C++ Double Metaphone
C++
15
star
9

fuzzycategory

📐 Fuzzy Categorical Distances
Python
14
star
10

rlr

Regularized Logistic Regression
Python
11
star
11

dedupe-variable-address

Address Variable Type for dedupe
Python
9
star
12

dedupe-variable-person

Dedupe variable for person names. just people. no companies.
Python
9
star
13

dedupe-variable-name

name variable type for dedupe
Python
8
star
14

soft-tfidf

Mispelling tolerant tf-idf similarity metric
6
star
15

highered

CRF Edit Distance
Python
6
star
16

dedupeio-web-api-docs

Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.
Python
6
star
17

dedupe-variable-employer

Python
5
star
18

dedupe-vowpal

Vowpal Wabbit Active Labeler for Dedupe
Python
4
star
19

dedupe-variable-datetime

DateTime variable for dedupe
Python
4
star
20

dedupe-variable-fuzzycategory

Dedupe Variable for Fuzzy Categories
Python
4
star
21

categorical-distance

📐 Compare categorical variables
Python
4
star
22

parseratorvariable

Base class for dedupe variables for parsed fields
Python
3
star
23

simplecosine

📐 simple cosine distance
Python
3
star
24

datetime-distance

 📐 Compare dates and times
Python
3
star
25

dedupe-variable-ilcs

Dedupe variable for Illinois Compiled Statute (ILCS) codes
Python
2
star