• Stars
    star
    15
  • Rank 1,363,680 (Top 27 %)
  • Language
    C++
  • License
    Artistic License 2.0
  • Created over 8 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

🔉 Python wrapper for a C++ Double Metaphone

More Repositories

1

dedupe

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Python
4,080
star
2

csvdedupe

🆔 Command line tool for deduplicating CSV files
Python
409
star
3

dedupe-examples

🆔 Examples for using the dedupe library
Python
403
star
4

address-matching

Python script for matching a list of messy addresses against a gazetteer using dedupe.
Python
60
star
5

affinegap

📐 A Cython implementation of the affine gap string distance
Cython
58
star
6

hcluster

Hierarchical Clustering Algorithms
Python
35
star
7

dedupe-geocoder

📍 Demonstration of how dedupe might be used as geocoder
Python
17
star
8

fuzzycategory

📐 Fuzzy Categorical Distances
Python
14
star
9

rlr

Regularized Logistic Regression
Python
11
star
10

dedupe-variable-address

Address Variable Type for dedupe
Python
9
star
11

dedupe-variable-person

Dedupe variable for person names. just people. no companies.
Python
9
star
12

dedupe-variable-name

name variable type for dedupe
Python
8
star
13

soft-tfidf

Mispelling tolerant tf-idf similarity metric
6
star
14

highered

CRF Edit Distance
Python
6
star
15

dedupeio-web-api-docs

Dedupe.io web API allows for matching and training against projects using a standard RESTful framework.
Python
6
star
16

dedupe-variable-employer

Python
5
star
17

dedupe-vowpal

Vowpal Wabbit Active Labeler for Dedupe
Python
4
star
18

dedupe-variable-datetime

DateTime variable for dedupe
Python
4
star
19

dedupe-variable-fuzzycategory

Dedupe Variable for Fuzzy Categories
Python
4
star
20

categorical-distance

📐 Compare categorical variables
Python
4
star
21

parseratorvariable

Base class for dedupe variables for parsed fields
Python
3
star
22

simplecosine

📐 simple cosine distance
Python
3
star
23

dedupe-variable-number

Try to cast strings to numbers, then compare
Python
3
star
24

datetime-distance

 📐 Compare dates and times
Python
3
star
25

dedupe-variable-ilcs

Dedupe variable for Illinois Compiled Statute (ILCS) codes
Python
2
star