Jonathan de Bruin (@J535D165)

Top repositories

1

recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python
Python
838
star
2

data-matching-software

A list of free data matching and record linkage software.
311
star
3

CoronaWatchNL

Numbers concerning COVID-19 disease cases in The Netherlands by RIVM, LCPS, NICE, ECML, and Rijksoverheid.
Jupyter Notebook
145
star
4

cbsodata

Unofficial Statistics Netherlands (CBS) opendata API client for Python
Python
35
star
5

pyalex

A Python library for OpenAlex (openalex.org)
Python
34
star
6

recordlinkage-annotator

A browser user interface for manual labeling of record pairs.
JavaScript
34
star
7

PublicSectorNL

Open Source in the public sector in the Netherlands
Makefile
28
star
8

FEBRL-fork-v0.4.2

Fork of the Freely Extensible Biomedical Record Linkage program
Python
22
star
9

datahugger

One downloader for many scientific data and code repositories! DOI๐Ÿ‘Data
Python
12
star
10

recordlinkage-review

Make golden data or validate your record linkage.
JavaScript
7
star
11

scitree

Scitree is a recursive directory listing tool optimized for science
Python
5
star
12

cbsshape

Simple interface for CBS Wijk en Buurtkaart.
R
3
star
13

Data-Science-Day

Additional material for the Data Science Day (Utrecht University) workshop: "Data Engineering: Clean and Integrate Your Data!"
Jupyter Notebook
2
star
14

scisort

Sort files in research project folders in a scientific order
Python
2
star
15

CoronaWatchNLExtended

Models based on COVID-19 disease counts in The Netherlands, as reported by RIVM
Python
2
star
16

recordlinkage-notebooks

Jupyter Notebook
2
star
17

recordlinkage-performance

Experiments to get the best performance!
Jupyter Notebook
1
star