• Stars
    star
    1
  • Language
    Jupyter Notebook
  • Created almost 2 years ago
  • Updated almost 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

DSSG document categorization repository

More Repositories

1

aleph

Search and browse documents and data; find the people and companies you look for.
JavaScript
1,996
star
2

memorious

Lightweight web scraping toolkit for documents and structured data.
Python
311
star
3

followthemoney

Data model and processing tools for investigative entity data
Python
207
star
4

fingerprints

Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
Python
142
star
5

cronodump

A Cronos database converter
Python
70
star
6

countrynames

Utility library to turn country names into ISO two-letter codes
Python
65
star
7

ingest-file

Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.
Python
54
star
8

datadesktop

DEPRECATED. Desktop graph visualization application
TypeScript
50
star
9

pdflib

Binary Python bindings for poppler utils for content extraction
Python
42
star
10

synonames

Trying to generate name synonyms from wikidata
Python
33
star
11

alephclient

API client for Aleph, supports bulk entity and document upload.
Python
27
star
12

pantomime

Python library for MIME type parsing, normalisation and grouping.
Python
12
star
13

offshoreleaks

Converter for ICIJ Offshore Leaks data into FollowTheMoney format
Python
12
star
14

followthemoney-store

Fragment storage/database layer for FollowTheMoney entities
Python
10
star
15

react-ftm

React UI component library for aleph/followthemoney
TypeScript
10
star
16

languagecodes

A Python helper library to convert between ISO 639 two- and three-letter codes.
Python
10
star
17

countrytagger

Extract names of places from text and determine which country they may refer to
Python
8
star
18

servicelayer

Common interface definitions for aleph toolkit services and applications
Python
7
star
19

followthemoney-ocds

Import data formatted as OpenContracting Data Standard (OCDS) objects into FollowTheMoney
Python
7
star
20

panama

Parser for a 2008 scrape of the Panama companies registry
Python
6
star
21

docs

GitHub mirror of the GitBook documentation
6
star
22

followthemoney-predict

Experiments with FtM record linkage
Jupyter Notebook
5
star
23

alephr

R package wrapper for Aleph API
R
4
star
24

translate-service

Demo: document processing service for automated translation
Python
4
star
25

example-personadeinteres

Example how to load mixed document/entity graphs to Aleph
Python
4
star
26

aleph-elasticsearch

Custom ElasticSearch configuration for Aleph
Shell
3
star
27

followthemoney-typepredict

Predict ftm types for string input data
Python
3
star
28

followthemoney-compare

followthemoney-compare
Jupyter Notebook
2
star
29

followthemoney-graph

Python
1
star