• Stars
    8
  • Rank 2,099,232 (Top 42 %)
  • Language
    Python
  • License
    MIT License
  • Created over 5 years ago
  • Updated almost 2 years ago

Repository Details

🇩🇪 Preprocess German texts to do some serious natural-language processing.

More Repositories

1

Sublime-Text-Plugins-for-Frontend-Web-Development

Collection of plugins for Frontend Web Development
1,134 stars
2

react-native-onboarding-swiper

🛳 Delightful onboarding for your React-Native app
JavaScript
927 stars
3

clean-text

🧹 Python package for text cleaning
Python
909 stars
4

split-folders

🗂 Split folders with files (e.g. images) into training, validation and test (dataset) folders
Python
403 stars
5

pdf-scripts

📑 Scripts to repair, verify, OCR, compress, wrangle, crop (etc.) PDFs
Shell
55 stars
6

text-classification-keras

📚 Text classification library with Keras
Python
52 stars
7

frag-den-staat-app

📱 iOS & Android App for FragDenStaat, the German FOI portal
JavaScript
25 stars
8

hgmaassen-retweets

Hans-Georg Maaßen and the Retweets
Jupyter Notebook
23 stars
9

brunch-on-speed

🍽 Skeleton for Brunch for a long-scroll, single, static Web page
HTML
18 stars
10

ulmfit-for-german

👩‍🏫 Pre-trained German Language Model with sub-word tokenization for ULMFiT
Jupyter Notebook
16 stars
11

hyperhyper

🧮 Python package to construct word embeddings for small data using PMI and SVD
Python
15 stars
12

ptf-kommentare

Notes & code for my Prototype Fund project about machine learning, news comments, and language change
Jupyter Notebook
11 stars
13

youdata

🇪🇺 Because it's about you and your data. (discontinued)
JavaScript
10 stars
14

eesti-kelt

🇪🇪 English to Estonian dictionary with all three important cases (discontinued)
JavaScript
9 stars
15

german-abbreviations

📖 A list of 4262 German abbreviations from Wiktionary
Python
9 stars
16

get-retries

Adding retries to Requests.get() with exponential backoff
Python
6 stars
17

wikipedia-edits-verified-accounts

Get all revisions and recent changes for verified German Wikipedia users
Python
6 stars
18

german-lemmatizer

✂️ Python package (using a Docker image under the hood) to lemmatize German texts.
Python
6 stars
19

deep-plots

📉 Visualize your Deep Learning training in static graphics
Python
5 stars
20

scrape-gutenberg-de

Scrape all books from Projekt Gutenberg-DE
Python
5 stars
21

masters-thesis

Master's Thesis: Conversation-aware Classification of News Comments
Jupyter Notebook
5 stars
22

rechte-gewalt

Mapping of right-wing incidents in Germany
Python
4 stars
23

get-wayback-machine

Fetch a URL via the latest Wayback Machine snapshot
Python
4 stars
24

most-frequent-words-2019-german-eu-election-programs

Visualization of the most frequent words in the German 2019 EU election programs
Jupyter Notebook
4 stars
25

MDMA

Make Deep Art Accessible
Python
3 stars
26

sparse-svd-benchmark

Sparse Truncated SVD Benchmark (Python)
Jupyter Notebook
3 stars
27

mw-category-members

Using MediaWiki's API, retrieve pages that belong to a given category
Python
2 stars
28

btw21

Visualization of the most frequent words in the German federal election in 2021
Jupyter Notebook
2 stars
29

nsu-urteil

Most frequent sentences in the written judgment against the NSU
Jupyter Notebook
2 stars
30

offene-register-text-analysis

Text analysis of German companies' names and their associated officers
Jupyter Notebook
2 stars
31

oauth-proxy

A simple proxy for OAuth to hide the client secret.
JavaScript
1 star
32

utils

Bash scripts and dotfiles
Shell
1 star
33

german-lemmatizer-docker

✂️ Combining the power of several tools for lemmatization of German text
Python
1 star
34

autobahn

Playing around with data about broken bridges on the German Autobahn
R
1 star
35

tweets-with-images

Get all tweets with images from a given Twitter user
Python
1 star
36

00-dokku-default

Add a dummy lexicographically first site to a Dokku instance to act as default site
HTML
1 star
37

nlp

Solutions for a course in NLP in Winter 2014/15 @ OVGU, Magdeburg
Python
1 star
38

universal-style-transfer-pytorch

Universal Style Transfer in PyTorch (improved)
Python
1 star
39

hpi-kurs-zuordnung

Determine optimal specializations and course assignments @ HPI
JavaScript
1 star
40

ifg.jfilter.de

Blog for my investigative reporting using German FOI laws
Shell
1 star
41

lobbyalarm

🚨 Browser Plugin to Highlight Lobbyism (in Germany)
Python
1 star
42

blog-examples

Example code of my blog posts
R
1 star