Research Group of Language Technology, NYTK (@nytud)

Top repositories

1

emtsv

e-magyar text processing system -- inter-module communication via tsv + REST API
Python
26
star
2

hadifogoly-adatbazis

Russian-Hungarian transcription of the database of Hungarian prisoners of war
Python
22
star
3

quntoken

Hungarian tokenizer.
C++
14
star
4

emMorph

Perl
14
star
5

NYTK-NerKor

The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.
Shell
14
star
6

emmorphpy

A wrapper, lemmatizer, and REST API implemented in Python for the emMorph (Humor) Hungarian morphological analyzer
Python
10
star
7

hunlp-GATE

Lang_Hungarian - a GATE plugin containing Hungarian NLP tools as GATE processing resources
C
8
star
8

HuLU

Hungarian Language Understanding Benchmark Kit
5
star
9

panmorph

Tagsets and description of Hungarian morphological analysers.
4
star
10

machine-translation

Dockerfile
4
star
11

HunTag3

A sequential tagger for NLP using Maximum Entropy Learning and Hidden Markov Models
Lex
3
star
12

neural-models

3
star
13

emLam

Preprocessing scripts for Hungarian Language Modeling
Python
3
star
14

emdeppy

A wrapper and REST API implemented in Python for emDep (Bohnet parser a.k.a. Mate Tools)
Python
2
star
15

HAPP

1
star
16

emIOBUtils

An IOB format converter and corrector
Io
1
star
17

HuCoPA

Hungarian Choice of Plausible Alternatives Corpus
1
star
18

bert_coref_hu

Python
1
star
19

hunspellpy

Hunspell integrated with the xtsv framework
Python
1
star
20

emgateconv

Python
1
star
21

xtsv

A generic framework for inter-module communication based on a TSV-style format, with a REST API, implemented in Python
Python
1
star
22

emterm

Python
1
star
23

parallelbible

TSV files of the Parallel Bible Reader
1
star
24

anonymizer_hu

The Hungarian anonymization tool for CURLICAT
Python
1
star
25

HuCOLA

Hungarian Corpus of Linguistic Acceptability
1
star
26

HuWS

Hungarian Winograd Schemes
1
star
27

HuSST

Hungarian version of the Stanford Sentiment Treebank
1
star
28

HuWiC

Hungarian Word-in-Context Corpus
Jupyter Notebook
1
star
29

embedding-demo

visualization for word2vec datasets
Python
1
star
30

e-magyar.hu

e-magyar.hu site stuff
PHP
1
star
31

purepospy

Python wrapper for PurePos
Python
1
star
32

korap_docker

Dockerfile based on KorAP-vagrant (github.com/KorAP/KorAP-Vagrant)
Dockerfile
1
star
33

emudpipe

A UDPipe wrapper for e-magyar (xtsv)
Python
1
star