MTA-PPKE Hungarian Language Technology Research Group (@ppke-nlpg)
  • Stars 62
  • Global Org. Rank 72,202 (Top 23 %)
  • Registered about 13 years ago
  • Most used languages
    Python 66.7 %
    Java 13.3 %
    HTML 13.3 %
    Perl 6.7 %
  • Location 🇭🇺 Hungary
  • Country Total Rank 534
  • Country Ranking
    Perl 27
    Java 103
    Python 121
    HTML 291

Top repositories

1

purepos

PurePos is an open source hybrid morphological tagger.
Java
15 stars
2

boilerplateResults

Results of boilerplate removal algorithms
Python
8 stars
3

pywnxml

Python3 API for WordNet XML (Hungarian WordNet / BalkaNet / VisDic format)
Python
5 stars
4

manocska

Manócska: an integrated database of verb argument frames
Python
4 stars
5

purepos-python3

PurePos rewritten in Python3
Python
3 stars
6

emmorphpy

A wrapper, lemmatizer, and REST API implemented in Python for the emMorph (Humor) Hungarian morphological analyzer
Python
3 stars
7

AraSum

Arabic Summarization Corpus
2 stars
8

SS05

The original SS05 algorithm from Hong Shen and Anoop Sarkar used in the paper 'Voting Between Multiple Data Representations for Text Chunking'
Perl
1 star
9

fastText_factored-cbow

HTML
1 star
10

AnaGramma-Parser

A psycholinguistically motivated parsing model
Python
1 star
11

nom-or-not

Algorithm for case disambiguation
Python
1 star
12

gut-besser-chunker

The program used in the paper 'Gut, Besser, Chunker – Selecting the best models for text chunking with voting' by Balázs Indig and István Endrédy
Python
1 star
13

purepospy

Python wrapper for PurePos
Java
1 star
14

CleanPortalEval

Boilerplate removal test set for portals (multiple sites from the same domain)
HTML
1 star
15

commoncrawl-downloader

Simple Python command-line tools for retrieving a list of URLs and specific files in bulk
Python
1 star
16

less-is-more

The program used in the paper 'Less is More, More or Less... – Finding the Optimal Threshold for Lexicalisation in Chunking' by Balázs Indig
Python
1 star