There are no reviews yet. Be the first to send feedback to the community and the maintainers!
reldi-tagger
A tagger and lemmatiser for Croatian, Serbian and Slovene.geobert
csmtiser
A tool for text normalisation via character-level machine translationtweetcat
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regionsreldi-lib
vejice
megahr-crossling
Predictions on concreteness and imageability of words in 77 languagesSlovene_ASR_e2e
Automatic Speech Recognition toolreldi-tokeniser
A two-mode (standard, nonstandard) tokeniser for South Slavic languagesmte-msd
MULTEXT-East morphosyntactic specificationsjanes-ner
NER system for South Slavic languagesbabushka-bench
Benchmarking NLP tools on Slovene, Croatian and Serbiantweetgeo
A Tool for Collecting, Visualising and Inferring from Geo-encoded Linguistic DataTEI-schema
Recommended TEI schema for CLARIN.SI resources, cf. also https://clarinsi.github.io/TEI-schema/parlaspeech
Code for bootstrapping ASR datasets from parliamentary recordings and transcriptsreldi-api
Slovene_NMT
Neural Machine Translation toolslovene_syllable_splitter
A rule-based syllable splitter for Slovene that takes an input word and returns a list of syllables in the word, e.g. predsedovati -> ['pred', 'se', 'do', 'va', 'ti']; decembrskega -> ['de', 'cem', 'brs', 'ke', 'ga'].reldi-depparse
classla-spoken
jos2ud
cordex
Obeliks4J
wikitalk-extractor
A corpus extractor from the Wikipedia page and user talk pagesbenchich
BENCHić - the benchmark for Bosnian, Croatian, Montenegrin, Serbian (and friends)sb-abbr
NLP dataset of the Slovenian Biographydrevesnik
Web portal for searching and displaying syntacically annotated corporaLove Open Source and this site? Check out how you can help us