There are no reviews yet. Be the first to send feedback to the community and the maintainers!
frog
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --PICCL
A set of workflows for corpus building through OCR, post-correction and normalisationtimbl
TiMBL implements several memory-based learning algorithms.libfolia
FoLiA library for C++ticcltools
Tools for TICCLCLIN28_ST_spelling_correction
Scripts that were used for preparing and converting the Wikipedia documents that are part of the CLIN28 shared task on spelling correctionLamaEvents
Lama Events is a calendar application listing events in the near future. The events are detected and selected by a fully automatic procedure in the Dutch Twitter stream.uctodata
Datafiles for the tokenizer ucto.mbt
MBT: Memory-based tagger generation and tagging MBT is a memory-based tagger-generator and tagger in one.ticcutils
Ticcutils, a generic utility library shared by our software.wopr
Memory Based Word Predictor/Language Model http://ilk.uvt.nl/wopr/foliautils
Command-line utilities for working with the Format for Linguistic Annotation (FoLiA), powered by libfolia (C++), written by Ko van der Sloot (CLST, Radboud University)quoll
timblserver
TiMBL implements several memory-based learning algorithms. This is the server part.ICDAR2017-PostOCR-Ticcl
Wrapper scripts for processing ICDAR2017 PostOCR data given a TICCL ranked input listdimbl
Distributed Tilburg Memory Based Learnermbtserver
dialect2keywords
Webinterface designed to convert words in Dutch dialects ("dialectopgaven") into standard Dutch keywords ("vernederlandste trefwoorden").releasereport
paramsearch
Automated parameter optimisation for Timblfrogdata
Data for Frog, mandatorytoad
Toad: Trainer Of All Data, the Frog training collectionbp-som
BP-SOM: A hybrid of back-propagation learning in multi-layered perceptrons and self-organizing mapsLove Open Source and this site? Check out how you can help us