python-for-text-analysis
If you want to use Python for text analysis, this course is for you!OpenDutchWordnet
This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.pepper
VU-CLTL Pepper/Nao Application Repository (Python 2)ba-text-mining
Hands-on material for the course text-mining BA, taught at VU Amsterdamwsd-dynamic-sense-vector
SpaCy-to-NAF
spaCy-to-naf converterEventCoreference
Compares descriptions of events within and across documents to decide if they refer to the same events.ThesisTips
A collection of tips for writing a PhD thesisKafNafParserPy
Parser for KAF NAF files written in Pythonsvm_wsd
Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the output XMLma-hlt-labs
Human Language Technology Notebooks for Lab sessions, Master Studentsopinion_miner_deluxe
Opinion miner based of Machine Learning that can be trained on a corpus of KAF/NAF filesma-ml4nlp-labs
BabelfyReimplementation
Reimplementation of Babelfy (http://babelfy.org)lexical_pattern_extractor
Lexical pattern extractor to generate patterns and target words from a seed listentity-identification-from-scratch
Entity recognition and linking for historical documents in Dutch, developed within the Clariah+ project at VU AmsterdamOntoTagger
Ontotagger inserts (semantic) labels into KAF representation on the basis of lemma or wordnet synset representations of textvu-rm-pip3
Dutch NewsReader pipelineecbPlus
ECB+ and derived corporaWordnetTools
Set of functions to use a wordnet in Wordnet-LMF formatma-language-as-data-labs
This Github provides the Jupyter notebooks for the Lab sessions of the VU Language-As-Data course.event-resource-interoperability
semantic_space_navigation
morphosyntactic_parser_nl
Morphosyntactic parser for Dutch based on the Alpino parsera-proof-zonmw
Detecting the functioning level of a patient from a free-text clinical note in Dutch.multilingual-finegrained-entity-typing
multilingual-wiki-event-pipeline
This project aims to extract information about incidents of a particular type. This information consists of structured data on the incidents from Wikidata, as well as unstructured description and supporting sources from Wikipedia. We obtain information from Wikipedia in multiple languages.EL-long-tail-phenomena
Systematic study of long tail phenomena in the task of entity linkingFormatConversions
Several conversions between formats that are commonly used by our toolsBiographyNet
NLP tools and data used in BiographyNetStoryTeller
Toolkit to query the NewsReader KnowledgeStore with SPARQL and create a JSON storycltl-ma-thesis
(LaTeX) MA thesis templateTarget-Spans-Detection
Target_Spans_HateXplainHumanLikeEL
Human-Like Entity Linking using Contextual knowledgeWordNetSimilarity
Programs and scripts that test performance of WordNet similarity measurements using different settingsFrameNet-annotation-tool
Python-based command-line tool for FrameNet annotationMultiWordTagger
Reads a KAF or NAF file to detect multiword sequences of terms according the WordNetaproof-icf-classifier
Classifier that can read medical reports and assign a functional level classification following the WHO ICF classification scheme.PostmaVossenGWC2014
This repository provides the code to replicate the results from PostmaVossenGWC2014SoNar2Naf
Converter from Folia to NAFvua-wsd-sem2015
System for the CLTL participation in SemEval2015 task 13: multilingual all-words sense disambiguation and entity linkingframe-annotation-tool
Annotation tool in JavaScript and Node.js for annotation of frames in Dutch documents.machine-learning-for-nlp-course
releases of notebooks for students participating in machine learning for nlpMoreIsNotAlwaysBetter
lexical-negation-dictionary
BiographicalDataModels
ma-communicative-robots
Communication robotsmultilingual_factuality
NAF-HeidelTime
NAF (KAF) Wrapper around HeidelTimeNewsAcquisition
Analysis and acquisition of news data from the Signal Media corpus and other news collectionstokeniser-opennlp
Tokenizer and sentence splitter based on opennlpreference-framing-perspective
Workshop websiteWordNetMapper
This repo provides the possibility to map between lexical keys | offsets | ilidefs from one wordnet version to the other ["16","17","171","20","21","30"]. It makes use of the index.sense files from WordNet (http://wordnet.princeton.edu/) and the automatically generated mappings between WordNet offsets (http://nlp.lsi.upc.edu/tools/download-map.php)a-proof
Tools for the text classification of clinical note in electronic patient recordsELBaselines
This repo is aimed to create baseline results for Entity Linking, by running a text against the state-of-the-art systems for entity linking, using their most standard configuration.LSTM-WSD
nlpp
Script to install NLP pipeline from its components.DFNDataReleases
LongTailAnnotation
Annotation tool for data2text approachesmicro-portraits
KafAnnotator
Standalone program to annotate KAF filesImage-Specificity
Reimplementation of Jas & Parikh's (2015) image specificity metric, using word embeddings.voc-missives
NER and format conversion scripts for the Generale MissivenTextToCoNLL
dutch-nlp-tools
Overview of data sets and resources for DutchNAF-4-Development
FrameNetNLTK
SemanticOverfitting
mergeAnnotationCAT
Script to merge files annotated from different annotators (on the same task) to better explore (dis-)agreementMining-Ministers
hpsp
Experiments with hyperspace models for selectional preferenceCuriousMachine
Investigations on how to build a curious machine based on NLP technologiesNAFFoLiAPy
Library for converting between FoLiA and NAFGunViolenceCorpus
News2RDF
MFS_classifier
This repo contains the scripts to attempt to remove the mfs bias from a WSD system.coreference-evaluation
Evaluation package for event coreference using the reference-scorerSimpleTagger
GRaSP
ceopathfinder
Finds a path of circumstantial relations between events on the basis of the CircumstantialEventOntologyrun_open-sesame
pepper_tensorflow
This is the repository for Pepper modules and external services. Use Python 3entity-link-postprocess
DutchDescriptions
Dutch descriptions for the Flickr30K validation and test data, plus a cross-lingual comparison tool.cltl.github.io
CLTL organization siteTeamRobot
LongTailIdentity
Generating profiles of long tail identities from textvua_factuality
a-proof-project
PythonVirtuosoInterface
Simple interface to SPARQL for python 2 and 3 scriptsrfp_corpus_collection
Collect a referentially grounded corpus for the 1st workshop on Reference, Framing, and Perspective (LREC-COLING 2024)Wikipedia_langlinks
pwgc
tool to load the princeton wordnet gloss corpusrelink
RElinking with CONtext - Entity linking moduleKafKybot
Extracts tuples from KAF file using profilesma-applied-tm-course
Github Repository supporting the Applied TM Course as part of the VU Text Mining MastersSPT_crowd_data_analysis
Code to analyze crowd annotations of property-concept pairs in terms of their relations.inner-outer-coreference
A repository for investigating the role of common ground in datasets of social dialogue in coreference resolution tasksma-course-subjectivity-mining
Repository for the Subjectivity mining courseBERT-WSD
Love Open Source and this site? Check out how you can help us