There are no reviews yet. Be the first to send feedback to the community and the maintainers!
python-for-text-analysis
If you want to use Python for text analysis, this course is for you!OpenDutchWordnet
This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.ba-text-mining
Hands-on material for the course text-mining BA, taught at VU Amsterdampepper
VU-CLTL Pepper/Nao Application Repository (Python 2)wsd-dynamic-sense-vector
SpaCy-to-NAF
spaCy-to-naf converterEventCoreference
Compares descriptions of events within and across documents to decide if they refer to the same events.ThesisTips
A collection of tips for writing a PhD thesisKafNafParserPy
Parser for KAF NAF files written in Pythonma-hlt-labs
Human Language Technology Notebooks for Lab sessions, Master Studentssvm_wsd
Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the output XMLopinion_miner_deluxe
Opinion miner based of Machine Learning that can be trained on a corpus of KAF/NAF filesBabelfyReimplementation
Reimplementation of Babelfy (http://babelfy.org)ma-ml4nlp-labs
Course code for "Machine Learning in NLP"lexical_pattern_extractor
Lexical pattern extractor to generate patterns and target words from a seed listentity-identification-from-scratch
Entity recognition and linking for historical documents in Dutch, developed within the Clariah+ project at VU AmsterdamOntoTagger
Ontotagger inserts (semantic) labels into KAF representation on the basis of lemma or wordnet synset representations of textvu-rm-pip3
Dutch NewsReader pipelineecbPlus
ECB+ and derived corporaWordnetTools
Set of functions to use a wordnet in Wordnet-LMF formatma-language-as-data-labs
This Github provides the Jupyter notebooks for the Lab sessions of the VU Language-As-Data course.semantic_space_navigation
event-resource-interoperability
morphosyntactic_parser_nl
Morphosyntactic parser for Dutch based on the Alpino parsera-proof-zonmw
Detecting the functioning level of a patient from a free-text clinical note in Dutch.multilingual-finegrained-entity-typing
FormatConversions
Several conversions between formats that are commonly used by our toolsBiographyNet
NLP tools and data used in BiographyNetStoryTeller
Toolkit to query the NewsReader KnowledgeStore with SPARQL and create a JSON storycltl-ma-thesis
(LaTeX) MA thesis templateHumanLikeEL
Human-Like Entity Linking using Contextual knowledgeWordNetSimilarity
Programs and scripts that test performance of WordNet similarity measurements using different settingsTarget-Spans-Detection
Target_Spans_HateXplainFrameNet-annotation-tool
Python-based command-line tool for FrameNet annotationMultiWordTagger
Reads a KAF or NAF file to detect multiword sequences of terms according the WordNetaproof-icf-classifier
Classifier that can read medical reports and assign a functional level classification following the WHO ICF classification scheme.EL-long-tail-phenomena
Systematic study of long tail phenomena in the task of entity linkingPostmaVossenGWC2014
This repository provides the code to replicate the results from PostmaVossenGWC2014SoNar2Naf
Converter from Folia to NAFvua-wsd-sem2015
System for the CLTL participation in SemEval2015 task 13: multilingual all-words sense disambiguation and entity linkingframe-annotation-tool
Annotation tool in JavaScript and Node.js for annotation of frames in Dutch documents.machine-learning-for-nlp-course
releases of notebooks for students participating in machine learning for nlpMoreIsNotAlwaysBetter
BiographicalDataModels
lexical-negation-dictionary
ma-communicative-robots
Communication robotsmultilingual_factuality
NAF-HeidelTime
NAF (KAF) Wrapper around HeidelTimereference-framing-perspective
Workshop websiteNewsAcquisition
Analysis and acquisition of news data from the Signal Media corpus and other news collectionstokeniser-opennlp
Tokenizer and sentence splitter based on opennlpWordNetMapper
This repo provides the possibility to map between lexical keys | offsets | ilidefs from one wordnet version to the other ["16","17","171","20","21","30"]. It makes use of the index.sense files from WordNet (http://wordnet.princeton.edu/) and the automatically generated mappings between WordNet offsets (http://nlp.lsi.upc.edu/tools/download-map.php)a-proof
Tools for the text classification of clinical note in electronic patient recordsLSTM-WSD
ELBaselines
This repo is aimed to create baseline results for Entity Linking, by running a text against the state-of-the-art systems for entity linking, using their most standard configuration.nlpp
Script to install NLP pipeline from its components.DFNDataReleases
micro-portraits
voc-missives
NER and format conversion scripts for the Generale MissivenImage-Specificity
Reimplementation of Jas & Parikh's (2015) image specificity metric, using word embeddings.TextToCoNLL
KafAnnotator
Standalone program to annotate KAF filesdutch-nlp-tools
Overview of data sets and resources for DutchNAF-4-Development
FrameNetNLTK
SemanticOverfitting
mergeAnnotationCAT
Script to merge files annotated from different annotators (on the same task) to better explore (dis-)agreementMining-Ministers
CuriousMachine
Investigations on how to build a curious machine based on NLP technologiesGunViolenceCorpus
News2RDF
NAFFoLiAPy
Library for converting between FoLiA and NAFMFS_classifier
This repo contains the scripts to attempt to remove the mfs bias from a WSD system.hpsp
Experiments with hyperspace models for selectional preferencecoreference-evaluation
Evaluation package for event coreference using the reference-scorerSimpleTagger
GRaSP
ceopathfinder
Finds a path of circumstantial relations between events on the basis of the CircumstantialEventOntologyrun_open-sesame
pepper_tensorflow
This is the repository for Pepper modules and external services. Use Python 3entity-link-postprocess
DutchDescriptions
Dutch descriptions for the Flickr30K validation and test data, plus a cross-lingual comparison tool.LongTailAnnotation
Annotation tool for data2text approachescltl.github.io
CLTL organization siteTeamRobot
vua_factuality
LongTailIdentity
Generating profiles of long tail identities from texta-proof-project
PythonVirtuosoInterface
Simple interface to SPARQL for python 2 and 3 scriptsrfp_corpus_collection
Collect a referentially grounded corpus for the 1st workshop on Reference, Framing, and Perspective (LREC-COLING 2024)pwgc
tool to load the princeton wordnet gloss corpusWikipedia_langlinks
relink
RElinking with CONtext - Entity linking moduleKafKybot
Extracts tuples from KAF file using profilesma-applied-tm-course
Github Repository supporting the Applied TM Course as part of the VU Text Mining MastersSPT_crowd_data_analysis
Code to analyze crowd annotations of property-concept pairs in terms of their relations.inner-outer-coreference
A repository for investigating the role of common ground in datasets of social dialogue in coreference resolution tasksma-course-subjectivity-mining
Repository for the Subjectivity mining courseBERT-WSD
VUA_pylib
Set of functions in python, including feature extractors, common functions, NAF/KAF manipulationLove Open Source and this site? Check out how you can help us