python-for-text-analysis
If you want to use Python for text analysis, this course is for you!
OpenDutchWordnet
This repo provides a Python module to work with Open Dutch WordNet. It was created using Python 3.4.
ba-text-mining
Hands-on material for the course text-mining BA, taught at VU Amsterdam
pepper
VU-CLTL Pepper/Nao Application Repository (Python 2)
wsd-dynamic-sense-vector
SpaCy-to-NAF
spaCy-to-naf converter
EventCoreference
Compares descriptions of events within and across documents to decide if they refer to the same events.
ThesisTips
A collection of tips for writing a PhD thesis
KafNafParserPy
Parser for KAF NAF files written in Python
ma-hlt-labs
Human Language Technology Notebooks for Lab sessions, Master Students
svm_wsd
Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the output is XML.
opinion_miner_deluxe
Opinion miner based on machine learning that can be trained on a corpus of KAF/NAF files
BabelfyReimplementation
Reimplementation of Babelfy (http://babelfy.org)
ma-ml4nlp-labs
Course code for "Machine Learning in NLP"
lexical_pattern_extractor
Lexical pattern extractor to generate patterns and target words from a seed list
entity-identification-from-scratch
Entity recognition and linking for historical documents in Dutch, developed within the Clariah+ project at VU Amsterdam
OntoTagger
Ontotagger inserts (semantic) labels into KAF representation on the basis of lemma or wordnet synset representations of text
vu-rm-pip3
Dutch NewsReader pipeline
ecbPlus
ECB+ and derived corpora
WordnetTools
Set of functions to use a wordnet in Wordnet-LMF format
ma-language-as-data-labs
This GitHub repository provides the Jupyter notebooks for the lab sessions of the VU Language-As-Data course.
semantic_space_navigation
event-resource-interoperability
morphosyntactic_parser_nl
Morphosyntactic parser for Dutch based on the Alpino parser
a-proof-zonmw
Detecting the functioning level of a patient from a free-text clinical note in Dutch.
multilingual-finegrained-entity-typing
multilingual-wiki-event-pipeline
This project aims to extract information about incidents of a particular type. This information consists of structured data on the incidents from Wikidata, as well as unstructured description and supporting sources from Wikipedia. We obtain information from Wikipedia in multiple languages.
FormatConversions
Several conversions between formats that are commonly used by our tools
BiographyNet
NLP tools and data used in BiographyNet
StoryTeller
Toolkit to query the NewsReader KnowledgeStore with SPARQL and create a JSON story
cltl-ma-thesis
(LaTeX) MA thesis template
HumanLikeEL
Human-Like Entity Linking using Contextual knowledge
WordNetSimilarity
Programs and scripts that test performance of WordNet similarity measurements using different settings
Target-Spans-Detection
Target_Spans_HateXplain
FrameNet-annotation-tool
Python-based command-line tool for FrameNet annotation
MultiWordTagger
Reads a KAF or NAF file to detect multiword sequences of terms according to WordNet
aproof-icf-classifier
Classifier that can read medical reports and assign a functional level classification following the WHO ICF classification scheme.
EL-long-tail-phenomena
Systematic study of long tail phenomena in the task of entity linking
SoNar2Naf
Converter from Folia to NAF
vua-wsd-sem2015
System for the CLTL participation in SemEval2015 task 13: multilingual all-words sense disambiguation and entity linking
frame-annotation-tool
Annotation tool in JavaScript and Node.js for annotation of frames in Dutch documents.
machine-learning-for-nlp-course
Releases of notebooks for students participating in Machine Learning for NLP
MoreIsNotAlwaysBetter
BiographicalDataModels
lexical-negation-dictionary
ma-communicative-robots
Communication robots
multilingual_factuality
NAF-HeidelTime
NAF (KAF) Wrapper around HeidelTime
reference-framing-perspective
Workshop website
NewsAcquisition
Analysis and acquisition of news data from the Signal Media corpus and other news collections
tokeniser-opennlp
Tokenizer and sentence splitter based on opennlp
WordNetMapper
This repo maps lexical keys, offsets, or ilidefs from one WordNet version to another (versions ["16","17","171","20","21","30"]). It uses the index.sense files from WordNet (http://wordnet.princeton.edu/) and the automatically generated mappings between WordNet offsets (http://nlp.lsi.upc.edu/tools/download-map.php).
a-proof
Tools for the text classification of clinical notes in electronic patient records
LSTM-WSD
ELBaselines
This repo aims to create baseline results for entity linking by running a text against state-of-the-art entity linking systems, using their most standard configuration.
nlpp
Script to install NLP pipeline from its components.
DFNDataReleases
micro-portraits
voc-missives
NER and format conversion scripts for the Generale Missiven
Image-Specificity
Reimplementation of Jas & Parikh's (2015) image specificity metric, using word embeddings.
TextToCoNLL
KafAnnotator
Standalone program to annotate KAF files
dutch-nlp-tools
Overview of data sets and resources for Dutch
NAF-4-Development
FrameNetNLTK
SemanticOverfitting
mergeAnnotationCAT
Script to merge files annotated by different annotators (on the same task) to better explore (dis-)agreement
Mining-Ministers
CuriousMachine
Investigations on how to build a curious machine based on NLP technologies
GunViolenceCorpus
News2RDF
NAFFoLiAPy
Library for converting between FoLiA and NAF
MFS_classifier
This repo contains the scripts to attempt to remove the mfs bias from a WSD system.
hpsp
Experiments with hyperspace models for selectional preference
coreference-evaluation
Evaluation package for event coreference using the reference-scorer
SimpleTagger
GRaSP
ceopathfinder
Finds a path of circumstantial relations between events on the basis of the CircumstantialEventOntology
run_open-sesame
pepper_tensorflow
This is the repository for Pepper modules and external services. Use Python 3
entity-link-postprocess
DutchDescriptions
Dutch descriptions for the Flickr30K validation and test data, plus a cross-lingual comparison tool.
LongTailAnnotation
Annotation tool for data2text approaches
cltl.github.io
CLTL organization site
TeamRobot
vua_factuality
LongTailIdentity
Generating profiles of long tail identities from text
a-proof-project
PythonVirtuosoInterface
Simple interface to SPARQL for Python 2 and 3 scripts
rfp_corpus_collection
Collect a referentially grounded corpus for the 1st workshop on Reference, Framing, and Perspective (LREC-COLING 2024)
pwgc
Tool to load the Princeton WordNet Gloss Corpus
Wikipedia_langlinks
relink
RElinking with CONtext - Entity linking module
KafKybot
Extracts tuples from KAF file using profiles
ma-applied-tm-course
GitHub repository supporting the Applied TM course as part of the VU Text Mining Master's
SPT_crowd_data_analysis
Code to analyze crowd annotations of property-concept pairs in terms of their relations.
inner-outer-coreference
A repository for investigating the role of common ground in datasets of social dialogue in coreference resolution tasks
ma-course-subjectivity-mining
Repository for the Subjectivity mining course
BERT-WSD
VUA_pylib
Set of functions in python, including feature extractors, common functions, NAF/KAF manipulation