PyExPool
Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architectureLFR-Benchmark_UndirWeightOvp
Extended version of the Lancichinetti-Fortunato-Radicchi Benchmark for Undirected Weighted Overlapping networks to evaluate clustering algorithms using generated ground-truth communitiesJUST
LBSN2Vec
Code release for LBSN2VecFlashback_code
HINGE_code
NodeSketch
NodeSketch: Highly-Efficient Graph Embeddings via Recursive SketchingTRank
Ranking Entity Types using the Web of Databench-vldb20
RETA_code
ActiveLink
Deep active learning framework for link prediction in knowledge graphHistoSketch
Implementation of HistoSketch and D2HistoSketch in MATLABclubmark
Clubmark: a Parallel Isolation Framework for Benchmarking and Profiling of Clustering (Community Detection) Algorithms Considering Overlaps (Covers)pytrec_eval
A library to evaluate TREC-like runs with TREC-like qrels. Implements similarity of rankings, ttest between runs etc…PyCABeM
Python Benchmarking Framework for the Clustering Algorithms Evaluation: networks generation and shuffling; failover execution and resource consumption tracing (peak RAM RSS, CPU, ...); evaluation of Modularity, conductance, NMI and F1 Score for overlapping communitiesGenConvNMI
Generalized Conventional Mutual Information (GenConvMI) - NMI for overlapping (soft, fuzzy) clusters (communities), compatible with standard NMI, pure C++ version (single executable)MARTA
xmeasures
Extremely fast evaluation of the extrinsic clustering measures: various (mean) F1 measures and Omega Index (Fuzzy Adjusted Rand Index) for the multi-resolution clustering with overlaps/covers, standard NMI, clusters labelingTSM-Bench
Comprehensive Benchmark for Time Series Database Systemsfashion_nlp_v2
FashionBrain D2.1: Named Entity Recognition and Linking Methodsorbits
PyNetConvert
Network (Graph) Format Converter: RCG, Pajek, Metis, NSL (NCol, SNAP, ...), MathlabStaTIX
Statistical Type Inference (both fully automatic and semi supervised) for RDF datasetsdaoc
DAOC (Deterministic and Agglomerative Overlapping Clustering algorithm): Stable Clustering of Large NetworksGraphEmbEval
Graph (network) embeddings evaluation framework via classification, gram martix construction for links predictionfashionNLP
pSCAN
pSCAN: Fast and Exact Structural Graph Clustering (with overlaps)sanaphor
2018-Internship-TableDetection
This repository contains the pipeline for table detection/extraction from 'Bundesarchive' documents.Wiki2Prop
The companion material for the Wiki2Prop PaperOpenCrowd
WDCFramework
clone of https://www.assembla.com/spaces/commondata/subversion/source/HEAD/WDCFramework/trunkdaor
DAOR Parameter-free Embedding Framework for Large Graphs (Networks)CORAD
CORAD: Correlation-Aware Compression of Massive Time Series using Sparse Dictionary Codingcardinal
Source Code and Companion Material of the Non-Parametric Class Completeness EstimatorsTaxoComplete
his is the repositotry of TaxoComplete: Self-Supervised Taxonomy Completion Leveraging Position-Enhanced Semantic Matchingentity-disambiguation-data-ecir2013
2016-armatweet
NLP components of ArmaTweet devoted to converting tweets into quads of the form (`subject`, `predicate`, `object`, `location`) where `subject`, `object`, and `location` are DBpedia resources, and `predicate` is a WordNet synset.axel
Project for exploratory search on scientific articlesthesis_template
Latex template for XI BSc/MSc thesishirecs
High Resolution Hierarchical Clustering with Stable StateNetHash
NetHash algorithm from IJCAI 2018typhon
Deep Learning framework that trains a single model using multiple, heterogeneous datasets leveraging parallel transfer, strictly enforcing feature generalization and even preventing overfittinginFlux
Task Flow Controlwd-graph
A toolset to work with the Wikidata GraphWDCTools
timesvd_vc
preposition-data-cikm2014
Datasets with preposition corrections for CIKM 2014 paperSNF_disambiguation
vadetis
pgpr
seer
resmerge
Resolution levels clustering merger with filtering and clusters deduplication. Flattens a hierarchy/list of multiple resolutions levels (clusterings) into the single flat clustering (collection), synchronizing the node base and deduplicating.typhon_exp
Experiments for the paper: "Typhon: Parallel Transfer on Heterogeneous Datasets for Cancer Detection in Computer-Aided Diagnosis"cdrec
ase-lab
Lab of Time Series Database SystemsReVival-Code
nif-entity-linking-webservice
interval_index
A full-set of data structures and experimental data for CINTIA paperCDTool
bench-vldb20_full
2019_kais-bench
oslom2
Sources of the OSLOM2 (v2.5) clustering algorithm with slightly extended I/O for the benchmarking under Clubmarktag-recommendation-data-iswc2012
Dataset for the " Tag recommendation" paper from ISWC 2012scientific_NER_dataset
Judged dataset for NER in scientific documentsscala_utils
Few Scala utils...CGGC
RG (Randomized Greedy clustering), CGGC_RG (Core Groups Graph ensemble Clustering) or CGGCi_RG (Core Groups Graph ensemble Clustering Iterative) algorithmsBonusBar
BonusBar Django project. An HCI prototype for worker retention.HIT-Scheduler
Opensource, HIT Scheduling backend for Amazon Mechanical Turk.libMoji
The implementation of Moji VisualizationsWikidataSectionLinks
JOINER_code
TInfES
Type Inference Evaluation Scripts & Accessory Apps (used for the StaTIX benchmarking)sds2020_web_table_annotation
SDS2020 - Annotating Web Tables through Knowledge Bases: A Context-Based ApproachWikipedia30
A collections of 30 random Wikipedia pages manually annotated with entities.SMA-17s_CommunityDetection
Community detection programming exercises for the SMA-17s courseASE-lab-2023
Time Series Database System Lab 2023Love Open Source and this site? Check out how you can help us