@Vicomtech

Top repositories

1

hate-speech-dataset

Hate speech dataset from Stormfront forum manually labelled at sentence level.
164
star
2

DMD-Driver-Monitoring-Dataset

DMD - Driver Monitoring Dataset
Python
59
star
3

video-content-description-VCD

Video Content Description (VCD) is a schema, API and set of tools to produce semantically rich labels from multi-sensorial data series.
Python
56
star
4

STDG-evaluation-metrics

Standardised Metrics and Methods for Synthetic Tabular Data Evaluation
Jupyter Notebook
28
star
5

ArchABM

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces
R
15
star
6

itzuli-api-lib

Itzuli® Machine Translation Engine API libraries
Go
10
star
7

d-EVD_dual-Electric-Vehicle-Dataset

d-EVD-dual-electric-vehicle-dataset
9
star
8

weblabel

weblabel
8
star
9

NUBes-negation-uncertainty-biomedical-corpus

Repository of the NUBes corpus
Python
7
star
10

serverless-mlperf

This repo aims to benchmark Amazon AWS DNN performance with Caffe, TensorFlow and OpenVINO models, using OpenCV and OpenVINO IE as inference backend engines.
Python
6
star
11

ClinIDMap

ClinIDMap
Python
4
star
12

RailSceneSet

RailSceneSet Dataset
4
star
13

CAPTAIN-Elderly-clustering-and-evolution-analysis

CAPTAIN - Elderly clustering and evolution analysis
Python
2
star
14

tando

TANDO is a corpus for training and evaluation of document-level machine translation models in Basque-Spanish.
2
star
15

Dataset-of-2D-polygons-for-Additive-Manufacturing

Dataset of 2D polygons for Additive Manufacturing
Python
1
star
16

GRACE-Benchmark

GRACE-Benchmark
1
star
17

SOSDaR24

Synthetic Open Sensor Dataset for Rail 2024
1
star
18

BaSCo-Corpus

BaSCo Corpus
1
star
19

esport-corpus

ES-Port Corpus. Spontaneous spoken human-human dialogue corpus consisting of transcribed dialogues from calls to the technical customer support service of a Spanish telecom operator for companies. The corpus has been anonymised and annotated at various linguistic and acoustic-related extralinguistic levels.
1
star
20

ASVspoophone

The ASVspoophone corpus is the telephonic version of the ASV Spoof 2019 corpus found at https://www.asvspoof.org It contains the telephonic versions of the audios used for the countermeasure (CM) ASV Spoof 2019 challenge, which have been created by transferring each of them through real land-land, mobile-land and land-mobile telephonic channels. The results are the corresponding 8 kHz 8 bit A-Law versions of the originial audios, which can be used to train anti-spoofing systems that will be used on real telephonic scenarios such as call and contact centres.
1
star
21

dataset-machine-tool-wear

dataset_machine_tool_wear
1
star
22

synthetic-neu-seg-images-via-stable-diffusion

This dataset accompanies the paper "Latent Diffusion Models to Enhance the Performance of Visual Defect Segmentation Networks in Steel Surface Inspection".
1
star
23

CNC-Assist

CNC-Assist
1
star
24

DiverSim

DiverSim is an innovative simulating tool to generate synthetic pedestrian data with a focus on diversity and inclusion.
Python
1
star