Language and Voice Lab (@cadia-lvl)

Top repositories

1

punctuation-prediction

Support tools for punctuation and boundary detection for ASR output.
Python
57
star
2

ice-asr

An automatic speech recognition environment for Icelandic based on Kaldi
Python
14
star
3

kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Shell
12
star
4

icelandic-NLP-resources

Overview of Icelandic NLP resources at a glance
11
star
5

samromur-asr

Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
Shell
10
star
6

ss_asr

A semi-supervised sequence-to-sequence ASR
Python
8
star
7

ictk

📚 Icelandic Corpora Toolkit - A collection of scripts to use with various Icelandic text corpora
Python
5
star
8

WebRICE

WebRICE (Web Reader ICE) is an open source web reader in development at Reykjavik University.
TypeScript
4
star
9

LOBE

LOBE is a recording client made specifically for TTS data collections. It supports multiple collections, single and multi-speaker, and can prompt sentences based on phonetic coverage.
Python
4
star
10

regina_normalizer

Python
3
star
11

tacotron

E2E-NN-TTS
Python
3
star
12

NER

Named entity recognition for Icelandic
Python
3
star
13

Icelandic-textnorm

Text normalization for Icelandic
Python
2
star
14

speech-corpora-toolkit

A collection of tools for processing public domain audio and scripts to prepare them for segmentation and alignment.
Python
2
star
15

deCODE

Voice Source and Vocal tract features
Shell
2
star
16

unit-selection-festival

First version of Unit Selection recipe for Icelandic from Reykjavík University
Shell
2
star
17

POS

A part-of-speech tagger, tailored to Icelandic
Python
2
star
18

sentiment-analysis

Deep Learning and Machine learning training on Icelandic translated data
Jupyter Notebook
2
star
19

althingi-asr

An ASR recipe and speech corpus of Icelandic parliamentary speeches
Shell
2
star
20

lm-is-forms

Generate language model data for form fillables
Python
1
star
21

broadcast_data_prep

This repository has scripts to extract data from various sources for ASR and speaker diarization.
Python
1
star
22

RUQuAD

A repository with information about Reykjavik University Question-Answer Dataset
1
star
23

qa-crowdsourcing-api

TypeScript
1
star
24

BinPackageAPI

Python
1
star
25

metawave

Meta-information tool for TTS datasets
Python
1
star
26

samromur-mfa

Aligner with MFA for Samromur dataset
Shell
1
star
27

compute

LVL Computing Resources
1
star
28

tal.ru.is

Vefgátt fyrir íslenskan talgreini
Python
1
star
29

ebs

Epoch Based Spectrum Estimation for Speech
MATLAB
1
star
30

GreynirCorrectAPI

Python
1
star
31

samromur-chat

Samrómur chat is a VoIP web application written in Typescript.
TypeScript
1
star
32

MOSI

TTS evaluation platform
Jinja
1
star