Language and Voice Lab (@cadia-lvl)

Top repositories

1

punctuation-prediction

Support tools for punctuation and boundary detection for ASR output.
Python
57
star
2

icelandic-NLP-resources

Overview of Icelandic NLP resources at a glance
15
star
3

kaldi-speaker-diarization

This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
Shell
13
star
4

ice-asr

An automatic speech recognition environment for Icelandic based on Kaldi
Python
13
star
5

ss_asr

A semi-supervised sequence-to-sequence ASR
Python
8
star
6

samromur-asr

Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi
Shell
6
star
7

ictk

📚 Icelandic Corpora Toolkit - A collection of scripts to use with various Icelandic text corpora
Python
5
star
8

WebRICE

WebRICE (Web Reader ICE) is an open source web reader in development at Reykjavik University.
TypeScript
5
star
9

regina_normalizer

Python
4
star
10

LOBE

LOBE is a recording client made specifically for TTS data collections. It supports multiple collections, single and multi-speaker, and can prompt sentences based on phonetic coverage.
Python
4
star
11

tacotron

E2E-NN-TTS
Python
3
star
12

NER

Named entity recognition for Icelandic
Python
3
star
13

unit-selection-festival

First version of Unit Selection recipe for Icelandic from Reykjavík University
Shell
3
star
14

Icelandic-textnorm

Text normalization for Icelandic
Python
2
star
15

speech-corpora-toolkit

A collection of tools for processing public domain audio and scripts to prepare them for segmentation and alignment.
Python
2
star
16

deCODE

Voice Source and Vocal tract features
Shell
2
star
17

samromur

TypeScript
2
star
18

POS

A part-of-speech tagger, tailored to Icelandic
Python
2
star
19

MOSI

TTS evaluation platform
Jinja
2
star
20

sentiment-analysis

Deep Learning and Machine learning training on Icelandic translated data
Jupyter Notebook
2
star
21

althingi-asr

An ASR recipe and speech corpus of Icelandic parliamentary speeches
Shell
2
star
22

broadcast_data_prep

This repository has scripts to extract data from various sources for ASR and speaker diarization.
Python
1
star
23

lm-is-forms

Generate language model data for form fillables
Python
1
star
24

RUQuAD

A repository with information about Reykjavik University Question-Answer Dataset
1
star
25

qa-crowdsourcing-api

TypeScript
1
star
26

BinPackageAPI

Python
1
star
27

metawave

Meta-information tool for TTS datasets
Python
1
star
28

samromur-mfa

Aligner with MFA for Samromur dataset
Shell
1
star
29

compute

LVL Computing Resources
1
star
30

tal.ru.is

Vefgátt fyrir íslenskan talgreini
Python
1
star
31

GreynirCorrectAPI

Python
1
star
32

ebs

Epoch Based Spectrum Estimation for Speech
MATLAB
1
star
33

diar-az

Diarization A to Z - Kaldi to Gecko to Kaldi and corpus and back
Python
1
star
34

samromur-chat

Samrómur chat is a VoIP web application written in Typescript.
TypeScript
1
star
35

SoftwareDevelopmentGuidelines

The LVL software development guidelines
1
star
36

cadia-lvl.github.io

This is the Language and Voice Lab's GitHub landing page
HTML
1
star
37

Greynir-setningafraedi

Rannsóknarverkefni í grunnnámi í Tölvunarfræði við HR. Greynir tólið nýtt til að búa til frumgerð af kennsluforriti fyrir framhaldsskólanema.
HTML
1
star
38

alignment-and-segmentation

Scripts for preparing the RUV TV and radio material for ASR
Shell
1
star