Aalto Speech Research (@aalto-speech)

Top repositories

1

speaker-diarization

Speaker diarization scripts, based on AaltoASR
Python
190
star
2

morfessor

Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
Python
175
star
3

AaltoASR

Aalto Automatic Speech Recognition tools
C++
83
star
4

subword-kaldi

Properly handle position-dependent phones in a subword lexicon FST
Python
31
star
5

interspeech2019_karhila_et_al

Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Ylinen & Kurimo submitted to Interspeech 2019
Jupyter Notebook
24
star
6

flatcat

Morfessor FlatCat
Python
12
star
7

finnish-forced-alignment

Python
10
star
8

finnish-parliament-scripts

Scripts for retrieving and aligning speech and meeting transcripts from the web portal of the Parliament of Finland (https://www.eduskunta.fi)
Python
9
star
9

Wav2vec2Interpretation

scripts and images for article "Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model"
Python
8
star
10

FinChat

FinChat corpus and evaluation set
Jupyter Notebook
7
star
11

exchange

Bigram exchange algorithm
C++
6
star
12

speechbrain-cl

Implementation of different curriculum learning (CL) methods for speechbrain's ASR recipes.
Python
5
star
13

AaltoASR-online-demo

C++
4
star
14

avsr

Audio-visual speech recognition models
Jupyter Notebook
4
star
15

fin-parl-models

Baseline Finnish models trained with Finnish Parliament Speech corpus
Shell
3
star
16

fi-parliament-tools

Tools for downloading and processing Finnish parliament data
Python
3
star
17

ner-asr

Named Entity Recognition for Finnish Language
Python
3
star
18

ftk

This toolkit contains programs for segmenting strings and training string segmentation models. It has been developed primarily for learning units for speech recognition, but can be used for other purposes as well.
C++
3
star
19

kaldi-sb-north-sme

Kaldi + SpeechBrain + W2V2 models for Northern Sami
Python
3
star
20

Topic-identification-for-spontaneous-Finnish-speech

Python
2
star
21

rl-klm

RL-KLM implementation that can be used to estimate task completion times for user interface.
Python
2
star
22

modules

Installation scripts for used modules in Aalto ASR research group
Shell
2
star
23

moodle-mod_digitala

DigiTala is a Moodle plugin for assessing L2 Finnish and Swedish speech automatically. Cite as: "von Zansen, A., Alanen, T., Al-Ghezi, R., Erkkilä, J., Harjunpää, T., Heijala, M., Kallio, H. (2022). DigiTala Moodle plugin. https://github.com/aalto-speech/moodle-mod_digitala "
PHP
2
star
24

say-it-again-kid-pronunciation-learning

Privacy policies for the language learning games developed in collaboration with University of Helsinki Cognitive Brain Research group.
HTML
2
star
25

sb-fin-parl-models

SpeechBrain baseline recipes for Finnish Parliament data
Python
1
star
26

ComParE2023

Code repository for the experiments conducted for the ComParE 2023 challenge.
Python
1
star
27

wdecoder

Decoders for AaltoASR acoustic models.
Lex
1
star
28

fin-parl-lahjoita-puhetta-s5

Speech Recognition experiments combining Lahjoita Puhetta with Finnish Parliament
Python
1
star
29

conversation-assistant

Conversation Assistant iOS-app and Kaldi ASR server for real-time automatic speech recognition in conversational situations.
Python
1
star
30

lahjoita-puhetta-metadata-classification

Python
1
star
31

finnish_chatbot

Python
1
star
32

FinnishXL

Code Base for Transformer-XL on Finnish Language
Python
1
star
33

aalto-asr-preprocessor

Aalto ASR preprocessing tool for preparing texts.
Python
1
star
34

speechbrain-lahjoita-puhetta-baseline

Baseline E2E AED model for Lahjoita Puhetta in SpeechBrain
Python
1
star
35

l2-speech-scoring-tools

Implementation of automatic speech rating systems for second language (L2) learners of Finnish and Finland Swedish
Jupyter Notebook
1
star
36

lahjoita-puhetta-baseline-wav2vec2

Baseline self-supervised Wav2Vec2 ASR system for Lahjoita puhetta corpus
Python
1
star
37

BizSpeech_SpeechBrain

Building an ASR system recipe for BizSpeech data using SpeechBrain.
Python
1
star
38

lahjoita-puhetta-resources

A collection of resources related to the Lahjoita puhetta speech corpus.
1
star
39

Compare2020

Aalto's solutions for the 2020 Computational Paralinguistics Challenges: Breathing & Masks
1
star