speaker-diarization
Speaker diarization scripts, based on AaltoASR
morfessor
Morfessor is a tool for unsupervised and semi-supervised morphological segmentation
AaltoASR
Aalto Automatic Speech Recognition tools
subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
interspeech2019_karhila_et_al
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Ylinen & Kurimo submitted to Interspeech 2019
flatcat
Morfessor FlatCat
finnish-forced-alignment
finnish-parliament-scripts
Scripts for retrieving and aligning speech and meeting transcripts from the web portal of the Parliament of Finland (https://www.eduskunta.fi)
Wav2vec2Interpretation
Scripts and images for the article "Investigating wav2vec2 context representations and the effects of fine-tuning, a case-study of a Finnish model"
FinChat
FinChat corpus and evaluation set
exchange
Bigram exchange algorithm
speechbrain-cl
Implementation of different curriculum learning (CL) methods for SpeechBrain's ASR recipes.
AaltoASR-online-demo
avsr
Audio-visual speech recognition models
fin-parl-models
Baseline Finnish models trained with the Finnish Parliament Speech corpus
fi-parliament-tools
Tools for downloading and processing Finnish parliament data
ner-asr
Named entity recognition for the Finnish language
ftk
This toolkit contains programs for segmenting strings and training string segmentation models. It has been developed primarily for learning units for speech recognition, but can be used for other purposes as well.
kaldi-sb-north-sme
Kaldi + SpeechBrain + W2V2 models for Northern Sami
Topic-identification-for-spontaneous-Finnish-speech
rl-klm
RL-KLM implementation that can be used to estimate task completion times for user interfaces.
modules
Installation scripts for modules used in the Aalto ASR research group
moodle-mod_digitala
DigiTala is a Moodle plugin for assessing L2 Finnish and Swedish speech automatically. Cite as: "von Zansen, A., Alanen, T., Al-Ghezi, R., Erkkilä, J., Harjunpää, T., Heijala, M., Kallio, H. (2022). DigiTala Moodle plugin. https://github.com/aalto-speech/moodle-mod_digitala"
say-it-again-kid-pronunciation-learning
Privacy policies for the language learning games developed in collaboration with the University of Helsinki Cognitive Brain Research group.
sb-fin-parl-models
SpeechBrain baseline recipes for Finnish Parliament data
ComParE2023
Code repository for the experiments conducted for the ComParE 2023 challenge.
wdecoder
Decoders for AaltoASR acoustic models.
fin-parl-lahjoita-puhetta-s5
Speech recognition experiments combining Lahjoita Puhetta with Finnish Parliament
conversation-assistant
Conversation Assistant iOS app and Kaldi ASR server for real-time automatic speech recognition in conversational situations.
lahjoita-puhetta-metadata-classification
finnish_chatbot
FinnishXL
Codebase for Transformer-XL on the Finnish language
aalto-asr-preprocessor
Aalto ASR preprocessing tool for preparing texts.
speechbrain-lahjoita-puhetta-baseline
Baseline E2E AED model for Lahjoita Puhetta in SpeechBrain
l2-speech-scoring-tools
Implementation of automatic speech rating systems for second language (L2) learners of Finnish and Finland Swedish
lahjoita-puhetta-baseline-wav2vec2
Baseline self-supervised Wav2Vec2 ASR system for the Lahjoita puhetta corpus
BizSpeech_SpeechBrain
Building an ASR system recipe for BizSpeech data using SpeechBrain.
lahjoita-puhetta-resources
A collection of resources related to the Lahjoita puhetta speech corpus.
Compare2020
Aalto's solutions for the 2020 Computational Paralinguistics Challenges: Breathing & Masks