Skip to content

aalto-speech/fin-parl-models

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Finnish parliament ASR Kaldi model recipes

This repository contains Kaldi recipes for training models using the Finnish Parliament ASR corpus.

Requirements

In addition to Kaldi, the recipes rely on three external tools to do subword tokenization and language modeling.

VariKN

VariKN is used to do n-gram language modeling. For download and installation, see the VariKN Github.

SentencePiece

SentencePiece is used for subword tokenization. For download and installation, see the SentencePiece Github.

Subword-kaldi

Subword-kaldi is included as a git submodule in this repository. To get the submodule, run

git submodule init
git submodule update

SpeechBrain models

See this separate repository for the SpeechBrain models.

About

Baseline Finnish models trained with Finnish Parliament Speech corpus

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published