AUTOMATIC SPEECH RECOGNITION OF LOW RESOURCE LANGUAGES

Loading...
Thumbnail Image

Citation for Previous Publication

Link to Related Item

Abstract

Description

This study focused on exploring ASR systems primarily for the transcription of the Totonac languages of Coatepec and Upper Necaxa. Best ASR transcription results were achieved using Meta Research MMS multilingual model with the Wave2vec ASR framework. The Totonac languages were transcribed with a reasonable Phoneme Error rate based on the Highland Totonac language trained into the Meta MMS model. The transcription accuracy of consonants is higher than vowels, giving a linguistic researcher an automatically transcribed template that can serve as the basis for manual fine-tuning of phonemes and word boundaries.

Item Type

http://purl.org/coar/resource_type/c_1843

Alternative

Other License Text / Link

Language

en

Location

Time Period

Source