ASRLUX: AUTOMATIC SPEECH RECOGNITION FOR THE LOW-RESOURCE LANGUAGE LUXEMBOURGISH

GILLES, Peter; HILLAH, Léopold Edem Ayité; HOSSEINI KIVANANI, Nina

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

2023 • In Skarnitzl, Radek; Volín, Jan (Eds.) Proceedings of the 20th International Congress of Phonetic Sciences

Peer reviewed

Permalink
https://hdl.handle.net/10993/55819

Files

Full Text

Lux-ASR - ICPhS_2023_PROCEEDINGS.pdf

Publisher postprint (147.26 kB)

All documents in ORBilu are protected by a user license.

Details

Keywords :

Luxembourgish; automatic speech recognition (ASR); low-resource language

Abstract :

[en] We have developed an automatic speech recognition (ASR) system tailored to Luxembourgish, a low-resource language that poses distinct challenges for conventional ASR approaches because of the limited availability of training data and its inherently multilingual nature. Using transfer learning, we fine-tuned a range of models derived from pre-trained wav2vec 2.0 and Whisper checkpoints. These checkpoints were pre-trained on a large multilingual corpus comprising several hundred thousand hours of audio, using unsupervised and weakly supervised methods, respectively. The pre-training data includes linguistically related languages such as German, Dutch, and French, which expedites cross-lingual training of Luxembourgish-specific models. Fine-tuning used 67 hours of annotated Luxembourgish speech from a diverse range of speakers. The best word error rates (WER) achieved by the wav2vec 2.0 and Whisper models were 9.5 and 12.1, respectively. These low WERs support the efficacy of transfer learning for ASR in low-resource languages.
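For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how a word error rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a system hypothesis, divided by the number of reference words. The function name `word_error_rate` and the placeholder transcripts are assumptions made for illustration; the record does not state which scoring tool or text normalization the authors used, and the figures above are presumably this ratio expressed as a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper only (hypothetical name); not the authors' scoring code.
    Tokenization is plain whitespace splitting, with no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Placeholder transcripts: one substitution (b -> x) and one deletion (d).
wer = word_error_rate("a b c d", "a x c")
print(f"WER = {wer:.1%}")
```

Because insertions count as errors, the ratio can exceed 1.0 when the hypothesis is much longer than the reference.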

Disciplines :

Computer science

Author, co-author :

GILLES, Peter ; University of Luxembourg > Faculty of Humanities, Education and Social Sciences (FHSE) > Department of Humanities (DHUM)

HILLAH, Léopold Edem Ayité ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM)

HOSSEINI KIVANANI, Nina ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)

External co-authors :

Language :

English

Title :

ASRLUX: AUTOMATIC SPEECH RECOGNITION FOR THE LOW-RESOURCE LANGUAGE LUXEMBOURGISH

Publication date :

2023

Event name :

20th International Congress of Phonetic Sciences (ICPhS)

Event organizer :

University of Prague

Event date :

from 07-08-2023 to 11-08-2023

Audience :

International

Main work title :

Proceedings of the 20th International Congress of Phonetic Sciences

Editor :

Skarnitzl, Radek

Volín, Jan

Publisher :

Guarant International, Prague

ISBN/EAN :

978-80-908114-2-3

Pages :

3091-3095

Peer review/Selection committee :

Peer reviewed

Focus Area :

Computational Sciences

Available on ORBilu :

since 21 August 2023

Statistics

Number of views

543 (28 by Unilu)

Number of downloads

286 (13 by Unilu)
