Experiments of ASR-based mispronunciation detection for children and adult English learners

HOSSEINI KIVANANI, Nina; Gretter, Roberto; Matassoni, Marco; Falavigna, Giuseppe Daniele

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Experiments of ASR-based mispronunciation detection for children and adult English learners

HOSSEINI KIVANANI, Nina; Gretter, Roberto; Matassoni, Marco et al.

2021 • In HOSSEINI KIVANANI, Nina; Gretter, Roberto; Matassoni, Marco et al. (Eds.) BNAIC/BeneLearn 2021

Peer reviewed

Permalink
https://hdl.handle.net/10993/51660

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

bnaic2021_preproceedings.pdf

Publisher postprint (56.77 MB)

ISSN 2799-2527

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

ASR; Detection of pronunciation errors; l2 learners

Abstract :

[en] Pronunciation is one of the fundamentals of language learning, and it is considered a primary factor of spoken language when it comes to an understanding and being understood by others. The persistent presence of high error rates in speech recognition domains resulting from mispronunciations motivates us to find alternative techniques for handling mispronunciations. In this study, we develop a mispronunciation assessment system that checks the pronunciation of non-native English speakers, identifies the commonly mispronounced phonemes of Italian learners of English, and presents an evaluation of the non-native pronunciation observed in phonetically annotated speech corpora. In this work, to detect mispronunciations, we used a phone-based ASR implemented using Kaldi. We used two non-native English labeled corpora; (i) a corpus of Italian adults contains 5,867 utterances from 46 speakers, and (ii) a corpus of Italian children consists of 5,268 utterances from 78 children. Our results show that the selected error model can discriminate correct sounds from incorrect sounds in both native and non-native speech, and therefore can be used to detect pronunciation errors in nonnative speech. The phone error rates show improvement in using the error language model. Furthermore, the ASR system shows better accuracy after applying the error model on our selected corpora.

Disciplines :

Computer science

Author, co-author :

HOSSEINI KIVANANI, Nina ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)

Gretter, Roberto

Matassoni, Marco

Falavigna, Giuseppe Daniele

External co-authors :

yes

Language :

English

Title :

Experiments of ASR-based mispronunciation detection for children and adult English learners

Publication date :

November 2021

Event name :

33rd Benelux Conference on Artificial Intelligence and 30th Belgian-Dutch Conference on Machine Learning

Event organizer :

Proceedings of BNAIC/BeneLearn 2021

Event place :

Luxembourg

Event date :

10/11/2021-12/11/2021

Main work title :

BNAIC/BeneLearn 2021

Author, co-author :

HOSSEINI KIVANANI, Nina

Gretter, Roberto

Matassoni, Marco

Falavigna, Giuseppe Daniele

Publisher :

BnL

ISBN/EAN :

0-2799-2527-X

Pages :

203-216

Peer reviewed :

Peer reviewed

Focus Area :

Computational Sciences

Available on ORBilu :

since 15 July 2022

Statistics

Number of views

184 (9 by Unilu)

Number of downloads

58 (2 by Unilu)

More statistics