Communication publiée dans un ouvrage (Colloques, congrès, conférences scientifiques et actes)
An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface
Walter, Oliver; DESPOTOVIC, Vladimir; Haeb-Umbach, Reinhold et al.
2014In Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014)
Peer reviewed
 

Documents


Texte intégral
INTERSPEECH 2014.pdf
Postprint Éditeur (143.26 kB)
Télécharger

Tous les documents dans ORBilu sont protégés par une licence d'utilisation.

Envoyer vers



Détails



Mots-clés :
unsupervised learning; acoustic unit descriptors; dysarthric speech; non-negative matrix factorization
Résumé :
[en] In this paper, we investigate unsupervised acoustic model training approaches for dysarthric-speech recognition. These models are first, frame-based Gaussian posteriorgrams, obtained from Vector Quantization (VQ), second, so-called Acoustic Unit Descriptors (AUDs), which are hidden Markov models of phone-like units, that are trained in an unsupervised fashion, and, third, posteriorgrams computed on the AUDs. Experiments were carried out on a database collected from a home automation task and containing nine speakers, of which seven are considered to utter dysarthric speech. All unsupervised modeling approaches delivered significantly better recognition rates than a speaker-independent phoneme recognition baseline, showing the suitability of unsupervised acoustic model training for dysarthric speech. While the AUD models led to the most compact representation of an utterance for the subsequent semantic inference stage, posteriorgram-based representations resulted in higher recognition rates, with the Gaussian posteriorgram achieving the highest slot filling F-score of 97.02%.
Disciplines :
Sciences informatiques
Auteur, co-auteur :
Walter, Oliver;  University of Paderborn > Department of Communications Engineering
DESPOTOVIC, Vladimir ;  University of Paderborn > Department of Communications Engineering
Haeb-Umbach, Reinhold;  University of Paderborn > Department of Communications Engineering
Gemmeke, Jort;  Katholieke Universiteit Leuven - KUL > ESAT - PSI, Processing Speech and Images
Van hamme, Hugo;  Katholieke Universiteit Leuven - KUL > ESAT - PSI, Processing Speech and Images
Ons, Bart;  Katholieke Universiteit Leuven - KUL > ESAT - PSI, Processing Speech and Images
Co-auteurs externes :
yes
Langue du document :
Anglais
Titre :
An Evaluation of Unsupervised Acoustic Model Training for a Dysarthric Speech Interface
Date de publication/diffusion :
septembre 2014
Nom de la manifestation :
15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014)
Lieu de la manifestation :
Singapore, Singapour
Date de la manifestation :
from 14-09-2014 to 18-09-2014
Manifestation à portée :
International
Titre de l'ouvrage principal :
Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014)
Pagination :
1013-1017
Peer reviewed :
Peer reviewed
Organisme subsidiant :
DFG - Deutsche Forschungsgemeinschaft
IWT-SBO
CE - Commission Européenne
Disponible sur ORBilu :
depuis le 05 novembre 2019

Statistiques


Nombre de vues
161 (dont 1 Unilu)
Nombre de téléchargements
126 (dont 0 Unilu)

citations Scopus®
 
11
citations Scopus®
sans auto-citations
8

Bibliographie


Publications similaires



Contacter ORBilu