Paper published in a book (Colloquia, congresses, scientific conferences and proceedings)
Semantic Analysis of Spoken Input Using Markov Logic Networks
DESPOTOVIC, Vladimir; Walter, Oliver; Haeb-Umbach, Reinhold
2015. In Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015)
Peer reviewed
 

Documents


Full text
INTERSPEECH 2015.pdf
Publisher postprint (176.16 kB)

All documents in ORBilu are protected by a user license.

Details



Keywords:
Unsupervised learning; Acoustic units; Speech; Markov Logic Networks; Semantic frame
Abstract:
[en] We present a semantic analysis technique for spoken input using Markov Logic Networks (MLNs). MLNs combine graphical models with first-order logic. They are particularly suitable for providing inference in the presence of inconsistent and incomplete data, which are typical of an automatic speech recognizer’s (ASR) output in the presence of degraded speech. The target application is a speech interface to a home automation system to be operated by people with speech impairments, where the ASR output is particularly noisy. In order to cater for dysarthric speech with non-canonical phoneme realizations, acoustic representations of the input speech are learned in an unsupervised fashion. While training data transcripts are not required for the acoustic model training, the MLN training requires supervision, however, at a rather loose and abstract level. Results on two databases, one of them for dysarthric speech, show that MLN-based semantic analysis clearly outperforms baseline approaches employing non-negative matrix factorization, multinomial naive Bayes models, or support vector machines.
Disciplines:
Computer science
Author, co-author:
DESPOTOVIC, Vladimir; University of Belgrade > Technical Faculty in Bor
Walter, Oliver; University of Paderborn > Department of Communications Engineering
Haeb-Umbach, Reinhold; University of Paderborn > Department of Communications Engineering
External co-authors:
yes
Document language:
English
Title:
Semantic Analysis of Spoken Input Using Markov Logic Networks
Publication date:
September 2015
Event name:
16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015)
Event place:
Dresden, Germany
Event date:
from 06-09-2015 to 10-09-2015
Event scope:
International
Main work title:
Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015)
Pages:
1859-1863
Peer reviewed:
Peer reviewed
Funders:
DFG - Deutsche Forschungsgemeinschaft
EC - European Commission
Available on ORBilu:
since 05 November 2019

Statistics

Number of views
124 (0 by Unilu)
Number of downloads
89 (0 by Unilu)

Scopus® citations
1
Scopus® citations without self-citations
0
