Paper published in a book (Scientific congresses, symposiums and conference proceedings)
"¿Te vienes? Sure!" Joint Fine-tuning of Language Detection and Transcription Improves Automatic Recognition of Code-Switching Speech
HILLAH, Léopold Edem Ayité; DUBIEL, Mateusz; LEIVA, Luis A.
2024, in Proceedings of the 6th ACM Conference on Conversational User Interfaces
Peer reviewed
 

Files


Full Text
Joint_Fine_tuning_of_Language_Detection_and_ASR_for_Code_Switching_Speech.pdf
Author postprint (694.09 kB)

All documents in ORBilu are protected by a user license.

Details



Keywords :
Code Switching; Multilingual Conversations; Language Identification; Automatic Speech Recognition; Whisper; Speech
Abstract :
[en] Human communication in multilingual communities often leads to code-switching, where individuals seamlessly alternate between two or more languages in their daily interactions. While this phenomenon has become increasingly prevalent thanks to linguistic globalization, it presents challenges for Automatic Speech Recognition (ASR) systems, which are designed on the assumption that they transcribe a single language at a time. In this work, we propose a simple yet unexplored approach to tackle this challenge: fine-tuning the pre-trained Whisper model jointly on language identification (LID) and transcription tasks by introducing an auxiliary LID loss term. Our results show significant reductions in transcription errors, ranging from 14 to 36 percentage points. Ultimately, our work opens a new direction for research on code-switching speech, offering an opportunity to enhance the current capabilities of conversational agents.
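The abstract describes the training objective only as a transcription loss with an auxiliary LID loss term. A minimal sketch of one common way to combine two such terms is given below, in plain Python with no dependencies. The additive form, the `lid_weight` parameter and its default value, and the function names `cross_entropy` and `joint_loss` are illustrative assumptions; the record does not state how the paper weights or computes the two terms.

```python
import math


def cross_entropy(logits, target_index):
    """Negative log-probability of the target class under softmax(logits)."""
    peak = max(logits)  # subtract the max before exponentiating, for stability
    log_norm = peak + math.log(sum(math.exp(x - peak) for x in logits))
    return log_norm - logits[target_index]


def joint_loss(token_logits, token_targets, lid_logits, lid_target, lid_weight=0.5):
    """Mean per-token transcription loss plus a weighted LID loss.

    ASSUMPTION: the simple weighted sum and the default lid_weight are
    hypothetical choices for illustration, not taken from the paper.
    """
    transcription = sum(
        cross_entropy(logits, target)
        for logits, target in zip(token_logits, token_targets)
    ) / len(token_targets)
    lid = cross_entropy(lid_logits, lid_target)
    return transcription + lid_weight * lid


# Toy example: two decoded tokens over a 4-symbol vocabulary, two candidate languages.
loss = joint_loss(
    token_logits=[[2.0, 0.1, 0.1, 0.1], [0.1, 0.1, 3.0, 0.1]],
    token_targets=[0, 2],
    lid_logits=[1.5, 0.2],
    lid_target=0,
)
```

Setting `lid_weight` to 0 in this sketch recovers plain transcription fine-tuning, which makes the auxiliary term easy to ablate.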
Disciplines :
Computer science
Author, co-author :
HILLAH, Léopold Edem Ayité  ;  University of Luxembourg
DUBIEL, Mateusz  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
LEIVA, Luis A.  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
External co-authors :
no
Language :
English
Title :
"¿Te vienes? Sure!" Joint Fine-tuning of Language Detection and Transcription Improves Automatic Recognition of Code-Switching Speech
Publication date :
08 July 2024
Event name :
CUI '24: 6th ACM Conference on Conversational User Interfaces
Event organizer :
Association for Computing Machinery (ACM)
Event place :
Luxembourg City, Luxembourg
Event date :
from 8 to 10 July 2024
Audience :
International
Main work title :
Proceedings of the 6th ACM Conference on Conversational User Interfaces
Publisher :
Association for Computing Machinery, New York, NY, United States
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
European Projects :
HE - 101071147 - SYMBIOTIK - Context-aware adaptive visualizations for critical decision making
FnR Project :
FNR15722813 - Brainsourcing For Affective Attention Estimation, 2021 (01/02/2022-31/01/2025) - Luis Leiva
Funders :
European Union
Funding text :
This work is supported by the Horizon 2020 FET program of the European Union through the ERA-NET Cofund funding (BANANA, grant CHIST-ERA-20-BCI-001) and Horizon Europe's European Innovation Council through the Pathfinder program (SYMBIOTIK, grant 101071147).
Available on ORBilu :
since 12 July 2024

Statistics


Number of views
133 (12 by Unilu)
Number of downloads
243 (10 by Unilu)

Scopus citations®
1
Scopus citations® without self-citations
1
OpenCitations
0
OpenAlex citations
1
