Paper published in a journal (Scientific congresses, symposiums and conference proceedings)
Dynamic Data Pruning for Automatic Speech Recognition
Xiao, Qiao; Ma, Pingchuan; Fernandez-Lopez, Adriana et al.
2024In Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, p. 4488 - 4492
Peer reviewed
 

Files


Full Text
xiao24b_interspeech.pdf
Author postprint (477.56 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
learning efficiency; Automatic speech recognition; Computational demands; Data performance; Data pruning; Dynamic data; Model training; Signal Processing; Language and Linguistics; Human-Computer Interaction; Machine Learning
Abstract :
[en] The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-growing amount of training data.However, this trend has made model training prohibitively costly and imposed computational demands.While data pruning has been proposed to mitigate this issue by identifying a small subset of relevant data, its application in ASR has been barely explored, and existing works often entail significant overhead to achieve meaningful results.To fill this gap, this paper presents the first investigation of dynamic data pruning for ASR, finding that we can reach the full-data performance by dynamically selecting 70% of data.Furthermore, we introduce Dynamic Data Pruning for ASR (DDP-ASR), which offers several fine-grained pruning granularities specifically tailored for speech-related datasets, going beyond the conventional pruning of entire time sequences.Our intensive experiments show that DDP-ASR can save up to 1.6× training time with negligible performance loss.
Disciplines :
Computer science
Author, co-author :
Xiao, Qiao;  Eindhoven University of Technology, Netherlands
Ma, Pingchuan;  Meta AI, United Kingdom ; Imperial College London, United Kingdom
Fernandez-Lopez, Adriana;  Meta AI, United Kingdom
WU, Boqian ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS) ; University of Twente, Netherlands
Yin, Lu;  University of Surrey, United Kingdom
Petridis, Stavros;  Meta AI, United Kingdom ; Imperial College London, United Kingdom
Pechenizkiy, Mykola;  Eindhoven University of Technology, Netherlands
Pantic, Maja;  Meta AI, United Kingdom ; Imperial College London, United Kingdom
MOCANU, Decebal Constantin  ;  University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
Liu, Shiwei;  University of Oxford, United Kingdom
External co-authors :
yes
Language :
English
Title :
Dynamic Data Pruning for Automatic Speech Recognition
Publication date :
01 September 2024
Event name :
Interspeech 2024
Event place :
Kos Island, Greece
Event date :
01-09-2024 => 05-09-2024
Audience :
International
Journal title :
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
ISSN :
2308-457X
eISSN :
1990-9772
Publisher :
International Speech Communication Association
Pages :
4488 - 4492
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Development Goals :
9. Industry, innovation and infrastructure
Available on ORBilu :
since 01 February 2026

Statistics


Number of views
7 (3 by Unilu)
Number of downloads
2 (0 by Unilu)

Scopus citations®
 
1
Scopus citations®
without self-citations
1
OpenCitations
 
0
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBilu