[en] We applied state-space modelling technique to estimate the cognitive workload of a speech-in-noise (SIN) recall task, based on participants’ oculo-motor responses to speech signals. We estimated common latent attention levels in 15 time bins and observed temporal changes between pupillary dilations and saccade frequencies, given that the both conditions were independent. We also compared two speech type factors (natural vs. synthetic) and three levels of signal-to-noise (-1dB, -3dB, and -5dB) using the estimated parameter distribution. The comparison of experimental factors provided us with insights into differences in participants’ processing of spoken information during a SIN recall task.
Disciplines :
Computer science
Author, co-author :
DUBIEL, Mateusz ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
Nakayama, Minoru; Tokyo Institute of Technology
Wang, Xin; National Institute of Informatics - Tokyo, Japan
External co-authors :
yes
Language :
English
Title :
Modelling Attention Levels with Ocular Responses in a Speech-in-Noise Recall Task
Publication date :
30 May 2023
Event name :
ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications
Event date :
from 30-05-2023 to 02-06-2023
Audience :
International
Main work title :
ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications
Publisher :
Association for Computing Machinery, Tubingen, Germany
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Arif Ahmed, Gondy Leroy, Han Yu Lu, David Kauchak, Jeff Stone, Philip Harber, Stephen A Rains, Prashant Mishra, and Bhumi Chitroda. 2023. Audio delivery of health information: An NLP study of information difficulty and bias in listeners. Procedia Computer Science 219 (2023), 1509-1517.
Samantha F. Anderson, Ken Kelly, and Scott E. Maxwell. 2017. Sample-Size Planning for More Accurate Statistical Power: A Met hod Adjusting Sample Effect Sizes for Publication Bias and Uncertainty. Psychological Science 28(11) (2017), 1547-1562.
David Andrewes. 2015. Neuropsychology: From theory to practice. Psychology Press.
Jackson Beatty. 1982. Task-evoked pupillary responses, processing load, and the structure of processing resources. Psychological bulletin 91, 2 (1982), 276.
Jacob Cohen. 2013. Statistical power analysis for the behavioral sciences. Academic press.
Mateusz Dubiel, Minoru Nakayama, and XinWang. 2021a. Combining Oculo-motor Indices to Measure Cognitive Load of Synthetic Speech in Noisy Listening Conditions. In ACM Symposium on Eye Tracking Research and Applications. 1-6.
Mateusz Dubiel, Minoru Nakayama, and XinWang. 2021b. Evaluating Synthetic Speech Workload with Oculo-motor indices: Preliminary Observations for Japanese Speech. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC2021), Vol. 4:BIOSIGNALS. INSTICC publishing, Lisbon, 335-342.
Yoshinobu Ebisawa and Mitsuhiro Sugiura. 1998. Influences of Target and Fixation Point Conditions on Characteristics of Visually Guided Voluntary Saccade. The Journal of the Institute of Image Information and Television En gineers 52, 11 (1998), 1730-1737.
Avashna Govender and Simon King. 2018. Measuring the Cognitive Load of Synthetic Speech Using a Dual Task Paradigm. In Interspeech. 2843-2847.
Avashna Govender, Anita E Wagner, and Simon King. 2019. Using Pupil Dilation to Measure Cognitive Load When Listening to Text-to-Speech in Quiet and in Noise. In INTERSPEECH. 1551-1555.
Julia M. Haaf and Jeffrey N. Rounder. 2017. Developing constraint in Bayesian mixed models. Psychological Methods 22 (2017), 779-798.
Daniel Kahneman and Jackson Beatty. 1966. Pupil diameter and load on memory. Science 154, 3756 (1966), 1583-1585.
Michael D. Lee. 2011. How cognitive modeling can benefit from hierarchical Bayesian models. Journal of Mathematical Psychology 55 (2011), 1-7.
David McNaughton, Janice Light, David R Beukelman, Chris Klein, Dana Nieder, and Godfrey Nazareth. 2019. Building capacity in AAC: A person-centred approach to supporting participation by people with complex communication needs. Augmentative and Alternative Communication 35, 1 (2019), 56-68.
Minoru Nakayama and Yoshiya Hayakawa. 2021. Influence of Task-evoked Mental Workloads on Oculo-motor indices and their connections. EAI Trans. Context-aware Systems and Application 7, 23 (2021), e2:1-10.
Tomomi Okano and Minoru Nakayama. 2022. Research on Time Series Evaluation of Cognitive Load Factors using Features of Eye Movement. In Proc. ETRA2022, COGAIN Workshop. ACM, NY, USA, 61:1-6.
MK Pichora-Fuller. 2007. Audition and cognition: What audiologists need to know about listening. Hearing care for adults (2007), 71-85.
Patrick MA Rabbitt. 1968. Channel-capacity, intelligibility and immediate memory. The Quarterly journal of experimental psychology 20, 3 (1968), 241-248.
Jerker Rönnberg, Thomas Lunner, Adriana Zekveld, Patrik Sörqvist, Henrik Danielsson, Björn Lyxell, Orjan Dahlström, Carine Signoret, Stefan Stenfelt,MKathleen Pichora-Fuller, et al. 2013. The Ease of Language Understanding (ELU) model: Theoretical, empirical, and clinical advances. Frontiers in systems neuroscience 7 (2013), 31.
Takahiro Ueno and Minoru Nakayama. 2021. Estimation of Visual Attention using Microsaccades in response to Vibrations in the Peripheral Field of Vision. In Proc. ETRA2021. ACM, NY, USA, 19:1-6.
Sumio Watanabe and Manfred Opper. 2010. Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of machine learning research 11, 12 (2010).
Matthew B Winn, Dorothea Wendt, Thomas Koelewijn, and Stefanie E Kuchinsky. 2018. Best practices and advice for using pupillometry to measure listening effort: An introduction for those who want to get started. Trends in hearing 22 (2018), 2331216518800869.
Adriana A Zekveld, Sophia E Kramer, and Joost M Festen. 2010. Pupil response as an indication of effortful listening: The influence of sentence intelligibility. Ear and hearing 31, 4 (2010), 480-490.
Heiga Zen, Andrew Senior, and Mike Schuster. 2013. Statistical parametric speech synthesis using deep neural networks. In 2013 ieee international conference on acoustics, speech and signal processing. IEEE, 7962-7966.
Similar publications
Sorry the service is unavailable at the moment. Please try again later.