Modelling Attention Levels with Ocular Responses in a Speech-in-Noise Recall Task

DUBIEL, Mateusz; Nakayama, Minoru; Wang, Xin

doi:10.1145/3588015.3589665

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Modelling Attention Levels with Ocular Responses in a Speech-in-Noise Recall Task

DUBIEL, Mateusz; Nakayama, Minoru; Wang, Xin

2023 • In ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications

Peer reviewed

Permalink
https://hdl.handle.net/10993/55536

DOI
10.1145/3588015.3589665

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

Modelling_Attention_Levels.pdf

Publisher postprint (2.54 MB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Speech perception; Eye movement; Cognitive load; Statistical modelling; Synthetic speech

Abstract :

[en] We applied state-space modelling technique to estimate the cognitive workload of a speech-in-noise (SIN) recall task, based on participants’ oculo-motor responses to speech signals. We estimated common latent attention levels in 15 time bins and observed temporal changes between pupillary dilations and saccade frequencies, given that the both conditions were independent. We also compared two speech type factors (natural vs. synthetic) and three levels of signal-to-noise (-1dB, -3dB, and -5dB) using the estimated parameter distribution. The comparison of experimental factors provided us with insights into differences in participants’ processing of spoken information during a SIN recall task.

Disciplines :

Computer science

Author, co-author :

DUBIEL, Mateusz ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)

Nakayama, Minoru; Tokyo Institute of Technology

Wang, Xin; National Institute of Informatics - Tokyo, Japan

External co-authors :

yes

Language :

English

Title :

Modelling Attention Levels with Ocular Responses in a Speech-in-Noise Recall Task

Publication date :

30 May 2023

Event name :

ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications

Event date :

from 30-05-2023 to 02-06-2023

Audience :

International

Main work title :

ETRA '23: Proceedings of the 2023 Symposium on Eye Tracking Research and Applications

Publisher :

Association for Computing Machinery, Tubingen, Germany

ISBN/EAN :

979-8-4007-0150-4

Peer reviewed :

Peer reviewed

Focus Area :

Computational Sciences

Additional URL :

https://dl.acm.org/doi/10.1145/3588015.3589665

Available on ORBilu :

since 06 July 2023

Statistics

Number of views

66 (6 by Unilu)

Number of downloads

11 (3 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenAlex citations

publications

supporting

mentioning

contrasting

Smart Citations

Citing PublicationsSupportingMentioningContrasting

View Citations

See how this article has been cited at scite.ai

scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.

Bibliography

Arif Ahmed, Gondy Leroy, Han Yu Lu, David Kauchak, Jeff Stone, Philip Harber, Stephen A Rains, Prashant Mishra, and Bhumi Chitroda. 2023. Audio delivery of health information: An NLP study of information difficulty and bias in listeners. Procedia Computer Science 219 (2023), 1509-1517.
Samantha F. Anderson, Ken Kelly, and Scott E. Maxwell. 2017. Sample-Size Planning for More Accurate Statistical Power: A Met hod Adjusting Sample Effect Sizes for Publication Bias and Uncertainty. Psychological Science 28(11) (2017), 1547-1562.
David Andrewes. 2015. Neuropsychology: From theory to practice. Psychology Press.
Jackson Beatty. 1982. Task-evoked pupillary responses, processing load, and the structure of processing resources. Psychological bulletin 91, 2 (1982), 276.
Jacob Cohen. 2013. Statistical power analysis for the behavioral sciences. Academic press.
Mateusz Dubiel, Minoru Nakayama, and XinWang. 2021a. Combining Oculo-motor Indices to Measure Cognitive Load of Synthetic Speech in Noisy Listening Conditions. In ACM Symposium on Eye Tracking Research and Applications. 1-6.
Mateusz Dubiel, Minoru Nakayama, and XinWang. 2021b. Evaluating Synthetic Speech Workload with Oculo-motor indices: Preliminary Observations for Japanese Speech. In Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC2021), Vol. 4:BIOSIGNALS. INSTICC publishing, Lisbon, 335-342.
Yoshinobu Ebisawa and Mitsuhiro Sugiura. 1998. Influences of Target and Fixation Point Conditions on Characteristics of Visually Guided Voluntary Saccade. The Journal of the Institute of Image Information and Television En gineers 52, 11 (1998), 1730-1737.
Avashna Govender and Simon King. 2018. Measuring the Cognitive Load of Synthetic Speech Using a Dual Task Paradigm. In Interspeech. 2843-2847.
Avashna Govender, Anita E Wagner, and Simon King. 2019. Using Pupil Dilation to Measure Cognitive Load When Listening to Text-to-Speech in Quiet and in Noise. In INTERSPEECH. 1551-1555.
Julia M. Haaf and Jeffrey N. Rounder. 2017. Developing constraint in Bayesian mixed models. Psychological Methods 22 (2017), 779-798.
Daniel Kahneman and Jackson Beatty. 1966. Pupil diameter and load on memory. Science 154, 3756 (1966), 1583-1585.
Michael D. Lee. 2011. How cognitive modeling can benefit from hierarchical Bayesian models. Journal of Mathematical Psychology 55 (2011), 1-7.
David McNaughton, Janice Light, David R Beukelman, Chris Klein, Dana Nieder, and Godfrey Nazareth. 2019. Building capacity in AAC: A person-centred approach to supporting participation by people with complex communication needs. Augmentative and Alternative Communication 35, 1 (2019), 56-68.
Minoru Nakayama and Yoshiya Hayakawa. 2021. Influence of Task-evoked Mental Workloads on Oculo-motor indices and their connections. EAI Trans. Context-aware Systems and Application 7, 23 (2021), e2:1-10.
Tomomi Okano and Minoru Nakayama. 2022. Research on Time Series Evaluation of Cognitive Load Factors using Features of Eye Movement. In Proc. ETRA2022, COGAIN Workshop. ACM, NY, USA, 61:1-6.
MK Pichora-Fuller. 2007. Audition and cognition: What audiologists need to know about listening. Hearing care for adults (2007), 71-85.
Patrick MA Rabbitt. 1968. Channel-capacity, intelligibility and immediate memory. The Quarterly journal of experimental psychology 20, 3 (1968), 241-248.
Jerker Rönnberg, Thomas Lunner, Adriana Zekveld, Patrik Sörqvist, Henrik Danielsson, Björn Lyxell, Orjan Dahlström, Carine Signoret, Stefan Stenfelt,MKathleen Pichora-Fuller, et al. 2013. The Ease of Language Understanding (ELU) model: Theoretical, empirical, and clinical advances. Frontiers in systems neuroscience 7 (2013), 31.
Takahiro Ueno and Minoru Nakayama. 2021. Estimation of Visual Attention using Microsaccades in response to Vibrations in the Peripheral Field of Vision. In Proc. ETRA2021. ACM, NY, USA, 19:1-6.
Sumio Watanabe and Manfred Opper. 2010. Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of machine learning research 11, 12 (2010).
Matthew B Winn, Dorothea Wendt, Thomas Koelewijn, and Stefanie E Kuchinsky. 2018. Best practices and advice for using pupillometry to measure listening effort: An introduction for those who want to get started. Trends in hearing 22 (2018), 2331216518800869.
Adriana A Zekveld, Sophia E Kramer, and Joost M Festen. 2010. Pupil response as an indication of effortful listening: The influence of sentence intelligibility. Ear and hearing 31, 4 (2010), 480-490.
Heiga Zen, Andrew Senior, and Mike Schuster. 2013. Statistical parametric speech synthesis using deep neural networks. In 2013 ieee international conference on acoustics, speech and signal processing. IEEE, 7962-7966.