Getting the Cocktail Party Started: Masking Effects in Speech Perception - PubMed
Getting the Cocktail Party Started: Masking Effects in Speech Perception
Samuel Evans et al. J Cogn Neurosci. 2016 Mar.
Abstract
Spoken conversations typically take place in noisy environments, and different kinds of masking sounds place differing demands on cognitive resources. Previous studies, examining the modulation of neural activity associated with the properties of competing sounds, have shown that additional speech streams engage the superior temporal gyrus. However, the absence of a condition in which target speech was heard without additional masking made it difficult to identify brain networks specific to masking and to ascertain the extent to which competing speech was processed equivalently to target speech. In this study, we scanned young healthy adults with continuous fMRI, while they listened to stories masked by sounds that differed in their similarity to speech. We show that auditory attention and control networks are activated during attentive listening to masked speech in the absence of an overt behavioral task. We demonstrate that competing speech is processed predominantly in the left hemisphere within the same pathway as target speech but is not treated equivalently within that stream and that individuals who perform better in speech in noise tasks activate the left mid-posterior superior temporal gyrus more. Finally, we identify neural responses associated with the onset of sounds in the auditory environment; activity was found within right lateralized frontal regions consistent with a phasic alerting response. Taken together, these results provide a comprehensive account of the neural processes involved in listening in noise.
Figures

(A) Oscillograms and spectrograms of masking stimuli. SP=speech; ROT=rotated speech; SMN=Speech modulated noise. (B) The organisation of a set of trials (left – rounded boxes) and the statistical models (right). Epoch Model (Red), the first column in the design matrix models the presence of the voice of speaker 1, and thus models all auditory trials for their full duration, excluding “silent” implicit baseline trials. Additional columns model the presence of competing masking sounds derived from speaker 2, with each column representing a different masking sound. These events partially overlap with events specified in the first column. This design identifies unique variance associated with hearing speaker 1 in clear, and the additional effect of masking with competing sounds. Onset Model (Red + Orange), the design matrix models events in the same way as the Epoch Model (Red) with additional events modelling the onset of clear speech and masking (Orange). This allows the identification of unique variance associated with the onset of masking. (C) Behavioural post-scanning accuracy for each condition. Bar graphs of the mean beta value for each condition with within subject error bars representing one standard error (Loftus and Masson 1994).

Masking and intelligibility networks. White – regions responding to clear speech as compared to the resting baseline. Red – regions responding more to clear than to masked speech. Blue – regions responding more to masked than to clear speech. Orange – regions in which activity correlated at a whole brain level with accuracy in comprehension of speech in post-scanning masking tasks (averaged across condition). Bar graphs of the mean beta value for each condition with within subject error bars representing one standard error (Loftus and Masson 1994). Scatter plot of the relationship between neural activity and comprehension of masked speech in post scanning masking tasks.

Regions showing increasing activation in response to masking sounds with increasing informational content. Bar graphs of the mean beta value for each condition with within subject error bars representing one standard error (Loftus and Masson 1994).

(A) Activation overlap map for the different contrasts, including the conjunction of [SP > ROT] ∩ [SP > SMN] in red box with plot of the response. (B) Region of interest analyses comparing the neural response to the intelligibility of the masking stimulus [SP > ROT] for bilateral anterior and posterior STS. Plots show mean beta values for each condition with within subject error bars representing one standard error (Loftus and Masson 1994). (C) Lateralisation curve for [SP > ROT] within temporal cortex.

(A) Average effect of masking onsets – activation associated with the onset of masking sounds in the presence of on-going scanner noise and target speech. Red rendering shows the effect of masking onsets and the time course of the response in selected regions in plots (1) and (2). The blue rendering shows regions responding more to masked epochs, as compared to clear speech, and the time course of the responses in selected regions in plots (3) and (4). (B) Modulation of onset effects by informational content. (C) Clear speech onsets – activation associated with the onset of clear speech in the presence of on-going scanner. (D) Conjunction of clear speech and masking onsets. Bar graphs show the mean beta value for each condition with within subject error bars representing one standard error (Loftus & Masson, 1994), (E) Lateralisation curves for the frontal cortex for (i) Masking onsets (ii) clear speech onsets.
Similar articles
-
Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.
Vander Ghinst M, Bourguignon M, Op de Beeck M, Wens V, Marty B, Hassid S, Choufani G, Jousmäki V, Hari R, Van Bogaert P, Goldman S, De Tiège X. Vander Ghinst M, et al. J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016. J Neurosci. 2016. PMID: 26843641 Free PMC article.
-
Hwang JH, Wu CW, Chen JH, Liu TC. Hwang JH, et al. Acta Otolaryngol. 2006 Sep;126(9):916-20. doi: 10.1080/00016480500546375. Acta Otolaryngol. 2006. PMID: 16864487
-
Neural Tuning to Low-Level Features of Speech throughout the Perisylvian Cortex.
Berezutskaya J, Freudenburg ZV, Güçlü U, van Gerven MAJ, Ramsey NF. Berezutskaya J, et al. J Neurosci. 2017 Aug 16;37(33):7906-7920. doi: 10.1523/JNEUROSCI.0238-17.2017. Epub 2017 Jul 17. J Neurosci. 2017. PMID: 28716965 Free PMC article.
-
The neural processing of masked speech.
Scott SK, McGettigan C. Scott SK, et al. Hear Res. 2013 Sep;303:58-66. doi: 10.1016/j.heares.2013.05.001. Epub 2013 May 16. Hear Res. 2013. PMID: 23685149 Free PMC article. Review.
-
[Auditory perception and language: functional imaging of speech sensitive auditory cortex].
Samson Y, Belin P, Thivard L, Boddaert N, Crozier S, Zilbovicius M. Samson Y, et al. Rev Neurol (Paris). 2001 Sep;157(8-9 Pt 1):837-46. Rev Neurol (Paris). 2001. PMID: 11677406 Review. French.
Cited by
-
Listening under difficult conditions: An activation likelihood estimation meta-analysis.
Alain C, Du Y, Bernstein LJ, Barten T, Banai K. Alain C, et al. Hum Brain Mapp. 2018 Jul;39(7):2695-2709. doi: 10.1002/hbm.24031. Epub 2018 Mar 13. Hum Brain Mapp. 2018. PMID: 29536592 Free PMC article.
-
Alickovic E, Lunner T, Wendt D, Fiedler L, Hietkamp R, Ng EHN, Graversen C. Alickovic E, et al. Front Neurosci. 2020 Sep 10;14:846. doi: 10.3389/fnins.2020.00846. eCollection 2020. Front Neurosci. 2020. PMID: 33071722 Free PMC article.
-
Perron M, Vuong V, Grassi MW, Imran A, Alain C. Perron M, et al. Hum Brain Mapp. 2024 Sep;45(13):e70023. doi: 10.1002/hbm.70023. Hum Brain Mapp. 2024. PMID: 39268584 Free PMC article.
-
A Tutorial on Auditory Attention Identification Methods.
Alickovic E, Lunner T, Gustafsson F, Ljung L. Alickovic E, et al. Front Neurosci. 2019 Mar 19;13:153. doi: 10.3389/fnins.2019.00153. eCollection 2019. Front Neurosci. 2019. PMID: 30941002 Free PMC article.
-
Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment.
Kiremitçi I, Yilmaz Ö, Çelik E, Shahdloo M, Huth AG, Çukur T. Kiremitçi I, et al. Cereb Cortex. 2021 Oct 1;31(11):4986-5005. doi: 10.1093/cercor/bhab136. Cereb Cortex. 2021. PMID: 34115102 Free PMC article.
References
-
- Adank P. The neural bases of difficult speech comprehension and speech production: Two Activation Likelihood Estimation (ALE) meta-analyses. Brain and Language. 2012;122(1):42–54. Retrieved from http://www.sciencedirect.com/science/article/pii/S0093934X12000867. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources