A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation
- PMID: 19206802
- PMCID: PMC2676630
- DOI: 10.1121/1.3001672
A cocktail party with a cortical twist: how cortical mechanisms contribute to sound segregation
Abstract
Sound systems and speech technologies can benefit greatly from a deeper understanding of how the auditory system, and particularly the auditory cortex, is able to parse complex acoustic scenes into meaningful auditory objects and streams under adverse conditions. In the current work, a biologically plausible model of this process is presented, where the role of cortical mechanisms in organizing complex auditory scenes is explored. The model consists of two stages: (i) a feature analysis stage that maps the acoustic input into a multidimensional cortical representation and (ii) an integrative stage that recursively builds up expectations of how streams evolve over time and reconciles its predictions with the incoming sensory input by sorting it into different clusters. This approach yields a robust computational scheme for speaker separation under conditions of speech or music interference. The model can also emulate the archetypal streaming percepts of tonal stimuli that have long been tested in human subjects. The implications of this model are discussed with respect to the physiological correlates of streaming in the cortex as well as the role of attention and other top-down influences in guiding sound organization.
Figures
Similar articles
-
Cortical Representations of Speech in a Multitalker Auditory Scene.J Neurosci. 2017 Sep 20;37(38):9189-9196. doi: 10.1523/JNEUROSCI.0938-17.2017. Epub 2017 Aug 18. J Neurosci. 2017. PMID: 28821680 Free PMC article.
-
ARTSTREAM: a neural network model of auditory scene analysis and source segregation.Neural Netw. 2004 May;17(4):511-36. doi: 10.1016/j.neunet.2003.10.002. Neural Netw. 2004. PMID: 15109681
-
Temporal coherence and the streaming of complex sounds.Adv Exp Med Biol. 2013;787:535-43. doi: 10.1007/978-1-4614-1590-9_59. Adv Exp Med Biol. 2013. PMID: 23716261 Free PMC article.
-
Pitch, harmonicity and concurrent sound segregation: psychoacoustical and neurophysiological findings.Hear Res. 2010 Jul;266(1-2):36-51. doi: 10.1016/j.heares.2009.09.012. Epub 2009 Sep 27. Hear Res. 2010. PMID: 19788920 Free PMC article. Review.
-
Functional imaging of auditory scene analysis.Hear Res. 2014 Jan;307:98-110. doi: 10.1016/j.heares.2013.08.003. Epub 2013 Aug 19. Hear Res. 2014. PMID: 23968821 Review.
Cited by
-
Preliminary Evidence for Global Properties in Human Listeners During Natural Auditory Scene Perception.Open Mind (Camb). 2024 Mar 26;8:333-365. doi: 10.1162/opmi_a_00131. eCollection 2024. Open Mind (Camb). 2024. PMID: 38571530 Free PMC article.
-
Increased reliance on temporal coding when target sound is softer than the background.Sci Rep. 2024 Feb 23;14(1):4457. doi: 10.1038/s41598-024-54865-5. Sci Rep. 2024. PMID: 38396044 Free PMC article.
-
Effect of Reverberation on Neural Responses to Natural Speech in Rabbit Auditory Midbrain: No Evidence for a Neural Dereverberation Mechanism.eNeuro. 2023 May 10;10(5):ENEURO.0447-22.2023. doi: 10.1523/ENEURO.0447-22.2023. Print 2023 May. eNeuro. 2023. PMID: 37072174 Free PMC article.
-
Modeling the Repetition-Based Recovering of Acoustic and Visual Sources With Dendritic Neurons.Front Neurosci. 2022 Apr 28;16:855753. doi: 10.3389/fnins.2022.855753. eCollection 2022. Front Neurosci. 2022. PMID: 35573290 Free PMC article.
-
Making sense of periodicity glimpses in a prediction-update-loop-A computational model of attentive voice tracking.J Acoust Soc Am. 2022 Feb;151(2):712. doi: 10.1121/10.0009337. J Acoust Soc Am. 2022. PMID: 35232067 Free PMC article.
References
-
- Anstis, S., and Saida, S. (1985). “Adaptation to auditory streaming of frequency-modulated tones,” J. Exp. Psychol. Hum. Percept. Perform. JPHPDH10.1037//0096-1523.11.3.257 11, 257–271. - DOI
-
- Aubin, T., and Jouventin, P. (1998). “Cocktail-part effect in king penguin colonies,” Proc. R. Soc. London, Ser. B PRLBA410.1098/rspb.1998.0486 265, 1665–1673. - DOI
-
- Barlow, H. (1994). Large-Scale Neuronal Theories of the Brain (MIT Press, Cambridge, MA: ), pp. 1–22.
-
- Bay, J. S. (1999). Fundamentals of Linear State Space Systems (McGraw-Hill, Boston: ).
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources