TY - JOUR
T1 - Recovering sound sources from embedded repetition
AU - McDermott, Josh H.
AU - Wrobleski, David
AU - Oxenham, Andrew J.
PY - 2011/1/18
Y1 - 2011/1/18
N2 - Cocktail parties and other natural auditory environments present organisms with mixtures of sounds. Segregating individual sound sources is thought to require prior knowledge of source properties, yet these presumably cannot be learned unless the sources are segregated first. Here we show that the auditory system can bootstrap its way around this problem by identifying sound sources as repeating patterns embedded in the acoustic input. Due to the presence of competing sounds, source repetition is not explicit in the input to the ear, but it produces temporal regularities that listeners detect and use for segregation. We used a simple generative model to synthesize novel sounds with naturalistic properties. We found that such sounds could be segregated and identified if they occurred more than once across different mixtures, even when the same sounds were impossible to segregate in single mixtures. Sensitivity to the repetition of sound sources can permit their recovery in the absence of other segregation cues or prior knowledge of sounds, and could help solve the cocktail party problem.
AB - Cocktail parties and other natural auditory environments present organisms with mixtures of sounds. Segregating individual sound sources is thought to require prior knowledge of source properties, yet these presumably cannot be learned unless the sources are segregated first. Here we show that the auditory system can bootstrap its way around this problem by identifying sound sources as repeating patterns embedded in the acoustic input. Due to the presence of competing sounds, source repetition is not explicit in the input to the ear, but it produces temporal regularities that listeners detect and use for segregation. We used a simple generative model to synthesize novel sounds with naturalistic properties. We found that such sounds could be segregated and identified if they occurred more than once across different mixtures, even when the same sounds were impossible to segregate in single mixtures. Sensitivity to the repetition of sound sources can permit their recovery in the absence of other segregation cues or prior knowledge of sounds, and could help solve the cocktail party problem.
KW - Auditory scene analysis
KW - Cocktail party problem
KW - Generative models of sound
KW - Natural sound statistics
KW - Sound segregation
UR - http://www.scopus.com/inward/record.url?scp=79551629887&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79551629887&partnerID=8YFLogxK
U2 - 10.1073/pnas.1004765108
DO - 10.1073/pnas.1004765108
M3 - Article
C2 - 21199948
AN - SCOPUS:79551629887
SN - 0027-8424
VL - 108
SP - 1188
EP - 1193
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 3
ER -