Sound texture synthesis via filter statistics

Josh H. McDermott, Andrew J Oxenham, Eero P. Simoncelli

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    33 Scopus citations

    Abstract

    Many natural sounds, such as those produced by rainstorms, fires, or insects at night, consist of large numbers of rapidly occurring acoustic events. We hypothesize that humans encode these "sound textures" with statistical measurements that capture their constituent features and the relationship between them. We explored this hypothesis using a synthesis algorithm that measures statistics in a real sound and imposes them on a sample of noise. Simply matching the marginal statistics (variance, kurtosis) of individual frequency subbands was generally necessary, but insufficient, to yield good results. Imposing various pairwise envelope statistics (correlations between bands, and autocorrelations within each band) greatly improved the results, frequently producing synthetic textures that sounded natural and that listeners could reliably recognize. The results suggest that such statistical representations could underlie sound texture perception, and that the auditory system may use fairly simple statistics to recognize many natural sound textures.

    Original languageEnglish (US)
    Title of host publication2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
    Pages297-300
    Number of pages4
    DOIs
    StatePublished - Dec 1 2009
    Event2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 - New Paltz, NY, United States
    Duration: Oct 18 2009Oct 21 2009

    Publication series

    NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics

    Other

    Other2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009
    CountryUnited States
    CityNew Paltz, NY
    Period10/18/0910/21/09

    Keywords

    • Correlations
    • Envelope
    • Statistics
    • Synthesis
    • Texture

    Fingerprint Dive into the research topics of 'Sound texture synthesis via filter statistics'. Together they form a unique fingerprint.

  • Cite this

    McDermott, J. H., Oxenham, A. J., & Simoncelli, E. P. (2009). Sound texture synthesis via filter statistics. In 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2009 (pp. 297-300). [5346467] (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). https://doi.org/10.1109/ASPAA.2009.5346467