SOLAR: Sound object localization and retrieval in complex audio environments

Derek Hoiem, Yan Ke, Rahul Sukthankar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The ability to identify sounds in complex audio environments is highly useful for multimedia retrieval, security, and many mobile robotic applications, but very little work has been done in this area. We present the SOLAR system, a system capable of finding sound objects, such as dog barks or car horns, in complex audio data extracted from movies. SOLAR avoids the need for segmentation by scanning over the audio data in fixed increments and classifying each short audio window separately. SOLAR employs boosted decision tree classifiers to select suitable features for modeling each sound object and to discriminate between the object of interest and all other sounds. We demonstrate the effectiveness of our approach with experiments on thirteen sound object classes trained using only tens of positive examples and tested on hours of audio data extracted from popular movies.

Original languageEnglish (US)
Title of host publication2005 IEEE ICASSP '05 - Proc. - Design and Implementation of Signal Proces.Syst.,Indust. Technol. Track,Machine Learning for Signal Proces. Education, Spec. Sessions
PublisherInstitute of Electrical and Electronics Engineers Inc.
PagesV429-V432
ISBN (Print)0780388747, 9780780388741
DOIs
StatePublished - Jan 1 2005
Externally publishedYes
Event2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Philadelphia, PA, United States
Duration: Mar 18 2005Mar 23 2005

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
VolumeV
ISSN (Print)1520-6149

Other

Other2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
CountryUnited States
CityPhiladelphia, PA
Period3/18/053/23/05

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'SOLAR: Sound object localization and retrieval in complex audio environments'. Together they form a unique fingerprint.

  • Cite this

    Hoiem, D., Ke, Y., & Sukthankar, R. (2005). SOLAR: Sound object localization and retrieval in complex audio environments. In 2005 IEEE ICASSP '05 - Proc. - Design and Implementation of Signal Proces.Syst.,Indust. Technol. Track,Machine Learning for Signal Proces. Education, Spec. Sessions (pp. V429-V432). [1416332] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. V). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICASSP.2005.1416332