TY - GEN
T1 - Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization
AU - Lin, Kai Hsiang
AU - Zhuang, Xiaodan
AU - Goudeseune, Camille
AU - King, Sarah
AU - Hasegawa-Johnson, Mark
AU - Huang, Thomas S.
PY - 2012
Y1 - 2012
N2 - We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information between the spectrograms of isolated target events and the estimated saliency of the overall visual representation. When subjects are shown spectrograms that are saliency-maximized, they perform significantly better in a 1/10-real-time acoustic event detection task.
AB - We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information between the spectrograms of isolated target events and the estimated saliency of the overall visual representation. When subjects are shown spectrograms that are saliency-maximized, they perform significantly better in a 1/10-real-time acoustic event detection task.
KW - acoustic event detection
KW - audio visualization
KW - visual saliency
UR - http://www.scopus.com/inward/record.url?scp=84867596119&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84867596119&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2012.6288368
DO - 10.1109/ICASSP.2012.6288368
M3 - Conference contribution
AN - SCOPUS:84867596119
SN - 9781467300469
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 2277
EP - 2280
BT - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings
T2 - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
Y2 - 25 March 2012 through 30 March 2012
ER -