TY - GEN
T1 - Feature analysis and selection for acoustic event detection
AU - Zhuang, Xiaodan
AU - Zhou, Xi
AU - Huang, Thomas S.
AU - Hasegawa-Johnson, Mark
PY - 2008
Y1 - 2008
N2 - Speech perceptual features, such as Mel-frequency Cepstral Coefficients (MFCC), have been widely used in acoustic event detection. However, the different spectral structures between speech and acoustic events degrade the performance of the speech feature sets. We propose quantifying the discriminative capability of each feature component according to the approximated Bayesian accuracy and deriving a discriminative feature set for acoustic event detection. Compared to MFCC, feature sets derived using the proposed approaches achieve about 30% relative accuracy improvement in acoustic event detection.
AB - Speech perceptual features, such as Mel-frequency Cepstral Coefficients (MFCC), have been widely used in acoustic event detection. However, the different spectral structures between speech and acoustic events degrade the performance of the speech feature sets. We propose quantifying the discriminative capability of each feature component according to the approximated Bayesian accuracy and deriving a discriminative feature set for acoustic event detection. Compared to MFCC, feature sets derived using the proposed approaches achieve about 30% relative accuracy improvement in acoustic event detection.
KW - Acoustic event detection
KW - Bayesian Accuracy
KW - Feature Selection
KW - Hidden Markov Models
UR - http://www.scopus.com/inward/record.url?scp=51449101221&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51449101221&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2008.4517535
DO - 10.1109/ICASSP.2008.4517535
M3 - Conference contribution
AN - SCOPUS:51449101221
SN - 1424414849
SN - 9781424414840
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 17
EP - 20
BT - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
T2 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
Y2 - 31 March 2008 through 4 April 2008
ER -