If you made any changes in Pure these will be visible here soon.

Research Output

Filter
Conference contribution
2011

Improving acoustic event detection using generalizable visual features and multi-modality modeling

Huang, P. S., Zhuang, X. & Hasegawa-Johnson, M. A., Aug 18 2011, 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 - Proceedings. p. 349-352 4 p. 5946412. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Multi-sensory features for personnel detection at border crossings

Huang, P. S., Damarla, T. & Hasegawa-Johnson, M. A., 2011, Fusion 2011 - 14th International Conference on Information Fusion. 5977673

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2010

Joint estimation of DOA and speech based on EM beamforming

Kim, L. H., Hasegawa-Johnson, M. A., Potamianos, G. & Libal, V., Nov 8 2010, 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings. p. 121-124 4 p. 5496144. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Landmark-based automated pronunciation error detection

Yoon, S. Y., Hasegawa-Johnson, M. A. & Sproat, R., 2010, Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. p. 614-617 4 p.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Non-frontal view facial expression recognition based on ergodic hidden Markov model supervectors

Tang, H., Hasegawa-Johnson, M. A. & Huang, T. S., Nov 22 2010, 2010 IEEE International Conference on Multimedia and Expo, ICME 2010. p. 1202-1207 6 p. 5582576. (2010 IEEE International Conference on Multimedia and Expo, ICME 2010).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Toward overcoming fundamental limitation in frequency-domain blind source separation for reverberant speech mixtures

Kim, L. H. & Hasegawa-Johnson, M. A., Dec 1 2010, Conference Record of the 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010. p. 542-545 4 p. 5757618. (Conference Record - Asilomar Conference on Signals, Systems and Computers).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models

Tang, H., Hasegawa-Johnson, M. A. & Huang, T. S., Nov 8 2010, 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010 - Proceedings. p. 5242-5245 4 p. 5494989. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2009

Acoustic fall detection using Gaussian mixture models and GMM supervectors

Zhuang, X., Huang, J., Potamianos, G. & Hasegawa-Johnson, M., Sep 23 2009, 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009. p. 69-72 4 p. 4959522. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Efficient object localization with gaussianized vector representation

Zhuang, X., Zhou, X., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 21 2009, 1st ACM International Workshop on Interactive Multimedia for Consumer Electronics - IMCE'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09. p. 89-95 7 p. (1st ACM International Workshop on Interactive Multimedia for Consumer Electronics - IMCE'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Emotion recognition from speech via boosted Gaussian mixture models

Tang, H., Chu, S. M., Hasegawa-Johnson, M. A. & Huang, T. S., Nov 20 2009, Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009. p. 294-297 4 p. 5202493. (Proceedings - 2009 IEEE International Conference on Multimedia and Expo, ICME 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Estimation of high-variance vehicular noise

Lee, B. & Hasegawa-Johnson, M. A., Dec 1 2009, In-Vehicle Corpus and Signal Processing for Driver Behavior. p. 221-232 12 p. (In-Vehicle Corpus and Signal Processing for Driver Behavior).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kernel metric learning for phonetic classification

Huang, J. T., Zhou, X., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 1 2009, Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009. p. 141-145 5 p. 5373389. (Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2008

A novel Gaussianized vector representation for natural scene categorization

Zhou, X., Zhuang, X., Tang, H., Hasegawa-Johnson, M. & Huang, T. S., Dec 1 2008, 2008 19th International Conference on Pattern Recognition, ICPR 2008. 4761665. (Proceedings - International Conference on Pattern Recognition).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Detecting non-modal phonation in telephone speech

Yoon, T. J., Cole, J. & Hasegawa-Johnson, M., Jan 1 2008, Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association, p. 33-36 4 p. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

EAVA: A 3D emotive audio-visual avatar

Tang, H., Fu, Y., Tu, J., Huang, T. S. & Hasegawa-Johnson, M., Sep 8 2008, 2008 IEEE Workshop on Applications of Computer Vision, WACV. 4544003. (2008 IEEE Workshop on Applications of Computer Vision, WACV).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Face age estimation using patch-based hidden Markov model supervectors

Zhuang, X., Zhou, X., Hasegawa-Johnson, M. & Huang, T., Dec 1 2008, 2008 19th International Conference on Pattern Recognition, ICPR 2008. 4761364. (Proceedings - International Conference on Pattern Recognition).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Feature analysis and selection for acoustic event detection

Zhuang, X., Zhou, X., Huang, T. S. & Hasegawa-Johnson, M. A., Sep 16 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 17-20 4 p. 4517535. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Optimal speech estimator considering room response as well as additive noise: Different approaches in low and high frequency range

Kim, L. H. & Hasegawa-Johnson, M., Sep 16 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 4573-4576 4 p. 4518674. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Real-time conversion from a single 2D face image to a 3D text-driven emotive audio-visual avatar

Tang, H., Hu, Y., Fu, Y., Hasegawa-Johnson, M. A. & Huang, T. S., 2008, 2008 IEEE International Conference on Multimedia and Expo, ICME 2008 - Proceedings. p. 1205-1208 4 p. 4607657

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Regression from patch-kernel

Yan, S., Zhou, X., MingLiu, Hasegawa-Johnson, M. A. & Huang, T. S., Sep 23 2008, 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR. 4587405. (26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

SIFT-bag kernel for video event analysis

Zhou, X., Zhuang, X., Yan, S., Chang, S. F., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 1 2008, MM'08 - Proceedings of the 2008 ACM International Conference on Multimedia, with co-located Symposium and Workshops. p. 229-238 10 p. (MM'08 - Proceedings of the 2008 ACM International Conference on Multimedia, with co-located Symposium and Workshops).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Stream weight tuning in dynamic Bayesian networks

Kantor, A. & Hasegawa-Johnson, M., Sep 16 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 4525-4528 4 p. 4518662. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Towards interpretation of creakiness in switchboard

Zhuang, X. & Hasegawa-Johnson, M., Jan 1 2008, Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association, p. 37-40 4 p. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Unsupervised prosodic break detection in Mandarin speech

Huang, J. T., Hasegawa-Johnson, M. A. & Shih, C., Jan 1 2008, Proceedings of the 4th International Conference on Speech Prosody. International Speech Communications Association, p. 165-168 4 p. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2007

A multi-stream approach to audiovisual automatic speech recognition

Hasegawa-Johnson, M. A., Dec 1 2007, 2007 IEEE 9Th International Workshop on Multimedia Signal Processing, MMSP 2007 - Proceedings. p. 328-331 4 p. 4412884. (2007 IEEE 9Th International Workshop on Multimedia Signal Processing, MMSP 2007 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Exploring discriminative learning for text-independent speaker recognition

Liu, M., Zhang, Z., Hasegawa-Johnson, M. & Huang, T. S., Jan 1 2007, Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007. IEEE Computer Society, p. 56-59 4 p. 4284585. (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Frequency domain correspondence for speaker normalization

Liu, M., Zhou, X., Hasegawa-Johnson, M. A., Huang, T. S. & Zhang, Z., Dec 1 2007, International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. p. 45-48 4 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Robust analysis and weighting on MFCC components for speech recognition and speaker identification

Zhou, X., Fu, Y., Liu, M., Hasegawa-Johnson, M. & Huang, T. S., Dec 1 2007, Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007. p. 188-191 4 p. 4284618. (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Room equalization based on acoustic and human perceptual features

Kim, L. H., Hasegawa-Johnson, M., Lim, J. S. & Sung, K. M., Dec 1 2007, Audio Engineering Society - 122nd Audio Engineering Society Convention 2007. p. 1370-1374 5 p. (Audio Engineering Society - 122nd Audio Engineering Society Convention 2007; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2006

Generalized optimal multi-microphone speech enhancement using sequential minimum variance distortionless response (MVDR) beamforming and postfiltering

Kim, L. H., Hasegawa-Johnson, M. A. & Sung, K. M., 2006, 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings. Vol. 3. 1660591

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HMM-based and SVM-based recognition of the speech of talkers with spastic dysarthria

Hasegawa-Johnson, M., Gunderson, J., Perlman, A. & Huang, T., Dec 1 2006, 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings. p. III1060-III1063 1660840. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lipreading by locality discriminant graph

Fu, Y., Zhou, X., Liu, M., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 1 2006, 2007 IEEE International Conference on Image Processing, ICIP 2007 Proceedings. 4379312. (Proceedings - International Conference on Image Processing, ICIP; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Novel entropy based moving average refiners for HMM landmarks

Chitturi, R. & Hasegawa-Johnson, M. A., Jan 1 2006, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. International Speech Communication Association, p. 1682-1685 4 p. (INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP; vol. 4).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Novel time domain multi-class SVMs for landmark detection

Chitturi, R. & Johnson, M. H., Jan 1 2006, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. International Speech Communication Association, p. 2354-2357 4 p. (INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP; vol. 5).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2005

Landmark-based speech recognition: Report of the 2004 Johns Hopkins summer workshop

Hasegawa-Johnson, M., Baker, J., Borys, S., Chen, K., Coogan, E., Greenberg, S., Juneja, A., Kirchhoff, K., Livescu, K., Mohan, S., Muller, J., Sonmez, K. & Wang, T., Jan 1 2005, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing. Institute of Electrical and Electronics Engineers Inc., p. I213-I216 1415088. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. I).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2003

Bayesian learning for models of human speech perception

Hasegawa-Johnson, M., Jan 1 2003, Proceedings of the 2003 IEEE Workshop on Statistical Signal Processing, SSP 2003. IEEE Computer Society, p. 408-411 4 p. 1289432. (IEEE Workshop on Statistical Signal Processing Proceedings; vol. 2003-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Improving the robustness of prosody dependent language modeling based on prosody syntax dependence

Chen, K. & Hasegawa-Johnson, M., Jan 1 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003. Institute of Electrical and Electronics Engineers Inc., p. 435-440 6 p. 1318480. (2003 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2003).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Particle filtering approach to Bayesian formant tracking

Zheng, Y. & Hasegawa-Johnson, M., Jan 1 2003, Proceedings of the 2003 IEEE Workshop on Statistical Signal Processing, SSP 2003. IEEE Computer Society, p. 601-604 4 p. 1289549. (IEEE Workshop on Statistical Signal Processing Proceedings; vol. 2003-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Strong-sense class-dependent features for statistical recognition

Omar, M. K. & Hasegawa-Johnson, M., Jan 1 2003, Proceedings of the 2003 IEEE Workshop on Statistical Signal Processing, SSP 2003. IEEE Computer Society, p. 490-493 4 p. 1289454. (IEEE Workshop on Statistical Signal Processing Proceedings; vol. 2003-January).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2001

Gaussian mixture models of phonetic boundaries for speech recognition

Omar, M. K., Hasegawa-Johnson, M. & Levinson, S., Jan 1 2001, 2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings. Institute of Electrical and Electronics Engineers Inc., p. 33-36 4 p. 1034582. (2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2000

Multivariate-state hidden Markov models for simultaneous transcription of phones and formants

Hasegawa-Johnson, M., Jan 1 2000, Speech Processing II. Institute of Electrical and Electronics Engineers Inc., p. 1323-1326 4 p. 861822. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Signal approximation in Hilbert space and its application on articulatory speech synthesis

Huang, J., Levinson, S. & Hasegawa-Johnson, M., Jan 1 2000, 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, (6th International Conference on Spoken Language Processing, ICSLP 2000).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Time-frequency distribution of partial phonetic information measured using mutual information

Hasegawa-Johnson, M. A., 2000, 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1999

CTMRedit: A Matlab-based tool for segmenting and interpolating MRI and CT images in three orthogonal planes

Hasegawa-Johnson, M., Cha, J. S. & Haker, K., Dec 1 1999, Annual International Conference of the IEEE Engineering in Medicine and Biology - Proceedings. IEEE, 1 p. (Annual International Conference of the IEEE Engineering in Medicine and Biology - Proceedings; vol. 2).

Research output: Chapter in Book/Report/Conference proceedingConference contribution