If you made any changes in Pure these will be visible here soon.

Research Output

2008

HMM-based acoustic event detection with adaboost feature selection

Zhou, X., Zhuang, X., Liu, M., Tang, H., Hasegawa-Johnson, M. & Huang, T., Jul 28 2008, In : Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4625 LNCS, p. 345-353 9 p.

Research output: Contribution to journalConference article

Humanoid audio-visual avatar with emotive text-to-speech synthesis

Tang, H., Fu, Y., Tu, J., Hasegawa-Johnson, M. & Huang, T. S., Oct 1 2008, In : IEEE Transactions on Multimedia. 10, 6, p. 969-981 13 p., 4637888.

Research output: Contribution to journalArticle

Multichannel and multimodality person identification

Liu, M., Chen, Y., Zhou, X., Zhuang, X., Hasegawa-Johnson, M. & Huang, T., Jul 28 2008, In : Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4625 LNCS, p. 248-255 8 p.

Research output: Contribution to journalConference article

Optimal speech estimator considering room response as well as additive noise: Different approaches in low and high frequency range

Kim, L. H. & Hasegawa-Johnson, M., Sep 16 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 4573-4576 4 p. 4518674. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Real-time conversion from a single 2D face image to a 3D text-driven emotive audio-visual avatar

Tang, H., Hu, Y., Fu, Y., Hasegawa-Johnson, M. A. & Huang, T. S., 2008, 2008 IEEE International Conference on Multimedia and Expo, ICME 2008 - Proceedings. p. 1205-1208 4 p. 4607657

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Regression from patch-kernel

Yan, S., Zhou, X., MingLiu, Hasegawa-Johnson, M. A. & Huang, T. S., Sep 23 2008, 26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR. 4587405. (26th IEEE Conference on Computer Vision and Pattern Recognition, CVPR).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

SIFT-bag kernel for video event analysis

Zhou, X., Zhuang, X., Yan, S., Chang, S. F., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 1 2008, MM'08 - Proceedings of the 2008 ACM International Conference on Multimedia, with co-located Symposium and Workshops. p. 229-238 10 p. (MM'08 - Proceedings of the 2008 ACM International Conference on Multimedia, with co-located Symposium and Workshops).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Stream weight tuning in dynamic Bayesian networks

Kantor, A. & Hasegawa-Johnson, M., Sep 16 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP. p. 4525-4528 4 p. 4518662. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

The entropy of the articulatory phonological code: Recognizing gestures from tract variables

Zhuang, X., Nam, H., Hasegawa-Johnson, M., Goldstein, L. & Saltzman, E., Dec 1 2008, In : Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. p. 1489-1492 4 p.

Research output: Contribution to journalConference article

Towards interpretation of creakiness in switchboard

Zhuang, X. & Hasegawa-Johnson, M., Jan 1 2008, Proceedings of the 4th International Conference on Speech Prosody, SP 2008. International Speech Communications Association, p. 37-40 4 p. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Two-stage prosody prediction for emotional text-to-speech synthesis

Tang, H., Zhou, X., Odisio, M., Hasegawa-Johnson, M. & Huang, T. S., Dec 1 2008, In : Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. p. 2138-2141 4 p.

Research output: Contribution to journalConference article

Unsupervised prosodic break detection in Mandarin speech

Huang, J. T., Hasegawa-Johnson, M. A. & Shih, C., Jan 1 2008, Proceedings of the 4th International Conference on Speech Prosody. International Speech Communications Association, p. 165-168 4 p. (Proceedings of the 4th International Conference on Speech Prosody, SP 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2007

A multi-stream approach to audiovisual automatic speech recognition

Hasegawa-Johnson, M. A., Dec 1 2007, 2007 IEEE 9Th International Workshop on Multimedia Signal Processing, MMSP 2007 - Proceedings. p. 328-331 4 p. 4412884. (2007 IEEE 9Th International Workshop on Multimedia Signal Processing, MMSP 2007 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Exploring discriminative learning for text-independent speaker recognition

Liu, M., Zhang, Z., Hasegawa-Johnson, M. & Huang, T. S., Jan 1 2007, Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007. IEEE Computer Society, p. 56-59 4 p. 4284585. (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Frequency domain correspondence for speaker normalization

Liu, M., Zhou, X., Hasegawa-Johnson, M. A., Huang, T. S. & Zhang, Z., Dec 1 2007, International Speech Communication Association - 8th Annual Conference of the International Speech Communication Association, Interspeech 2007. p. 45-48 4 p. (Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH; vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Prosodic effects on acoustic cues to stop voicing and place of articulation: Evidence from Radio News speech

Cole, J., Kim, H., Choi, H. & Hasegawa-Johnson, M., Apr 1 2007, In : Journal of Phonetics. 35, 2, p. 180-209 30 p.

Research output: Contribution to journalArticle

Robust analysis and weighting on MFCC components for speech recognition and speaker identification

Zhou, X., Fu, Y., Liu, M., Hasegawa-Johnson, M. & Huang, T. S., Dec 1 2007, Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007. p. 188-191 4 p. 4284618. (Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Room equalization based on acoustic and human perceptual features

Kim, L. H., Hasegawa-Johnson, M., Lim, J. S. & Sung, K. M., Dec 1 2007, Audio Engineering Society - 122nd Audio Engineering Society Convention 2007. p. 1370-1374 5 p. (Audio Engineering Society - 122nd Audio Engineering Society Convention 2007; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2006

Cognitive state classification in a spoken tutorial dialogue system

Zhang, T., Hasegawa-Johnson, M. & Levinson, S. E., Jun 1 2006, In : Speech Communication. 48, 6, p. 616-632 17 p.

Research output: Contribution to journalArticle

Extraction of pragmatic and semantic salience from spontaneous spoken English

Zhang, T., Hasegawa-Johnson, M. & Levinson, S. E., Mar 1 2006, In : Speech Communication. 48, 3-4, p. 437-462 26 p.

Research output: Contribution to journalArticle

Generalized optimal multi-microphone speech enhancement using sequential minimum variance distortionless response (MVDR) beamforming and postfiltering

Kim, L. H., Hasegawa-Johnson, M. A. & Sung, K. M., 2006, 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings. Vol. 3. 1660591

Research output: Chapter in Book/Report/Conference proceedingConference contribution

HMM-based and SVM-based recognition of the speech of talkers with spastic dysarthria

Hasegawa-Johnson, M., Gunderson, J., Perlman, A. & Huang, T., Dec 1 2006, 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings. p. III1060-III1063 1660840. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lipreading by locality discriminant graph

Fu, Y., Zhou, X., Liu, M., Hasegawa-Johnson, M. A. & Huang, T. S., Dec 1 2006, 2007 IEEE International Conference on Image Processing, ICIP 2007 Proceedings. 4379312. (Proceedings - International Conference on Image Processing, ICIP; vol. 3).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Novel entropy based moving average refiners for HMM landmarks

Chitturi, R. & Hasegawa-Johnson, M. A., Jan 1 2006, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. International Speech Communication Association, p. 1682-1685 4 p. (INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP; vol. 4).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Novel time domain multi-class SVMs for landmark detection

Chitturi, R. & Johnson, M. H., Jan 1 2006, INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP. International Speech Communication Association, p. 2354-2357 4 p. (INTERSPEECH 2006 and 9th International Conference on Spoken Language Processing, INTERSPEECH 2006 - ICSLP; vol. 5).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Prosody dependent speech recognition on radio news corpus of American English

Chen, K., Hasegawa-Johnson, M., Cohen, A., Borys, S., Kim, S. S., Cole, J. & Choi, J. Y., Jan 1 2006, In : IEEE Transactions on Audio, Speech and Language Processing. 14, 1, p. 232-244 13 p.

Research output: Contribution to journalArticle

2005

Distinctive feature based SVM discriminant features for improvements to phone recognition on telephone band speech

Borys, S. & Hasegawa-Johnson, M., Dec 1 2005, p. 697-700. 4 p.

Research output: Contribution to conferencePaper

Finding intonational boundaries using acoustic cues related to the voice source

Choi, J. Y., Hasegawa-Johnson, M. & Cole, J., Oct 1 2005, In : Journal of the Acoustical Society of America. 118, 4, p. 2579-2587 9 p.

Research output: Contribution to journalArticle

Landmark-based speech recognition: Report of the 2004 Johns Hopkins summer workshop

Hasegawa-Johnson, M., Baker, J., Borys, S., Chen, K., Coogan, E., Greenberg, S., Juneja, A., Kirchhoff, K., Livescu, K., Mohan, S., Muller, J., Sonmez, K. & Wang, T., Jan 1 2005, 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing. Institute of Electrical and Electronics Engineers Inc., p. I213-I216 1415088. (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; vol. I).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Simultaneous recognition of words and prosody in the Boston University Radio Speech Corpus

Hasegawa-Johnson, M. A., Chen, K., Cole, J. S., Borys, S., Kim, S. S., Cohen, A., Zhang, T., Choi, J. Y., Kim, H., Yoon, T. & Chavarria, S., Jul 1 2005, In : Speech Communication. 46, 3-4, p. 418-439 22 p.

Research output: Contribution to journalArticle

2004

A factorial HMM approach to robust isolated digit recognition in background music

Deoras, A. N. & Hasegawa-Johnson, M., Jan 1 2004, p. 2093-2096. 4 p.

Research output: Contribution to conferencePaper

Automatic detection of contrast for speech understanding

Zhang, T., Hasegawa-Johnson, M. A. & Levinson, S. E., Jan 1 2004, p. 581-584. 4 p.

Research output: Contribution to conferencePaper

Automatic recognition of pitch movements using multilayer perception and time-delay recursive neural network

Kim, S. S., Hasegawa-Johnson, M. & Chen, K., Jul 1 2004, In : IEEE Signal Processing Letters. 11, 7, p. 645-648 4 p.

Research output: Contribution to journalArticle

AVICAR: Audio-Visual Speech Corpus in a Car Environment

Lee, B., Hasegawa-Johnson, M. A., Goudeseune, C., Kamdar, S., Borys, S., Liu, M. & Huang, T. S., Jan 1 2004, p. 2489-2492. 4 p.

Research output: Contribution to conferencePaper

Children's emotion recognition in an intelligent tutoring scenario

Zhang, T., Hasegawa-Johnson, M. & Levinson, S. E., Jan 1 2004, p. 1441-1444. 4 p.

Research output: Contribution to conferencePaper

Intertranscriber reliability of prosodic labeling on telephone conversation using ToBI

Yoon, T. J., Chavarría, S., Cole, J. & Hasegawa-Johnson, M., Jan 1 2004, p. 2729-2732. 4 p.

Research output: Contribution to conferencePaper

Model Enforcement: A Unified Feature Transformation Framework for Classification and Recognition

Omar, M. K. & Hasegawa-Johnson, M., Oct 2004, In : IEEE Transactions on Signal Processing. 52, 10, p. 2701-2710 10 p.

Research output: Contribution to journalArticle

Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models

Borys, S., Hasegawa-Johnson, M., Cole, J. & Cohen, A., Jan 1 2004, p. 3013-3016. 4 p.

Research output: Contribution to conferencePaper

Modeling pronunciation variation using artificial neural networks for English spontaneous speech

Chen, K. & Hasegawa-Johnson, M. A., Jan 1 2004, p. 1461-1464. 4 p.

Research output: Contribution to conferencePaper

Semantic analysis for a speech user interface in an intelligent tutoring system

Ren, Y., Hasegawa-Johnson, M. A. & Levinson, S. E., Dec 1 2004, p. 313-315. 3 p.

Research output: Contribution to conferencePaper

Source separation using particle filters

Gandhi, M. A. & Hasegawa-Johnson, M. A., Jan 1 2004, p. 2673-2676. 4 p.

Research output: Contribution to conferencePaper

Stop consonant classification by dynamic formant trajectory

Zheng, Y., Hasegawa-Johnson, M. & Borys, S., Jan 1 2004, p. 2481-2484. 4 p.

Research output: Contribution to conferencePaper

2003

Analysis of the three-dimensional tongue shape using a three-index factor analysis model

Zheng, Y., Hasegawa-Johnson, M. & Pizza, S., Jan 1 2003, In : Journal of the Acoustical Society of America. 113, 1, p. 478-486 9 p.

Research output: Contribution to journalArticle

Approximately Independent Factors of Speech Using Nonlinear Symplectic Transformation

Omar, M. K. & Hasegawa-Johnson, M., Nov 1 2003, In : IEEE Transactions on Speech and Audio Processing. 11, 6, p. 660-671 12 p.

Research output: Contribution to journalArticle