Multichannel and multimodality person identification

Ming Liu, Yanxiang Chen, Xi Zhou, Xiaodan Zhuang, Mark Allan Hasegawa-Johnson, Thomas S Huang

Research output: Contribution to journal › Conference article

Abstract

A person's identity is important high-level information for video analysis and retrieval. With the growth of multimedia data, recordings are not only multimodal but also multichannel (microphone arrays, camera arrays). In this paper, we describe the UIUC team's multimodal person identification system for the CLEAR 2007 evaluation. The audio-only system is based on a newly proposed model, the Chain of Gaussian Mixtures. The visual-only system is a face recognition module based on a nearest-neighbor classifier in appearance space. The final system fuses 7-channel microphone recordings and 4 camera recordings at the decision level. The experimental results indicate the effectiveness of the speaker modeling methods and the fusion scheme.
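As an illustration of what decision-level fusion across several microphone and camera channels can look like, here is a minimal Python sketch. The abstract does not state the fusion rule, so everything in it is an assumption made for illustration: the min-max normalization, the averaging (sum-rule) combination, the `audio_weight` parameter, and the function names `normalize`, `fuse`, and `identify` are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from typing import Dict, List

# One channel's output: candidate identity -> score (higher is better).
Scores = Dict[str, float]


def normalize(scores: Scores) -> Scores:
    """Min-max scale one channel's scores to [0, 1] so channels are comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All candidates tie on this channel, so it carries no evidence.
        return {name: 0.0 for name in scores}
    return {name: (s - lo) / (hi - lo) for name, s in scores.items()}


def fuse(channels: List[Scores]) -> Scores:
    """Average the normalized scores over channels (assumed sum rule)."""
    totals: Dict[str, float] = defaultdict(float)
    for scores in channels:
        for name, s in normalize(scores).items():
            totals[name] += s / len(channels)
    return dict(totals)


def identify(audio: List[Scores], video: List[Scores],
             audio_weight: float = 0.5) -> str:
    """Combine per-modality fused scores and return the best identity.

    audio_weight is a hypothetical tuning parameter, not from the paper.
    """
    a, v = fuse(audio), fuse(video)
    combined = {
        name: audio_weight * a.get(name, 0.0)
        + (1.0 - audio_weight) * v.get(name, 0.0)
        for name in set(a) | set(v)
    }
    return max(combined, key=combined.get)
```

In the setting the abstract describes, `audio` would hold seven score dictionaries (one per microphone channel) and `video` four (one per camera). Each channel is scored independently and only the scores are combined, which is what distinguishes decision-level fusion from fusing features before classification.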

Original language: English (US)
Pages (from-to): 248-255
Number of pages: 8
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4625 LNCS
DOI: 10.1007/978-3-540-68585-2_23
State: Published - Jul 28 2008
Event: 2nd Annual Classification of Events, Activities and Relationships, CLEAR 2007 and Rich Transcription, RT 2007 - Baltimore, MD, United States
Duration: May 8 2007 to May 11 2007

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Multichannel and multimodality person identification. / Liu, Ming; Chen, Yanxiang; Zhou, Xi; Zhuang, Xiaodan; Hasegawa-Johnson, Mark Allan; Huang, Thomas S.

In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 4625 LNCS, 28.07.2008, p. 248-255.

@article{dc656b32529e48aa9ea77ef467569a23,
title = "Multichannel and multimodality person identification",
author = "Ming Liu and Yanxiang Chen and Xi Zhou and Xiaodan Zhuang and Hasegawa-Johnson, {Mark Allan} and Huang, {Thomas S}",
year = "2008",
month = "7",
day = "28",
doi = "10.1007/978-3-540-68585-2_23",
language = "English (US)",
volume = "4625 LNCS",
pages = "248--255",
journal = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
issn = "0302-9743",
publisher = "Springer Verlag",

}

TY - JOUR

T1 - Multichannel and multimodality person identification

AU - Liu, Ming

AU - Chen, Yanxiang

AU - Zhou, Xi

AU - Zhuang, Xiaodan

AU - Hasegawa-Johnson, Mark Allan

AU - Huang, Thomas S

PY - 2008/7/28

Y1 - 2008/7/28

UR - http://www.scopus.com/inward/record.url?scp=47749156390&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=47749156390&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-68585-2_23

DO - 10.1007/978-3-540-68585-2_23

M3 - Conference article

VL - 4625 LNCS

SP - 248

EP - 255

JO - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SN - 0302-9743

ER -