Modeling video using input/output markov models with application to multi-modal event detection

Ashutosh Garg, Milind R. Naphade, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Generation and dissemination of digital media content poses a challenging problem of efficient storage and retrieval. Of particular interest to us are audio and visual content. From sharing of picture albums and home videos to movie advertisement through interactive preview clips, live broadcasts of various shows or multimedia reports of news as it happens, multimedia information has found in the internet and the television powerful media to reach us. With innovations in hand-held and portable computing devices and wired and wireless communication technology (pocket PCs, organizers, cell-phones) on one end and broadband internet devices on the other, supply and dissemination of unclassified multimedia is overwhelming. Humans assimilate content at a semantic level and apply their knowledge to the task of sifting through large volumes of multimodal data. To invent tools that can gain widespread popularity we must try to emulate human assimilation of this content. We are thus faced with the problem of multimedia understanding if we are to bridge the gap between media features and semantics.

Original languageEnglish (US)
Title of host publicationHandbook of Video Databases
Subtitle of host publicationDesign and Applications
PublisherCRC Press
Pages23-44
Number of pages22
ISBN (Electronic)9780203489864
ISBN (Print)9780849370069
StatePublished - Jan 1 2003

Fingerprint

Semantics
Internet
Digital storage
Television
Innovation
Communication

ASJC Scopus subject areas

  • Computer Science(all)
  • Engineering(all)

Cite this

Garg, A., Naphade, M. R., & Huang, T. S. (2003). Modeling video using input/output markov models with application to multi-modal event detection. In Handbook of Video Databases: Design and Applications (pp. 23-44). CRC Press.

Modeling video using input/output markov models with application to multi-modal event detection. / Garg, Ashutosh; Naphade, Milind R.; Huang, Thomas S.

Handbook of Video Databases: Design and Applications. CRC Press, 2003. p. 23-44.

Research output: Chapter in Book/Report/Conference proceedingChapter

Garg, A, Naphade, MR & Huang, TS 2003, Modeling video using input/output markov models with application to multi-modal event detection. in Handbook of Video Databases: Design and Applications. CRC Press, pp. 23-44.
Garg A, Naphade MR, Huang TS. Modeling video using input/output markov models with application to multi-modal event detection. In Handbook of Video Databases: Design and Applications. CRC Press. 2003. p. 23-44
Garg, Ashutosh ; Naphade, Milind R. ; Huang, Thomas S. / Modeling video using input/output markov models with application to multi-modal event detection. Handbook of Video Databases: Design and Applications. CRC Press, 2003. pp. 23-44
@inbook{643521d8d1ee46f09adb28bae61ed9a6,
title = "Modeling video using input/output markov models with application to multi-modal event detection",
abstract = "Generation and dissemination of digital media content poses a challenging problem of efficient storage and retrieval. Of particular interest to us are audio and visual content. From sharing of picture albums and home videos to movie advertisement through interactive preview clips, live broadcasts of various shows or multimedia reports of news as it happens, multimedia information has found in the internet and the television powerful media to reach us. With innovations in hand-held and portable computing devices and wired and wireless communication technology (pocket PCs, organizers, cell-phones) on one end and broadband internet devices on the other, supply and dissemination of unclassified multimedia is overwhelming. Humans assimilate content at a semantic level and apply their knowledge to the task of sifting through large volumes of multimodal data. To invent tools that can gain widespread popularity we must try to emulate human assimilation of this content. We are thus faced with the problem of multimedia understanding if we are to bridge the gap between media features and semantics.",
author = "Ashutosh Garg and Naphade, {Milind R.} and Huang, {Thomas S.}",
year = "2003",
month = "1",
day = "1",
language = "English (US)",
isbn = "9780849370069",
pages = "23--44",
booktitle = "Handbook of Video Databases",
publisher = "CRC Press",

}

TY - CHAP

T1 - Modeling video using input/output markov models with application to multi-modal event detection

AU - Garg, Ashutosh

AU - Naphade, Milind R.

AU - Huang, Thomas S.

PY - 2003/1/1

Y1 - 2003/1/1

N2 - Generation and dissemination of digital media content poses a challenging problem of efficient storage and retrieval. Of particular interest to us are audio and visual content. From sharing of picture albums and home videos to movie advertisement through interactive preview clips, live broadcasts of various shows or multimedia reports of news as it happens, multimedia information has found in the internet and the television powerful media to reach us. With innovations in hand-held and portable computing devices and wired and wireless communication technology (pocket PCs, organizers, cell-phones) on one end and broadband internet devices on the other, supply and dissemination of unclassified multimedia is overwhelming. Humans assimilate content at a semantic level and apply their knowledge to the task of sifting through large volumes of multimodal data. To invent tools that can gain widespread popularity we must try to emulate human assimilation of this content. We are thus faced with the problem of multimedia understanding if we are to bridge the gap between media features and semantics.

AB - Generation and dissemination of digital media content poses a challenging problem of efficient storage and retrieval. Of particular interest to us are audio and visual content. From sharing of picture albums and home videos to movie advertisement through interactive preview clips, live broadcasts of various shows or multimedia reports of news as it happens, multimedia information has found in the internet and the television powerful media to reach us. With innovations in hand-held and portable computing devices and wired and wireless communication technology (pocket PCs, organizers, cell-phones) on one end and broadband internet devices on the other, supply and dissemination of unclassified multimedia is overwhelming. Humans assimilate content at a semantic level and apply their knowledge to the task of sifting through large volumes of multimodal data. To invent tools that can gain widespread popularity we must try to emulate human assimilation of this content. We are thus faced with the problem of multimedia understanding if we are to bridge the gap between media features and semantics.

UR - http://www.scopus.com/inward/record.url?scp=33646687616&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646687616&partnerID=8YFLogxK

M3 - Chapter

AN - SCOPUS:33646687616

SN - 9780849370069

SP - 23

EP - 44

BT - Handbook of Video Databases

PB - CRC Press

ER -