A Unified Framework for Video Summarization, Browsing, and Retrieval

Ziyou Xiong, Yong Rui, Regunathan Radhakrishnan, Ajay Divakaran, Thomas S. Huang

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

This chapter presents techniques for constructing the video "table of contents" (ToC), video highlights, and video index as well as the ways to integrate them into a unified framework. It reviews video analysis, representation, summarization, and retrieval. Further, a unified framework for video summarization and retrieval is described, followed by algorithms and experimental results on real-world video clips. Video content can be accessed by using either a top-down approach or a bottom-up approach. The top-down approach-that is, video browsing-is useful when one needs to get an "essence" of the content. The bottom-up approach-that is, video retrieval-is useful when one knows exactly what is needed from the content. In video summarization, the "essence" a summary should capture depends on whether or not the content is scripted. Because scripted content, such as news, drama, and movies, is carefully structured as a sequence of semantic units, one can get its essence by enabling a traversal through representative items from these semantic units. Hence, ToC-based video browsing caters to summarization of scripted content.

Original languageEnglish (US)
Title of host publicationHandbook of Image and Video Processing
PublisherElsevier Inc.
Pages1013-1029
Number of pages17
ISBN (Print)9780121197926
DOIs
StatePublished - Dec 1 2005

Fingerprint

Semantics

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Xiong, Z., Rui, Y., Radhakrishnan, R., Divakaran, A., & Huang, T. S. (2005). A Unified Framework for Video Summarization, Browsing, and Retrieval. In Handbook of Image and Video Processing (pp. 1013-1029). Elsevier Inc.. https://doi.org/10.1016/B978-012119792-6/50122-4

A Unified Framework for Video Summarization, Browsing, and Retrieval. / Xiong, Ziyou; Rui, Yong; Radhakrishnan, Regunathan; Divakaran, Ajay; Huang, Thomas S.

Handbook of Image and Video Processing. Elsevier Inc., 2005. p. 1013-1029.

Research output: Chapter in Book/Report/Conference proceedingChapter

Xiong, Z, Rui, Y, Radhakrishnan, R, Divakaran, A & Huang, TS 2005, A Unified Framework for Video Summarization, Browsing, and Retrieval. in Handbook of Image and Video Processing. Elsevier Inc., pp. 1013-1029. https://doi.org/10.1016/B978-012119792-6/50122-4
Xiong Z, Rui Y, Radhakrishnan R, Divakaran A, Huang TS. A Unified Framework for Video Summarization, Browsing, and Retrieval. In Handbook of Image and Video Processing. Elsevier Inc. 2005. p. 1013-1029 https://doi.org/10.1016/B978-012119792-6/50122-4
Xiong, Ziyou ; Rui, Yong ; Radhakrishnan, Regunathan ; Divakaran, Ajay ; Huang, Thomas S. / A Unified Framework for Video Summarization, Browsing, and Retrieval. Handbook of Image and Video Processing. Elsevier Inc., 2005. pp. 1013-1029
@inbook{b23e5cbacbbe4634a3a58b613f68f42c,
title = "A Unified Framework for Video Summarization, Browsing, and Retrieval",
abstract = "This chapter presents techniques for constructing the video {"}table of contents{"} (ToC), video highlights, and video index as well as the ways to integrate them into a unified framework. It reviews video analysis, representation, summarization, and retrieval. Further, a unified framework for video summarization and retrieval is described, followed by algorithms and experimental results on real-world video clips. Video content can be accessed by using either a top-down approach or a bottom-up approach. The top-down approach-that is, video browsing-is useful when one needs to get an {"}essence{"} of the content. The bottom-up approach-that is, video retrieval-is useful when one knows exactly what is needed from the content. In video summarization, the {"}essence{"} a summary should capture depends on whether or not the content is scripted. Because scripted content, such as news, drama, and movies, is carefully structured as a sequence of semantic units, one can get its essence by enabling a traversal through representative items from these semantic units. Hence, ToC-based video browsing caters to summarization of scripted content.",
author = "Ziyou Xiong and Yong Rui and Regunathan Radhakrishnan and Ajay Divakaran and Huang, {Thomas S.}",
year = "2005",
month = "12",
day = "1",
doi = "10.1016/B978-012119792-6/50122-4",
language = "English (US)",
isbn = "9780121197926",
pages = "1013--1029",
booktitle = "Handbook of Image and Video Processing",
publisher = "Elsevier Inc.",

}

TY - CHAP

T1 - A Unified Framework for Video Summarization, Browsing, and Retrieval

AU - Xiong, Ziyou

AU - Rui, Yong

AU - Radhakrishnan, Regunathan

AU - Divakaran, Ajay

AU - Huang, Thomas S.

PY - 2005/12/1

Y1 - 2005/12/1

N2 - This chapter presents techniques for constructing the video "table of contents" (ToC), video highlights, and video index as well as the ways to integrate them into a unified framework. It reviews video analysis, representation, summarization, and retrieval. Further, a unified framework for video summarization and retrieval is described, followed by algorithms and experimental results on real-world video clips. Video content can be accessed by using either a top-down approach or a bottom-up approach. The top-down approach-that is, video browsing-is useful when one needs to get an "essence" of the content. The bottom-up approach-that is, video retrieval-is useful when one knows exactly what is needed from the content. In video summarization, the "essence" a summary should capture depends on whether or not the content is scripted. Because scripted content, such as news, drama, and movies, is carefully structured as a sequence of semantic units, one can get its essence by enabling a traversal through representative items from these semantic units. Hence, ToC-based video browsing caters to summarization of scripted content.

AB - This chapter presents techniques for constructing the video "table of contents" (ToC), video highlights, and video index as well as the ways to integrate them into a unified framework. It reviews video analysis, representation, summarization, and retrieval. Further, a unified framework for video summarization and retrieval is described, followed by algorithms and experimental results on real-world video clips. Video content can be accessed by using either a top-down approach or a bottom-up approach. The top-down approach-that is, video browsing-is useful when one needs to get an "essence" of the content. The bottom-up approach-that is, video retrieval-is useful when one knows exactly what is needed from the content. In video summarization, the "essence" a summary should capture depends on whether or not the content is scripted. Because scripted content, such as news, drama, and movies, is carefully structured as a sequence of semantic units, one can get its essence by enabling a traversal through representative items from these semantic units. Hence, ToC-based video browsing caters to summarization of scripted content.

UR - http://www.scopus.com/inward/record.url?scp=33645750769&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33645750769&partnerID=8YFLogxK

U2 - 10.1016/B978-012119792-6/50122-4

DO - 10.1016/B978-012119792-6/50122-4

M3 - Chapter

AN - SCOPUS:33645750769

SN - 9780121197926

SP - 1013

EP - 1029

BT - Handbook of Image and Video Processing

PB - Elsevier Inc.

ER -