Skip to main navigation Skip to search Skip to main content

Pseudo Dataset Generation for Out-of-domain Multi-Camera View Recommendation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Multi-camera systems are indispensable in movies, TV shows, and other media. Selecting the appropriate camera at every timestamp has a decisive impact on production quality and audience preferences. Learning-based view recommendation frameworks can assist professionals in decision-making. However, they often struggle outside of their training domains. The scarcity of labeled multi-camera view recommendation datasets exacerbates the issue. Based on the insight that many videos are edited from the original multi-camera videos, we propose transforming regular videos into pseudo-labeled multi-camera view recommendation datasets. Promisingly, by training the model on pseudo-labeled datasets stemming from videos in the target domain, we achieve a 68% relative improvement in the model's accuracy in the target domain and bridge the accuracy gap between in-domain and never-before-seen domains.

Original languageEnglish (US)
Title of host publication2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331529543
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024 - Tokyo, Japan
Duration: Dec 8 2024Dec 11 2024

Publication series

Name2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024

Conference

Conference2024 IEEE International Conference on Visual Communications and Image Processing, VCIP 2024
Country/TerritoryJapan
CityTokyo
Period12/8/2412/11/24

Keywords

  • cinematography
  • semi-supervised learning

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Signal Processing
  • Media Technology

Fingerprint

Dive into the research topics of 'Pseudo Dataset Generation for Out-of-domain Multi-Camera View Recommendation'. Together they form a unique fingerprint.

Cite this