TY - JOUR
T1 - Optimizing video analytics with declarative model relationships
AU - Romero, Francisco
AU - Hauswald, Johann
AU - Partap, Aditi
AU - Kang, Daniel
AU - Zaharia, Matei
AU - Kozyrakis, Christos
N1 - Funding Information:
We thank the Stanford Platform Lab and its affiliates (Cisco, Face-book, Google, Nasdaq, NEC, VMware, and Wells Fargo), the Open Philanthropy project, and Sutter Hill Ventures. We also thank affiliates of the Stanford DAWN project—Ant Financial, Facebook, Google, and VMware—as well as Toyota Research Institute (“TRI”), Cisco, SAP, and the NSF under CAREER grant CNS-1651570. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF. TRI provided funds to assist the authors with their research but this article solely reflects the opinions and conclusions of its authors and not TRI or any other Toyota entity. Francisco Romero was supported by a Stanford DARE Fellowship.
Funding Information:
We thank the Stanford Platform Lab and its affiliates (Cisco, Facebook, Google, Nasdaq, NEC, VMware, and Wells Fargo), the Open Philanthropy project, and Sutter Hill Ventures. We also thank affiliates of the Stanford DAWN project—Ant Financial, Facebook, Google, and VMware—as well as Toyota Research Institute (“TRI”), Cisco, SAP, and the NSF under CAREER grant CNS-1651570. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF. TRI provided funds to assist the authors with their research but this article solely reflects the opinions and conclusions of its authors and not TRI or any other Toyota entity. Francisco Romero was supported by a Stanford DARE Fellowship.
Publisher Copyright:
© 2022, VLDB Endowment. All rights reserved.
PY - 2022/11
Y1 - 2022/11
N2 - The availability of vast video collections and the accuracy of ML models has generated significant interest in video analytics systems. Since naively processing all frames using expensive models is impractical, researchers have proposed optimizations such as selectively using faster but less accurate models to replace or filter frames for expensive models. However, these optimizations are difficult to apply on queries with multiple predicates and models, as users must manually explore a large optimization space. Without significant systems expertise or time investment, an analyst may manually create an execution plan that is unnecessarily expensive and/or terribly inaccurate. We propose Relational Hints, a declarative interface that allows users to suggest ML model relationships based on domain knowledge. Users can express two key relationships: when a model can replace another (CAN REPLACE) and when a model can be used to filter frames for another (CAN FILTER). We aim to design an interface to express model relationships informed by domain specific knowledge and define the constraints by which these relationships hold. We then present the VIVA video analytics system that uses relational hints to optimize SQL queries on video datasets. VIVA automatically selects and validates the hints applicable to the query, generates possible query plans using a formal set of transformations, and finds the best performance plan that meets a user’s accuracy requirements. VIVA relieves users from rewriting and manually optimizing video queries as new models become available and execution environments evolve. We evaluate VIVA implemented on top of Spark and show that hints improve performance up to 16.6× without sacrificing accuracy.
AB - The availability of vast video collections and the accuracy of ML models has generated significant interest in video analytics systems. Since naively processing all frames using expensive models is impractical, researchers have proposed optimizations such as selectively using faster but less accurate models to replace or filter frames for expensive models. However, these optimizations are difficult to apply on queries with multiple predicates and models, as users must manually explore a large optimization space. Without significant systems expertise or time investment, an analyst may manually create an execution plan that is unnecessarily expensive and/or terribly inaccurate. We propose Relational Hints, a declarative interface that allows users to suggest ML model relationships based on domain knowledge. Users can express two key relationships: when a model can replace another (CAN REPLACE) and when a model can be used to filter frames for another (CAN FILTER). We aim to design an interface to express model relationships informed by domain specific knowledge and define the constraints by which these relationships hold. We then present the VIVA video analytics system that uses relational hints to optimize SQL queries on video datasets. VIVA automatically selects and validates the hints applicable to the query, generates possible query plans using a formal set of transformations, and finds the best performance plan that meets a user’s accuracy requirements. VIVA relieves users from rewriting and manually optimizing video queries as new models become available and execution environments evolve. We evaluate VIVA implemented on top of Spark and show that hints improve performance up to 16.6× without sacrificing accuracy.
UR - http://www.scopus.com/inward/record.url?scp=85144539254&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85144539254&partnerID=8YFLogxK
U2 - 10.14778/3570690.3570695
DO - 10.14778/3570690.3570695
M3 - Article
AN - SCOPUS:85144539254
SN - 2150-8097
VL - 16
SP - 447
EP - 460
JO - Proceedings of the VLDB Endowment
JF - Proceedings of the VLDB Endowment
IS - 3
ER -