TY - JOUR
T1 - Predicting Object Interactions with Behavior Primitives
T2 - 7th Conference on Robot Learning, CoRL 2023
AU - Chen, Haonan
AU - Niu, Yilong
AU - Hong, Kaiwen
AU - Liu, Shuijing
AU - Wang, Yixuan
AU - Li, Yunzhu
AU - Driggs-Campbell, Katherine
N1 - This work was supported by ZJU-UIUC Joint Research Center Project No. funded by Zhejiang University.
We thank Haochen Shi's tireless assistance with the GNN implementation, as well as Neeloy Chakraborty, Peter Du, Pulkit Katdare, Ye-Ji Mun, and Zhe Huang for their insightful feedback and suggestions. This work was supported by ZJU-UIUC Joint Research Center Project No. DREMES 202003, funded by Zhejiang University.
PY - 2023
Y1 - 2023
N2 - Stowing, the task of placing objects in cluttered shelves or bins, is a common task in warehouse and manufacturing operations. However, this task is still predominantly carried out by human workers as stowing is challenging to automate due to the complex multi-object interactions and long-horizon nature of the task. Previous works typically involve extensive data collection and costly human labeling of semantic priors across diverse object categories. This paper presents a method to learn a generalizable robot stowing policy from predictive model of object interactions and a single demonstration with behavior primitives. We propose a novel framework that utilizes Graph Neural Networks to predict object interactions within the parameter space of behavioral primitives. We further employ primitive-augmented trajectory optimization to search the parameters of a predefined library of heterogeneous behavioral primitives to instantiate the control action. Our framework enables robots to proficiently execute long-horizon stowing tasks with a few keyframes (3-4) from a single demonstration. Despite being solely trained in a simulation, our framework demonstrates remarkable generalization capabilities. It efficiently adapts to a broad spectrum of real-world conditions, including various shelf widths, fluctuating quantities of objects, and objects with diverse attributes such as sizes and shapes.
AB - Stowing, the task of placing objects in cluttered shelves or bins, is a common task in warehouse and manufacturing operations. However, this task is still predominantly carried out by human workers as stowing is challenging to automate due to the complex multi-object interactions and long-horizon nature of the task. Previous works typically involve extensive data collection and costly human labeling of semantic priors across diverse object categories. This paper presents a method to learn a generalizable robot stowing policy from predictive model of object interactions and a single demonstration with behavior primitives. We propose a novel framework that utilizes Graph Neural Networks to predict object interactions within the parameter space of behavioral primitives. We further employ primitive-augmented trajectory optimization to search the parameters of a predefined library of heterogeneous behavioral primitives to instantiate the control action. Our framework enables robots to proficiently execute long-horizon stowing tasks with a few keyframes (3-4) from a single demonstration. Despite being solely trained in a simulation, our framework demonstrates remarkable generalization capabilities. It efficiently adapts to a broad spectrum of real-world conditions, including various shelf widths, fluctuating quantities of objects, and objects with diverse attributes such as sizes and shapes.
KW - Graph-Based Neural Dynamics
KW - Model Learning
KW - Multi-Object Interactions
KW - Robotic Manipulation
UR - http://www.scopus.com/inward/record.url?scp=85174109701&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174109701&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85174109701
SN - 2640-3498
VL - 229
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 6 November 2023 through 9 November 2023
ER -