TY - GEN
T1 - Data Augmentation for Human Activity Recognition via Condition Space Interpolation within a Generative Model
AU - Wang, Tianshi
AU - Chen, Yizhuo
AU - Yang, Qikai
AU - Sun, Dachun
AU - Wang, Ruijie
AU - Li, Jinyang
AU - Kimura, Tomoyoshi
AU - Abdelzaher, Tarek
N1 - Research reported in this paper was sponsored in part byDEVCOM ARL under Cooperative Agreement W911NF- 17-2-0196, NSF CNS 20-38817, and the Boeing Company. Itwas also supported in part by ACE, one of the seven centersin JUMP 2.0, a Semiconductor Research Corporation (SRC)program sponsored by DARPA. The views and conclusionscontained in this document are those of the authors andshould not be interpreted as representing the official policies,either expressed or implied, of the Army Research Laboratory,DARPA, or the U.S. Government. The U.S. Government isauthorized to reproduce and distribute reprints for Governmentpurposes notwithstanding any copyright notation herein.
PY - 2024
Y1 - 2024
N2 - This paper presents a generative data augmentation approach for human activity recognition (HAR) to close the distribution gap between laboratory training and real-world deployment. Despite the recent success of deep learning methods in wearable sensor-based HAR tasks, performance degradation occurs during real-world deployment due to training data scarcity and the vast variability in human activities. In light of this, we aim to enhance the diversity of training datasets by generating new data points within the vicinity of existing samples, as informed by domain expertise. Unlike the commonly utilized methods that augment data by interpolating in data space or feature space, we innovate by applying interpolation in the condition space of a conditional generative model to augment HAR datasets. We use domain-specific knowledge to extract statistical metrics from sensor data, which serve as conditions to direct the generation process. We demonstrate how a conditional generative diffusion model, steered by interpolated conditions, can synthesize realistic new data with various high-level features that benefit the robustness of the downstream HAR models. Our methodology advances the use of interpolation in data augmentation by exploring the capability of a state-of-the-art generative model, offering novel perspectives for bolstering the robustness and generalizability of HAR systems. Experimental results demonstrate that condition space interpolation outperforms the conventional interpolation-based and generative model-based augmentation methods across various datasets and downstream classifier combinations.
AB - This paper presents a generative data augmentation approach for human activity recognition (HAR) to close the distribution gap between laboratory training and real-world deployment. Despite the recent success of deep learning methods in wearable sensor-based HAR tasks, performance degradation occurs during real-world deployment due to training data scarcity and the vast variability in human activities. In light of this, we aim to enhance the diversity of training datasets by generating new data points within the vicinity of existing samples, as informed by domain expertise. Unlike the commonly utilized methods that augment data by interpolating in data space or feature space, we innovate by applying interpolation in the condition space of a conditional generative model to augment HAR datasets. We use domain-specific knowledge to extract statistical metrics from sensor data, which serve as conditions to direct the generation process. We demonstrate how a conditional generative diffusion model, steered by interpolated conditions, can synthesize realistic new data with various high-level features that benefit the robustness of the downstream HAR models. Our methodology advances the use of interpolation in data augmentation by exploring the capability of a state-of-the-art generative model, offering novel perspectives for bolstering the robustness and generalizability of HAR systems. Experimental results demonstrate that condition space interpolation outperforms the conventional interpolation-based and generative model-based augmentation methods across various datasets and downstream classifier combinations.
KW - data augmentation
KW - diffusion model
KW - generative model
KW - human activity recognition
UR - http://www.scopus.com/inward/record.url?scp=85203246929&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85203246929&partnerID=8YFLogxK
U2 - 10.1109/ICCCN61486.2024.10637566
DO - 10.1109/ICCCN61486.2024.10637566
M3 - Conference contribution
AN - SCOPUS:85203246929
T3 - Proceedings - International Conference on Computer Communications and Networks, ICCCN
BT - ICCCN 2024 - 2024 33rd International Conference on Computer Communications and Networks
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 33rd International Conference on Computer Communications and Networks, ICCCN 2024
Y2 - 29 July 2024 through 31 July 2024
ER -