TY - GEN
T1 - VibroFM
T2 - 21st IEEE International Conference on Mobile Ad-Hoc and Smart Systems, MASS 2024
AU - Kimura, Tomoyoshi
AU - Li, Jinyang
AU - Wang, Tianshi
AU - Chen, Yizhuo
AU - Wang, Ruijie
AU - Kara, Denizhan
AU - Wigness, Maggie
AU - Bhattacharyya, Joydeep
AU - Srivatsa, Mudhakar
AU - Liu, Shengzhong
AU - Srivastava, Mani
AU - Diggavi, Suhas
AU - Abdelzaher, Tarek
N1 - Research reported in this paper was sponsored in part by DEVCOM ARL under Cooperative Agreement W911NF-17-2-0196 (ARL IoBT CRA), and in part by NSF CNS 20-38817, and the Boeing Company. It was also supported in part by ACE, one of the seven centers in JUMP 2.0, a Semiconductor Research Corporation (SRC) program sponsored by DARPA. The views and conclusions contained in this document are those of the authors, not the Army Research Laboratory or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein.
PY - 2024
Y1 - 2024
N2 - The paper argues for the feasibility and utility of micro foundation models (μFMs), a key direction for future smart IoT/CPS systems that exploits advances in self-supervised pretraining to support multiple downstream tasks. We demonstrate key beneficial properties such as latent representation independence from the downstream task, robustness to domain shifts, and ability to learn from unlabeled data. Importantly, we demonstrate the emergence of these properties after pretraining with only moderate amounts of unlabeled data, earning the name μFMs. To make the argument, evaluate model efficacy, and surface some of the underlying challenges, this paper describes a vibration-based μFM, called VibroFM, pretrained with moderate amounts of unlabeled acoustic and seismic sensing data, to support target classification and tracking applications. VibroFM is pretrained in an environment-agnostic fashion using unlabeled sensor data. It can then be fine-tuned to a given deployment using a small amount of in-situ labeled data. The paper shows that VibroFM (i) improves the robustness of several downstream tasks, (ii) efficiently adapts to different environmental conditions (using only small amounts of fine-tuning), and (iii) allows few-shot generalization to unseen targets. We further show that VibroFM can execute in real time on embedded sensor nodes. We compare the robustness and performance of VibroFM to conventional supervised deep neural networks, showing the advantages of the former. Combined with the feasibility of executing μFMs in resource-limited settings and the sufficiency of only moderate amounts of data for their pretraining, we conclude the importance of micro foundation models as a promising research direction for the IoT/CPS community.
AB - The paper argues for the feasibility and utility of micro foundation models (μFMs), a key direction for future smart IoT/CPS systems that exploits advances in self-supervised pretraining to support multiple downstream tasks. We demonstrate key beneficial properties such as latent representation independence from the downstream task, robustness to domain shifts, and ability to learn from unlabeled data. Importantly, we demonstrate the emergence of these properties after pretraining with only moderate amounts of unlabeled data, earning the name μFMs. To make the argument, evaluate model efficacy, and surface some of the underlying challenges, this paper describes a vibration-based μFM, called VibroFM, pretrained with moderate amounts of unlabeled acoustic and seismic sensing data, to support target classification and tracking applications. VibroFM is pretrained in an environment-agnostic fashion using unlabeled sensor data. It can then be fine-tuned to a given deployment using a small amount of in-situ labeled data. The paper shows that VibroFM (i) improves the robustness of several downstream tasks, (ii) efficiently adapts to different environmental conditions (using only small amounts of fine-tuning), and (iii) allows few-shot generalization to unseen targets. We further show that VibroFM can execute in real time on embedded sensor nodes. We compare the robustness and performance of VibroFM to conventional supervised deep neural networks, showing the advantages of the former. Combined with the feasibility of executing μFMs in resource-limited settings and the sufficiency of only moderate amounts of data for their pretraining, we conclude the importance of micro foundation models as a promising research direction for the IoT/CPS community.
KW - Foundation Model
KW - Internet of Things
KW - Self-Supervised Learning
UR - http://www.scopus.com/inward/record.url?scp=85210262194&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85210262194&partnerID=8YFLogxK
U2 - 10.1109/MASS62177.2024.00014
DO - 10.1109/MASS62177.2024.00014
M3 - Conference contribution
AN - SCOPUS:85210262194
T3 - Proceedings - 2024 IEEE 21st International Conference on Mobile Ad-Hoc and Smart Systems, MASS 2024
SP - 10
EP - 18
BT - Proceedings - 2024 IEEE 21st International Conference on Mobile Ad-Hoc and Smart Systems, MASS 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 23 September 2024 through 25 September 2024
ER -