While deep learning models are increasingly deployed on edge devices, they often lack robustness to dynamic changes in the sensed data. Such changes stem from sensor drift, i.e., deviations of the deployed-time data from the data used for offline training, caused for example by specific sensor placement or naturally changing sensing conditions.
Achieving the desired robustness therefore requires either an invariant architecture or specialized training, such as data augmentation. Alternatively, input transformations can be treated as a domain shift problem and addressed by post-deployment model adaptation.
In this paper, we train a parameterized subspace of configurable networks, such that an optimal network for a particular parameter setting is part of this subspace. The obtained subspace is low-dimensional and has a surprisingly simple structure even for complex, non-invertible transformations of the input, which makes subspace-configurable networks (SCNs) exceptionally efficient when storage and computing resources are limited.
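To make the construction concrete, below is a minimal PyTorch sketch of how an inference network could be drawn from a D-dimensional configuration subspace: a small configuration network maps the transformation parameter alpha (here a rotation angle) to coefficients beta, and each layer's weights are the beta-weighted combination of D base weight sets. The layer sizes, the (cos alpha, sin alpha) encoding, and the softmax normalization of beta are illustrative assumptions, not the exact architecture from the paper.

```python
# Illustrative sketch of the subspace-configurable idea; all sizes and
# the configuration-network design are assumptions for demonstration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SCNLinear(nn.Module):
    """Linear layer whose weights are a beta-weighted combination of D base weight sets."""
    def __init__(self, in_features, out_features, D):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(D, out_features, in_features) * 0.01)
        self.bias = nn.Parameter(torch.zeros(D, out_features))

    def forward(self, x, beta):
        # beta: (D,) configuration coefficients; combine the D base layers.
        W = torch.einsum('d,doi->oi', beta, self.weight)
        b = torch.einsum('d,do->o', beta, self.bias)
        return F.linear(x, W, b)

class SCN(nn.Module):
    """Configuration network maps the transformation parameter alpha to beta,
    which selects one inference network from the D-dimensional subspace."""
    def __init__(self, in_features, hidden, num_classes, D):
        super().__init__()
        self.config_net = nn.Sequential(nn.Linear(2, 32), nn.ReLU(), nn.Linear(32, D))
        self.fc1 = SCNLinear(in_features, hidden, D)
        self.fc2 = SCNLinear(hidden, num_classes, D)

    def forward(self, x, alpha):
        # Encode a rotation angle alpha as (cos, sin) and map it to beta.
        enc = torch.stack([torch.cos(alpha), torch.sin(alpha)])
        beta = torch.softmax(self.config_net(enc), dim=-1)
        h = torch.relu(self.fc1(x, beta))
        return self.fc2(h, beta)

# Usage: pick D = 3 configuration dimensions and query the model for one rotation angle.
model = SCN(in_features=784, hidden=64, num_classes=10, D=3)
logits = model(torch.randn(1, 784), torch.tensor(0.7))  # alpha = rotation angle in radians
```

At deployment time only the configuration network and the D base weight sets need to be stored; switching to a new input transformation amounts to recomputing beta and the combined weights, rather than retraining or storing a separate model per setting.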
TL;DR? Click the Play button and listen to a discussion podcast generated by AI.
Figure: SCN test-set accuracy on 2D rotations. SCNs achieve high test accuracy already for low D (number of configuration dimensions).
See the interactive SCN beta-subspace visualization page for all experiments in the paper.
Fruit Classification on Arduino Nano 33 BLE
Traffic Sign Classification on ESP32-S3-EYE
This paper addresses the problem of model reconfiguration and robustness to input transformations under severe resource constraints. We design subspace-configurable networks (SCNs) that learn low-dimensional configuration subspaces and draw optimal model weights from this structure. We achieve surprisingly high accuracies even for a low number of configuration dimensions and observe a simple and intuitive structure of the subspace of optimal solutions for all investigated input transformations.
@misc{wang2024subspaceconfigurablenetworks,
  title={Subspace-Configurable Networks},
  author={Dong Wang and Olga Saukh and Xiaoxi He and Lothar Thiele},
  year={2024},
  eprint={2305.13536},
  archivePrefix={arXiv},
  primaryClass={cs.LG},
  url={https://arxiv.org/abs/2305.13536},
}