TY - GEN
T1 - LC-NAS
T2 - 10th International Conference on 3D Vision, 3DV 2022
AU - Li, Guohao
AU - Xu, Mengmeng
AU - Giancola, Silvio
AU - Thabet, Ali
AU - Ghanem, Bernard
N1 - Funding Information:
We presented an automatic neural architecture search that considers the latency factor in the search. We designed a loss function that constrains the latency for a given hardware. We show with empirical results that our architectures LC-NAS reach the latency for which they have been designed on ModelNet10 and generalize on ModelNet40. Furthermore, we showed transfer capabilities of LC-NAS for part segmentation, showing state-of-the-art results on the PartNet benchmark. We envision LC-NAS to be used in time-constrained applications such as autonomous driv- ing, robotics, and embedded systems, where latency is of paramount importance for the fulfillment of the visual task. Acknowledgments. This work was supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research through the Visual Computing Center (VCC) funding.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Point cloud architecture design has become a crucial problem for deep learning in 3D. Several efforts have been made to manually design architectures targeting high accuracy in point cloud tasks such as classification, segmentation, and detection. Recent progress in automatic Neural Architecture Search (NAS) minimizes the human effort in network design and optimizes architectures for high performance. However, those efforts fail to consider crucial factors such as latency during inference, which is of high importance in time-critical and hardware-bounded applications like self-driving cars, robot navigation, and mobile applications. In this paper, we introduce a new NAS framework, dubbed LC-NAS, that searches for point cloud architectures constrained to a target latency. We implement a novel latency constraint formulation for the trade-off between accuracy and latency in our architecture search. Contrary to previous works, our latency loss enables us to find the best architecture with latency near a specific target value, which is crucial when the end task is to be deployed in a limited hardware setting. Extensive experiments show that LC-NAS is able to find state-of-the-art architectures for point cloud classification in ModelNet40 with a minimal computational cost. We also show how our searched architectures achieve any desired latency with a reasonably low drop in accuracy. Finally, we show how our searched architectures easily transfer to the part segmentation task on PartNet, where we achieve state-of-the-art results with significantly lower latency.
AB - Point cloud architecture design has become a crucial problem for deep learning in 3D. Several efforts have been made to manually design architectures targeting high accuracy in point cloud tasks such as classification, segmentation, and detection. Recent progress in automatic Neural Architecture Search (NAS) minimizes the human effort in network design and optimizes architectures for high performance. However, those efforts fail to consider crucial factors such as latency during inference, which is of high importance in time-critical and hardware-bounded applications like self-driving cars, robot navigation, and mobile applications. In this paper, we introduce a new NAS framework, dubbed LC-NAS, that searches for point cloud architectures constrained to a target latency. We implement a novel latency constraint formulation for the trade-off between accuracy and latency in our architecture search. Contrary to previous works, our latency loss enables us to find the best architecture with latency near a specific target value, which is crucial when the end task is to be deployed in a limited hardware setting. Extensive experiments show that LC-NAS is able to find state-of-the-art architectures for point cloud classification in ModelNet40 with a minimal computational cost. We also show how our searched architectures achieve any desired latency with a reasonably low drop in accuracy. Finally, we show how our searched architectures easily transfer to the part segmentation task on PartNet, where we achieve state-of-the-art results with significantly lower latency.
KW - 3D Computer Vision
KW - Graph Neural Networks
KW - Neural Architecture Search
UR - http://www.scopus.com/inward/record.url?scp=85149344864&partnerID=8YFLogxK
U2 - 10.1109/3DV57658.2022.00018
DO - 10.1109/3DV57658.2022.00018
M3 - Conference contribution
AN - SCOPUS:85149344864
T3 - Proceedings - 2022 International Conference on 3D Vision, 3DV 2022
SP - 52
EP - 62
BT - Proceedings - 2022 International Conference on 3D Vision, 3DV 2022
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 September 2022 through 15 September 2022
ER -