TY - GEN
T1 - Enhancing cross-task black-box transferability of adversarial examples with dispersion reduction
AU - Lu, Yantao
AU - Jia, Yunhan
AU - Wang, Jianyu
AU - Li, Bai
AU - Chai, Weiheng
AU - Carin, Lawrence
AU - Velipasalar, Senem
N1 - Generated from Scopus record by KAUST IRTS on 2021-02-09
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Neural networks are known to be vulnerable to carefully crafted adversarial examples, and these malicious samples often transfer, i.e., they remain adversarial even against other models. Although significant effort has been devoted to the transferability across models, surprisingly little attention has been paid to cross-task transferability, which represents the real-world cybercriminal's situation, where an ensemble of different defense/detection mechanisms need to be evaded all at once. We investigate the transferability of adversarial examples across a wide range of real-world computer vision tasks, including image classification, object detection, semantic segmentation, explicit content detection, and text detection. Our proposed attack minimizes the “dispersion” of the internal feature map, overcoming the limitations of existing attacks, that require task-specific loss functions and/or probing a target model. We conduct evaluation on open-source detection and segmentation models, as well as four different computer vision tasks provided by Google Cloud Vision (GCV) APIs. We demonstrate that our approach outperforms existing attacks by degrading performance of multiple CV tasks by a large margin with only modest perturbations.
AB - Neural networks are known to be vulnerable to carefully crafted adversarial examples, and these malicious samples often transfer, i.e., they remain adversarial even against other models. Although significant effort has been devoted to the transferability across models, surprisingly little attention has been paid to cross-task transferability, which represents the real-world cybercriminal's situation, where an ensemble of different defense/detection mechanisms need to be evaded all at once. We investigate the transferability of adversarial examples across a wide range of real-world computer vision tasks, including image classification, object detection, semantic segmentation, explicit content detection, and text detection. Our proposed attack minimizes the “dispersion” of the internal feature map, overcoming the limitations of existing attacks, that require task-specific loss functions and/or probing a target model. We conduct evaluation on open-source detection and segmentation models, as well as four different computer vision tasks provided by Google Cloud Vision (GCV) APIs. We demonstrate that our approach outperforms existing attacks by degrading performance of multiple CV tasks by a large margin with only modest perturbations.
UR - https://ieeexplore.ieee.org/document/9157043/
UR - http://www.scopus.com/inward/record.url?scp=85094207043&partnerID=8YFLogxK
U2 - 10.1109/CVPR42600.2020.00102
DO - 10.1109/CVPR42600.2020.00102
M3 - Conference contribution
SP - 937
EP - 946
BT - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
PB - IEEE Computer [email protected]
ER -