Towards Practical Lottery Ticket Hypothesis for Adversarial Training

Bai Li, Shiqi Wang, Yunhan Jia, Yantao Lu, Zhenyu Zhong, Lawrence Carin, Suman Jana

Research output: Contribution to journalArticlepeer-review

22 Downloads (Pure)

Abstract

Recent research has proposed the lottery ticket hypothesis, suggesting that for a deep neural network, there exist trainable sub-networks performing equally or better than the original model with commensurate training steps. While this discovery is insightful, finding proper sub-networks requires iterative training and pruning. The high cost incurred limits the applications of the lottery ticket hypothesis. We show there exists a subset of the aforementioned sub-networks that converge significantly faster during the training process and thus can mitigate the cost issue. We conduct extensive experiments to show such sub-networks consistently exist across various model structures for a restrictive setting of hyperparameters ($e.g.$, carefully selected learning rate, pruning ratio, and model capacity). As a practical application of our findings, we demonstrate that such sub-networks can help in cutting down the total time of adversarial training, a standard approach to improve robustness, by up to 49\% on CIFAR-10 to achieve the state-of-the-art robustness.
Original languageEnglish (US)
JournalArxiv preprint
StatePublished - Mar 6 2020
Externally publishedYes

Keywords

  • cs.LG
  • stat.ML

Fingerprint

Dive into the research topics of 'Towards Practical Lottery Ticket Hypothesis for Adversarial Training'. Together they form a unique fingerprint.

Cite this