TY - JOUR
T1 - Learning from Weak and Noisy Labels for Semantic Segmentation
AU - Lu, Zhiwu
AU - Fu, Zhenyong
AU - Xiang, Tao
AU - Han, Peng
AU - Wang, Liwei
AU - Gao, Xin
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work was partially supported by National Natural Science
Foundation of China (61573363 and 61573026), 973
Program of China (2014CB340403 and 2015CB352502),
the Fundamental Research Funds for the Central Universities
and the Research Funds of Renmin University of China
(15XNLQ01), IBM Global SUR Award Program, European
Research Council FP7 Project SUNNY (313243), and the
funding from KAUST.
PY - 2016/4/9
Y1 - 2016/4/9
N2 - A weakly supervised semantic segmentation (WSSS) method aims to learn a segmentation model from weak (image-level) as opposed to strong (pixel-level) labels. By avoiding the tedious pixel-level annotation process, it can exploit the unlimited supply of user-tagged images from media-sharing sites such as Flickr for large scale applications. However, these ‘free’ tags/labels are often noisy and few existing works address the problem of learning with both weak and noisy labels. In this work, we cast the WSSS problem into a label noise reduction problem. Specifically, after segmenting each image into a set of superpixels, the weak and potentially noisy image-level labels are propagated to the superpixel level resulting in highly noisy labels; the key to semantic segmentation is thus to identify and correct the superpixel noisy labels. To this end, a novel L1-optimisation based sparse learning model is formulated to directly and explicitly detect noisy labels. To solve the L1-optimisation problem, we further develop an efficient learning algorithm by introducing an intermediate labelling variable. Extensive experiments on three benchmark datasets show that our method yields state-of-the-art results given noise-free labels, whilst significantly outperforming the existing methods when the weak labels are also noisy.
AB - A weakly supervised semantic segmentation (WSSS) method aims to learn a segmentation model from weak (image-level) as opposed to strong (pixel-level) labels. By avoiding the tedious pixel-level annotation process, it can exploit the unlimited supply of user-tagged images from media-sharing sites such as Flickr for large scale applications. However, these ‘free’ tags/labels are often noisy and few existing works address the problem of learning with both weak and noisy labels. In this work, we cast the WSSS problem into a label noise reduction problem. Specifically, after segmenting each image into a set of superpixels, the weak and potentially noisy image-level labels are propagated to the superpixel level resulting in highly noisy labels; the key to semantic segmentation is thus to identify and correct the superpixel noisy labels. To this end, a novel L1-optimisation based sparse learning model is formulated to directly and explicitly detect noisy labels. To solve the L1-optimisation problem, we further develop an efficient learning algorithm by introducing an intermediate labelling variable. Extensive experiments on three benchmark datasets show that our method yields state-of-the-art results given noise-free labels, whilst significantly outperforming the existing methods when the weak labels are also noisy.
UR - http://hdl.handle.net/10754/608585
UR - http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=7450177
UR - http://www.scopus.com/inward/record.url?scp=85012895343&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2016.2552172
DO - 10.1109/TPAMI.2016.2552172
M3 - Article
SN - 0162-8828
VL - 39
SP - 486
EP - 500
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 3
ER -