TY - GEN
T1 - Pornographic image recognition by strongly-supervised deep multiple instance learning
AU - Wang, Yuhui
AU - Jin, Xin
AU - Tan, Xiaoyang
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/8/3
Y1 - 2016/8/3
N2 - In this paper, we propose a principled framework for pornographic image recognition. Specifically, we present our definition of pornographic images, which characterizes the pornographic contents in images as the exposure of private body parts. As the private body parts often lie in local image regions, we model each image as a bag of local image patches (instances), and assume that for each pornographic image at least one instance accounts for the pornographic content within it. This treatment allows us to cast the model training as a Multiple Instance Learning (MIL) problem. Furthermore, we propose a strongly-supervised setting for MIL by identifying the most likely pornographic instances in positive bags, which effectively prevents the algorithm from getting trapped in a bad local optima. Last but not least, we formulate our strongly-supervised MIL under the deep CNN framework to learn deep representations; hence we call it Strongly-supervised Deep MIL (SD-MIL). We demonstrate that our SD-MIL based system produces remarkable accuracy with 97.01% TPR at 1% FPR, testing on 117K pornographic images and 117K normal images from our newly-collected large scale dataset.
AB - In this paper, we propose a principled framework for pornographic image recognition. Specifically, we present our definition of pornographic images, which characterizes the pornographic contents in images as the exposure of private body parts. As the private body parts often lie in local image regions, we model each image as a bag of local image patches (instances), and assume that for each pornographic image at least one instance accounts for the pornographic content within it. This treatment allows us to cast the model training as a Multiple Instance Learning (MIL) problem. Furthermore, we propose a strongly-supervised setting for MIL by identifying the most likely pornographic instances in positive bags, which effectively prevents the algorithm from getting trapped in a bad local optima. Last but not least, we formulate our strongly-supervised MIL under the deep CNN framework to learn deep representations; hence we call it Strongly-supervised Deep MIL (SD-MIL). We demonstrate that our SD-MIL based system produces remarkable accuracy with 97.01% TPR at 1% FPR, testing on 117K pornographic images and 117K normal images from our newly-collected large scale dataset.
KW - Deep learning
KW - Multiple Instance Learning
KW - Pornographic image recognition
UR - http://www.scopus.com/inward/record.url?scp=85006789557&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2016.7533195
DO - 10.1109/ICIP.2016.7533195
M3 - Conference contribution
AN - SCOPUS:85006789557
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 4418
EP - 4422
BT - 2016 IEEE International Conference on Image Processing, ICIP 2016 - Proceedings
PB - IEEE Computer Society
T2 - 23rd IEEE International Conference on Image Processing, ICIP 2016
Y2 - 25 September 2016 through 28 September 2016
ER -