TY - JOUR
T1 - Pornographic Image Recognition via Weighted Multiple Instance Learning
AU - Jin, Xin
AU - Wang, Yuhui
AU - Tan, Xiaoyang
N1 - Funding Information:
This work was supported in part by the National Science Foundation of China under Grant 61672280, Grant 61373060, and Grant 61732006, in part by the National Key Research and Development Program of China under Grant 2017YFB0802300, in part by the Jiangsu 333 Project under Grant BRA2017377, and in part by the Qing Lan Project.
Publisher Copyright:
© 2018 IEEE.
PY - 2019/12/1
Y1 - 2019/12/1
N2 - In the era of Internet, recognizing pornographic images is of great significance for protecting children’s physical and mental health. However, this task is very challenging as the key pornographic contents (e.g., breast and private part) in an image often lie in local regions of small size. In this paper, we model each image as a bag of regions, and follow a multiple instance learning (MIL) approach to train a generic region-based recognition model. Specifically, we take into account the regions’ degree of pornography, and make three main contributions. First, we show that based on very few annotations of the key pornographic contents in a training image, we can generate a bag of properly sized regions, among which the potential positive regions usually contain useful contexts that can aid recognition. Second, we present a simple quantitative measure of a region’s degree of pornography, which can be used to weigh the importance of different regions in a positive image. Third, we formulate the recognition task as a weighted MIL problem under the convolutional neural network framework, with a bag probability function introduced to combine the importance of different regions. Experiments on our newly collected large scale dataset demonstrate the effectiveness of the proposed method, achieving an accuracy with 97.52% true positive rate at 1% false positive rate, tested on 100K pornographic images and 100K normal images.
AB - In the era of Internet, recognizing pornographic images is of great significance for protecting children’s physical and mental health. However, this task is very challenging as the key pornographic contents (e.g., breast and private part) in an image often lie in local regions of small size. In this paper, we model each image as a bag of regions, and follow a multiple instance learning (MIL) approach to train a generic region-based recognition model. Specifically, we take into account the regions’ degree of pornography, and make three main contributions. First, we show that based on very few annotations of the key pornographic contents in a training image, we can generate a bag of properly sized regions, among which the potential positive regions usually contain useful contexts that can aid recognition. Second, we present a simple quantitative measure of a region’s degree of pornography, which can be used to weigh the importance of different regions in a positive image. Third, we formulate the recognition task as a weighted MIL problem under the convolutional neural network framework, with a bag probability function introduced to combine the importance of different regions. Experiments on our newly collected large scale dataset demonstrate the effectiveness of the proposed method, achieving an accuracy with 97.52% true positive rate at 1% false positive rate, tested on 100K pornographic images and 100K normal images.
KW - Deep learning
KW - multiple instance learning (MIL)
KW - pornographic image recognition
UR - http://www.scopus.com/inward/record.url?scp=85053334952&partnerID=8YFLogxK
U2 - 10.1109/TCYB.2018.2864870
DO - 10.1109/TCYB.2018.2864870
M3 - Article
C2 - 30222590
AN - SCOPUS:85053334952
SN - 2168-2267
VL - 49
SP - 4412
EP - 4420
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 12
ER -