Multi-task Generative Adversarial Network for Detecting Small Objects in the Wild

Yongqiang Zhang, Yancheng Bai, Mingli Ding, Bernard Ghanem

Research output: Contribution to journalArticlepeer-review

36 Scopus citations

Abstract

Object detection results have been rapidly improved over a short period of time with the development of deep convolutional neural networks. Although impressive results have been achieved on large/medium sized objects, the performance on small objects is far from satisfactory and one of remaining open challenges is detecting small object in unconstrained conditions (e.g. COCO and WIDER FACE benchmarks). The reason is that small objects usually lack sufficient detailed appearance information, which can distinguish them from the backgrounds or similar objects. To deal with the small object detection problem, in this paper, we propose an end-to-end multi-task generative adversarial network (MTGAN), which is a general framework. In the MTGAN, the generator is a super-resolution network, which can up-sample small blurred images into fine-scale ones and recover detailed information for more accurate detection. The discriminator is a multi-task network, which describes each inputted image patch with a real/fake score, object category scores, and bounding box regression offsets. Furthermore, to make the generator recover more details for easier detection, the classification and regression losses in the discriminator are back-propagated into the generator during training process. Extensive experiments on the challenging COCO and WIDER FACE datasets demonstrate the effectiveness of the proposed method in restoring a clear super-resolved image from a blurred small one, and show that the detection performance, especially for small sized objects, improves over state-of-the-art methods by a large margin.
Original languageEnglish (US)
JournalInternational Journal of Computer Vision
DOIs
StatePublished - Feb 18 2020

Fingerprint

Dive into the research topics of 'Multi-task Generative Adversarial Network for Detecting Small Objects in the Wild'. Together they form a unique fingerprint.

Cite this