TY - GEN
T1 - Learning to Detect Human-Object Interactions
AU - Chao, Yu-Wei
AU - Liu, Yunfan
AU - Liu, Xieyang
AU - Zeng, Huayi
AU - Deng, Jia
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): OSR-2015-CRG4-2639
Acknowledgements: This publication is based upon work supported by the King Abdullah University of Science and Technology (KAUST) Office of Sponsored Research (OSR) under Award No. OSR-2015-CRG4-2639.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.
PY - 2018/5/7
Y1 - 2018/5/7
N2 - We study the problem of detecting human-object interactions (HOI) in static images, defined as predicting a human and an object bounding box with an interaction class label that connects them. HOI detection is a fundamental problem in computer vision as it provides semantic information about the interactions among the detected objects. We introduce HICO-DET, a new large benchmark for HOI detection, by augmenting the current HICO classification benchmark with instance annotations. To solve the task, we propose Human-Object Region-based Convolutional Neural Networks (HO-RCNN). At the core of our HO-RCNN is the Interaction Pattern, a novel DNN input that characterizes the spatial relations between two bounding boxes. Experiments on HICO-DET demonstrate that our HO-RCNN, by exploiting human-object spatial relations through Interaction Patterns, significantly improves the performance of HOI detection over baseline approaches.
AB - We study the problem of detecting human-object interactions (HOI) in static images, defined as predicting a human and an object bounding box with an interaction class label that connects them. HOI detection is a fundamental problem in computer vision as it provides semantic information about the interactions among the detected objects. We introduce HICO-DET, a new large benchmark for HOI detection, by augmenting the current HICO classification benchmark with instance annotations. To solve the task, we propose Human-Object Region-based Convolutional Neural Networks (HO-RCNN). At the core of our HO-RCNN is the Interaction Pattern, a novel DNN input that characterizes the spatial relations between two bounding boxes. Experiments on HICO-DET demonstrate that our HO-RCNN, by exploiting human-object spatial relations through Interaction Patterns, significantly improves the performance of HOI detection over baseline approaches.
UR - http://hdl.handle.net/10754/626708
UR - https://ieeexplore.ieee.org/document/8354152/
UR - http://www.scopus.com/inward/record.url?scp=85050928969&partnerID=8YFLogxK
U2 - 10.1109/wacv.2018.00048
DO - 10.1109/wacv.2018.00048
M3 - Conference contribution
SN - 9781538648865
SP - 381
EP - 389
BT - 2018 IEEE Winter Conference on Applications of Computer Vision (WACV)
PB - IEEE
ER -