TY - GEN
T1 - RAID: a relation-augmented image descriptor
AU - Guerrero, Paul
AU - Mitra, Niloy J.
AU - Wonka, Peter
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): OCRF-2014-CGR3-62140401
Acknowledgements: The research described here was supported by the Office of Sponsored Research (OSR) under Award No. OCRF-2014-CGR3-62140401, the Visual Computing Center at KAUST, ERC Starting Grant SmartGeometry (StG-2013 335373), Marie Curie CIG 303541 and the Open3D Project (EPSRC Grant EP/M013685/1).
PY - 2016/7/11
Y1 - 2016/7/11
N2 - As humans, we regularly interpret scenes based on how objects are related, rather than based on the objects themselves. For example, we see a person riding an object X or a plank bridging two objects. Current methods provide limited support to search for content based on such relations. We present RAID, a relation-augmented image descriptor that supports queries based on inter-region relations. The key idea of our descriptor is to encode region-to-region relations as the spatial distribution of point-to-region relationships between two image regions. RAID allows sketch-based retrieval and requires minimal training data, thus making it suited even for querying uncommon relations. We evaluate the proposed descriptor by querying into large image databases and successfully extract nontrivial images demonstrating complex inter-region relations, which are easily missed or erroneously classified by existing methods. We assess the robustness of RAID on multiple datasets even when the region segmentation is computed automatically or very noisy.
AB - As humans, we regularly interpret scenes based on how objects are related, rather than based on the objects themselves. For example, we see a person riding an object X or a plank bridging two objects. Current methods provide limited support to search for content based on such relations. We present RAID, a relation-augmented image descriptor that supports queries based on inter-region relations. The key idea of our descriptor is to encode region-to-region relations as the spatial distribution of point-to-region relationships between two image regions. RAID allows sketch-based retrieval and requires minimal training data, thus making it suited even for querying uncommon relations. We evaluate the proposed descriptor by querying into large image databases and successfully extract nontrivial images demonstrating complex inter-region relations, which are easily missed or erroneously classified by existing methods. We assess the robustness of RAID on multiple datasets even when the region segmentation is computed automatically or very noisy.
UR - http://hdl.handle.net/10754/620947
UR - http://dl.acm.org/citation.cfm?doid=2897824.2925939
UR - http://www.scopus.com/inward/record.url?scp=84980044688&partnerID=8YFLogxK
U2 - 10.1145/2897824.2925939
DO - 10.1145/2897824.2925939
M3 - Conference contribution
SP - 1
EP - 12
BT - ACM Transactions on Graphics
PB - Association for Computing Machinery (ACM)
ER -