TY - JOUR
T1 - Multiple instance learning tracking method with local sparse representation
AU - Xie, Chengjun
AU - Tan, Jieqing
AU - Chen, Peng
AU - Zhang, Jie
AU - Helg, Lei
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This work was supported by the NSFC-Guangdong Joint Foundation Key Project under grant (no. U1135003), the National Nature Science Foundation of China (grant no. 61070227).
PY - 2013/10/1
Y1 - 2013/10/1
N2 - When objects undergo large pose change, illumination variation or partial occlusion, most existed visual tracking algorithms tend to drift away from targets and even fail in tracking them. To address this issue, in this study, the authors propose an online algorithm by combining multiple instance learning (MIL) and local sparse representation for tracking an object in a video system. The key idea in our method is to model the appearance of an object by local sparse codes that can be formed as training data for the MIL framework. First, local image patches of a target object are represented as sparse codes with an overcomplete dictionary, where the adaptive representation can be helpful in overcoming partial occlusion in object tracking. Then MIL learns the sparse codes by a classifier to discriminate the target from the background. Finally, results from the trained classifier are input into a particle filter framework to sequentially estimate the target state over time in visual tracking. In addition, to decrease the visual drift because of the accumulative errors when updating the dictionary and classifier, a two-step object tracking method combining a static MIL classifier with a dynamical MIL classifier is proposed. Experiments on some publicly available benchmarks of video sequences show that our proposed tracker is more robust and effective than others. © The Institution of Engineering and Technology 2013.
AB - When objects undergo large pose change, illumination variation or partial occlusion, most existed visual tracking algorithms tend to drift away from targets and even fail in tracking them. To address this issue, in this study, the authors propose an online algorithm by combining multiple instance learning (MIL) and local sparse representation for tracking an object in a video system. The key idea in our method is to model the appearance of an object by local sparse codes that can be formed as training data for the MIL framework. First, local image patches of a target object are represented as sparse codes with an overcomplete dictionary, where the adaptive representation can be helpful in overcoming partial occlusion in object tracking. Then MIL learns the sparse codes by a classifier to discriminate the target from the background. Finally, results from the trained classifier are input into a particle filter framework to sequentially estimate the target state over time in visual tracking. In addition, to decrease the visual drift because of the accumulative errors when updating the dictionary and classifier, a two-step object tracking method combining a static MIL classifier with a dynamical MIL classifier is proposed. Experiments on some publicly available benchmarks of video sequences show that our proposed tracker is more robust and effective than others. © The Institution of Engineering and Technology 2013.
UR - http://hdl.handle.net/10754/563031
UR - http://mr.crossref.org/iPage?doi=10.1049%2Fiet-cvi.2012.0228
UR - http://www.scopus.com/inward/record.url?scp=84886671977&partnerID=8YFLogxK
U2 - 10.1049/iet-cvi.2012.0228
DO - 10.1049/iet-cvi.2012.0228
M3 - Article
SN - 1751-9632
VL - 7
SP - 320
EP - 334
JO - IET Computer Vision
JF - IET Computer Vision
IS - 5
ER -