TY - GEN
T1 - Trajectory-based Fisher kernel representation for action recognition in videos
AU - Atmosukarto, Indriyati
AU - Ghanem, Bernard
AU - Ahuja, Narendra
N1 - KAUST Repository Item: Exported on 2020-04-23
Acknowledgements: This study is supported by the re-search grant for the Human Sixth Sense Program at ADSC from Singapore’s Agency for Science, Technol-
ogy and Research (A*STAR)
PY - 2012
Y1 - 2012
N2 - Action recognition is an important computer vision problem that has many applications including video indexing and retrieval, event detection, and video summarization. In this paper, we propose to apply the Fisher kernel paradigm to action recognition. The Fisher kernel framework combines the strengths of generative and discriminative models. In this approach, given the trajectories extracted from a video and a generative Gaussian Mixture Model (GMM), we use the Fisher Kernel method to describe how much the GMM parameters are modified to best fit the video trajectories. We experiment in using the Fisher Kernel vector to create the video representation and to train an SVM classifier. We further extend our framework to select the most discriminative trajectories using a novel MIL-KNN framework. We compare the performance of our approach to the current state-of-the-art bag-of-features (BOF) approach on two benchmark datasets. Experimental results show that our proposed approach outperforms the state-of-the-art method [8] and that the selected discriminative trajectories are descriptive of the action class.
AB - Action recognition is an important computer vision problem that has many applications including video indexing and retrieval, event detection, and video summarization. In this paper, we propose to apply the Fisher kernel paradigm to action recognition. The Fisher kernel framework combines the strengths of generative and discriminative models. In this approach, given the trajectories extracted from a video and a generative Gaussian Mixture Model (GMM), we use the Fisher Kernel method to describe how much the GMM parameters are modified to best fit the video trajectories. We experiment in using the Fisher Kernel vector to create the video representation and to train an SVM classifier. We further extend our framework to select the most discriminative trajectories using a novel MIL-KNN framework. We compare the performance of our approach to the current state-of-the-art bag-of-features (BOF) approach on two benchmark datasets. Experimental results show that our proposed approach outperforms the state-of-the-art method [8] and that the selected discriminative trajectories are descriptive of the action class.
UR - http://www.scopus.com/inward/record.url?scp=84874577192&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84874577192
SN - 9784990644109
T3 - Proceedings - International Conference on Pattern Recognition
SP - 3333
EP - 3336
BT - ICPR 2012 - 21st International Conference on Pattern Recognition
T2 - 21st International Conference on Pattern Recognition, ICPR 2012
Y2 - 11 November 2012 through 15 November 2012
ER -