Evaluation Of Classifiers Based On I2C Distance For Action Recognition

Naive Bayes Nearest Neighbor (NBNN) and its variants, i,e., local NBNN and the NBNN kernels, are local feature-based classifiers that have achieved impressive performance in image classification. By exploiting instance-to-class (I2C) distances (instance means image/video in image/video classificatio...

Full description

Bibliographic Details
Main Authors: Zhang, Lei, Wang, Tao, Xiantong Zhen
Format: Text
Language:English
Published: Zenodo 2012
Subjects:
Online Access:https://dx.doi.org/10.5281/zenodo.1079268
https://zenodo.org/record/1079268
Description
Summary:Naive Bayes Nearest Neighbor (NBNN) and its variants, i,e., local NBNN and the NBNN kernels, are local feature-based classifiers that have achieved impressive performance in image classification. By exploiting instance-to-class (I2C) distances (instance means image/video in image/video classification), they avoid quantization errors of local image descriptors in the bag of words (BoW) model. However, the performances of NBNN, local NBNN and the NBNN kernels have not been validated on video analysis. In this paper, we introduce these three classifiers into human action recognition and conduct comprehensive experiments on the benchmark KTH and the realistic HMDB datasets. The results shows that those I2C based classifiers consistently outperform the SVM classifier with the BoW model. : {"references": ["Oren Boiman, Eli Shechtman, Michal Irani. \"In Defense of Nearest-Neighbor Based Image Classification\". In CVPR 2008.", "T. Tuytelaars, M. Fritz, K. Saenko, T. Darrell. \"The NBNN kernel\". In ICCV 2011.", "S. Gao, I. Tsang, L. Chia, and P. Zhao. \"Local features are not\nlonely-Laplacian sparse coding for image classification\". In CVPR, 2010.", "J. Wang, J. Yang, K. Yu, F. Lv, T. Huang, and Y. Gong. \"Locality\n-constrained linear coding for image classification\". In CVPR, 2010.", "Lingqiao Liu, Lei Wang, Xinwang Liu. \"In Defense of Soft-assignment\nCoding\". In ICCV 2011.", "Sancho McCann, David G. Lowe. \"Local Naive Bayes nearest Neighbor\nfor Image Classification\". Technical Report TR-2011-11, University of\nBritish Columbia.", "C. Schuldt, I. Laptev, and B. Caputo, \"Recognizing human actions: a local\nSVM approach,\" in ICPR, 2004.", "H. Kuehne, H. Jhuang, E. Garrote, T. Poggio, T. Serre. \"HMDB: A Large\nVideo Database for Human Motion Recognition\". In ICCV 2011.", "David G. Lowe. \"Distinctive Image Features from Scale-Invariant Keypoints\". International Journal of Computer Vision 60(2), 91-110, 2004.\n[10] I. Laptev, \"On space-time interest points,\" IJCV, vol. 64, no. 2, pp.\n107-123, 2005.\n[11] Zhengxiang Wang, Yiqun Hu, Liang-Tien Chia. \"Image-to-Class\nDistance Metric Learning for Image Classification\", In ECCV 2010.\n[12] Heng Wang, Muhammad Muneeb Ullah, Alexander Kl\u251c\u00f1ser, Ivan Laptev,\nCordelia Schmid. \"Evaluation of local spatio-temporal features for action\nrecognition\". In BMVC 2009.\n[13] Sadanand S., Corso J. \"Action bank : a high level representation of\nactivity in video\". In CVPR 2012."]}