Action Recognition Using Hierarchical STIP Saliency and Mixed Neighborhood Features

Jiangfeng Yang; Zheng Ma

Action Recognition Using Hierarchical STIP Saliency and Mixed Neighborhood Features

원문정보

Jiangfeng Yang, Zheng Ma

보안공학연구지원센터(IJCA) International Journal of Control and Automation Vol.9 No.3 2016.03 pp.245-260 SCOPUS

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

In video action recognition, the Dollar detector has been widely used to extract Spatio-Temporal Interest Points (STIPs) from action video sequence. It generates two kinds of information: STIP position and the respond value. However, in many cases, the detector respond, which measures the strength of local motion changes, is ignored. By utilizing such information, we propose to build a Hierarchical STIP Saliency (HSS) framework to provide different types of motion information. A novel local feature named Mixed Neighborhood Feature (MNF), which integrates the similarity and position relationship between local features, is put forward, and encoded by localityconstrained linear coding. Then, by partitioning video sequence along temporal direction, a group of sub-STVs are produced, and their corresponding descriptors are obtained with a max-pooling-on-absolute-value technique. In classification stage, Locality-constrained Group Sparse Representation (LGSR) is adopted as classifier to utilize the intrinsic group information of these sub-STV features. The experiments on the KTH and UCF Sports datasets show that in contrast to the classical recognition systems published recently, our recognition system based on the HSS and MNF achieves good performance.

Abstract
1. Introduction
2. Constructing Hierarchical STIP Saliency (HSS)
3. Forming Mixed Neighborhood Features (MNF)
4. Encoding Mnfs By Locality-Constrained Linear Coding (LLC)
  4.1. VQ, SVQ and SC
  4.2. Locality-Constrained Linear Coding (LLC)
5. Classifying Action Videos with LGSR
  5.1. Extracting Sub-STVS with Multi-Temporal-Scale Sampling (MTS)
  5.2. Constructing Sub-STV Descriptors
  5.3. LGSR
  5.4. Classification Methods
6. Experiments
  6.1. Experimental Configuration
  6.2. Human Action Datasets
  6.3. Experimental Results and Discussion
7. Conclusion
Acknowledgements
References

키워드

저자정보

Jiangfeng Yang School of Communication and Information Engineering University of Electronic Science and Technology of China, Xiyuan Ave, No.2006, West Hi-Tech Zone
Zheng Ma School of Communication and Information Engineering University of Electronic Science and Technology of China, Xiyuan Ave, No.2006, West Hi-Tech Zone

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle