TY - GEN
T1 - A study on sampling strategies in space-time domain for recognition applications
AU - Dikmen, Mert
AU - Lin, Dennis J.
AU - Del Pozo, Andrey
AU - Cao, Liang Liang
AU - Fu, Yun
AU - Huang, Thomas S.
PY - 2009
Y1 - 2009
N2 - We investigate the relative strengths of existing space-time interest points in the context of action detection and recognition. The interest point operators evaluated are an extension of the Harris corner detector (Laptev et al. [1]), a space-time Gabor filter (Dollar et al. [2]), and randomized sampling on the motion boundaries. In the first level of experiments we study the low level attributes of interest points such as stability, repeatability and sparsity with respect to the sources of variations such as actors, viewpoint and action category. In the second level we measure the discriminative power of interest points by extracting generic region descriptors around the interest points (1. histogram of optical flow[3], 2. motion history images[4], 3. histograms of oriented gradients[3]). Then we build a simple action recognition scheme by constructing a dictionary of codewords and learning a recognition system using the histograms of these codewords. We demonstrate that although there may be merits due to the structural information contained in the interest point detections, ultimately getting as many data samples as possible, even with random sampling, is the decisive factor in the interpretation of space-time data.
AB - We investigate the relative strengths of existing space-time interest points in the context of action detection and recognition. The interest point operators evaluated are an extension of the Harris corner detector (Laptev et al. [1]), a space-time Gabor filter (Dollar et al. [2]), and randomized sampling on the motion boundaries. In the first level of experiments we study the low level attributes of interest points such as stability, repeatability and sparsity with respect to the sources of variations such as actors, viewpoint and action category. In the second level we measure the discriminative power of interest points by extracting generic region descriptors around the interest points (1. histogram of optical flow[3], 2. motion history images[4], 3. histograms of oriented gradients[3]). Then we build a simple action recognition scheme by constructing a dictionary of codewords and learning a recognition system using the histograms of these codewords. We demonstrate that although there may be merits due to the structural information contained in the interest point detections, ultimately getting as many data samples as possible, even with random sampling, is the decisive factor in the interpretation of space-time data.
UR - http://www.scopus.com/inward/record.url?scp=77249124366&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77249124366&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-11301-7_47
DO - 10.1007/978-3-642-11301-7_47
M3 - Conference contribution
AN - SCOPUS:77249124366
SN - 3642113001
SN - 9783642113000
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 465
EP - 476
BT - Advances in Multimedia Modeling - 16th International Multimedia Modeling Conference, MMM 2010, Proceedings
T2 - 16th International Multimedia Modeling Conference on Advances in Multimedia Modeling, MMM 2010
Y2 - 6 October 2010 through 8 October 2010
ER -