Abstract
This notebook paper summarizes Team NEC-UIUC's approaches for TRECVid 2009 Evaluation of Surveillance Event Detection. Our submissions include two types of systems. One system employs the brute force search method to test each space-time location in the video by a binary classifier on whether a specific event occurs. The other system takes advantage of human detection and tracking to avoid the costly brute force search and evaluates the candidate space-time cubes by combining 3D convolutional neural networks (CNN) and SVM classifiers based on bag-ofwords local features to detect the presence of events of interests. Via thorough cross-validation on the development set, we select proper combining weights and thresholds to minimize the detection cost rates (DCR). Our systems achieve good performance on event categories which involve actions of a single person, e.g. CellToEar, ObjectPut, and Pointing.
Original language | English (US) |
---|---|
State | Published - 2009 |
Externally published | Yes |
Event | TREC Video Retrieval Evaluation, TRECVID 2009 - Gaithersburg, MD, United States Duration: Nov 16 2009 → Nov 17 2009 |
Conference
Conference | TREC Video Retrieval Evaluation, TRECVID 2009 |
---|---|
Country/Territory | United States |
City | Gaithersburg, MD |
Period | 11/16/09 → 11/17/09 |
ASJC Scopus subject areas
- Computer Graphics and Computer-Aided Design
- Computer Vision and Pattern Recognition
- Human-Computer Interaction
- Software