Abstract
This paper describes the approach proposed by the ARTEMIS team at TRECVID 2013, Instance Search (INS) task. The method is based on the Bag-of-Words representation obtained from uniform sampling of the frames of the videos. We propose two types of shot descriptors: one relying on a single representative frame for each video shot and another one collecting visual descriptors from multiple frames of the video clips.
| Original language | English |
|---|---|
| Publication status | Published - 1 Jan 2013 |
| Externally published | Yes |
| Event | 2013 TREC Video Retrieval Evaluation, TRECVID 2013 - Gaithersburg, United States Duration: 20 Nov 2013 → 22 Nov 2013 |
Conference
| Conference | 2013 TREC Video Retrieval Evaluation, TRECVID 2013 |
|---|---|
| Country/Territory | United States |
| City | Gaithersburg |
| Period | 20/11/13 → 22/11/13 |