Abstract
The IRIM group is a consortium of French teams working on Multimedia Indexing and Retrieval. This paper describes its participation to the TRECVID 2012 semantic indexing and instance search tasks. For the semantic indexing task, our approach uses a six-stages processing pipelines for computing scores for the likelihood of a video shot to contain a target concept. These scores are then used for producing a ranked list of images or shots that are the most likely to contain the target concept. The pipeline is composed of the following steps: descriptor extraction, descriptor optimization, classification, fusion of descriptor variants, higher-level fusion, and re-ranking. We evaluated a number of different descriptors and tried different fusion strategies. The best IRIM run has a Mean Inferred Average Precision of 0.2378, which ranked us 4th out of 16 participants.
| Original language | English |
|---|---|
| Publication status | Published - 1 Jan 2012 |
| Externally published | Yes |
| Event | TREC Video Retrieval Evaluation, TRECVID 2012 - Gaithersburg, MD, United States Duration: 26 Nov 2012 → 28 Nov 2012 |
Conference
| Conference | TREC Video Retrieval Evaluation, TRECVID 2012 |
|---|---|
| Country/Territory | United States |
| City | Gaithersburg, MD |
| Period | 26/11/12 → 28/11/12 |