Passer à la navigation principale Passer à la recherche Passer au contenu principal

Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient?

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

For specialized and dense downstream tasks such as object detection, labeling data requires expertise and can be very expensive, making few-shot and semi-supervised models much more attractive alternatives. While in the few-shot setup we observe that transformer-based object detectors perform better than convolution-based two-stage models for a similar amount of parameters, they are not as effective when used with recent approaches in the semi-supervised setting. In this paper, we propose a semi-supervised method tailored for the current state-of-the-art object detector Deformable DETR in the few-annotation learning setup using a student-teacher architecture, which avoids relying on a sensitive post-processing of the pseudo-labels generated by the teacher model. We evaluate our method on the semi-supervised object detection benchmarks COCO and Pascal VOC, and it outperforms previous methods, especially when annotations are scarce. We believe that our contributions open new possibilities to adapt similar object detection methods in this setup as well.

langue originaleAnglais
titreProceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages75-84
Nombre de pages10
ISBN (Electronique)9781665493468
Les DOIs
étatPublié - 1 janv. 2023
Modification externeOui
Evénement23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023 - Waikoloa, États-Unis
Durée: 3 janv. 20237 janv. 2023

Série de publications

NomProceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023

Une conférence

Une conférence23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023
Pays/TerritoireÉtats-Unis
La villeWaikoloa
période3/01/237/01/23

Empreinte digitale

Examiner les sujets de recherche de « Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient? ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation