Passer à la navigation principale Passer à la recherche Passer au contenu principal

CATALOG: A Camera Trap Language-Guided Contrastive Learning Model

  • Universidad de Antioquia

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Foundation Models (FMs) have been successful in various computer vision tasks like image classification, object detection and image segmentation. However, these tasks remain challenging when these models are tested on datasets with different distributions from the training dataset, a problem known as domain shift. This is especially problematic for recognizing animal species in camera-trap images where we have variability in factors like lighting, camouflage and occlusions. In this paper, we propose the Camera Trap Language-guided Contrastive Learning (CATALOG) model to address these issues. Our approach combines multiple FMs to extract visual and textual features from camera-trap data and uses a contrastive loss function to train the model. We evaluate CATALOG on two benchmark datasets and show that it outperforms previous state-of-the-art methods in camera-trap image recognition, especially when the training and testing data have different animal species or come from different geographical areas. Our approach demonstrates the potential of using FMs in combination with multi-modal fusion and contrastive learning for addressing domain shifts in camera-trap image recognition. The code of CATALOG is publicly available at https://github.com/Julian075/CATALOG.

langue originaleAnglais
titreProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages1197-1206
Nombre de pages10
ISBN (Electronique)9798331510831
Les DOIs
étatPublié - 1 janv. 2025
Evénement2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025 - Tucson, États-Unis
Durée: 28 févr. 20254 mars 2025

Série de publications

NomProceedings - 2025 IEEE Winter Conference on Applications of Computer Vision, WACV 2025

Une conférence

Une conférence2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025
Pays/TerritoireÉtats-Unis
La villeTucson
période28/02/254/03/25

SDG des Nations Unies

Ce résultat contribue à ou aux Objectifs de développement durable suivants

  1. SDG 3 - Bonne santé et bien-être
    SDG 3 Bonne santé et bien-être

Empreinte digitale

Examiner les sujets de recherche de « CATALOG: A Camera Trap Language-Guided Contrastive Learning Model ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation