Passer à la navigation principale Passer à la recherche Passer au contenu principal

Bridging Text and Image for Artist Style Transfer via Contrastive Learning

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Image style transfer has attracted widespread attention in the past few years. Despite its remarkable results, it requires additional style images available as references, making it less flexible and inconvenient. Using text is the most natural way to describe the style. More importantly, text can describe implicit abstract styles, like styles of specific artists or art movements. In this paper, we propose a Contrastive Learning for Artistic Style Transfer (CLAST) that leverages advanced image-text encoders to control arbitrary style transfer. We introduce a supervised contrastive training strategy to effectively extract style descriptions from the image-text model (i.e., CLIP), which aligns stylization with the text description. To this end, we also propose a novel and efficient adaLN based state space models that explore style-content fusion. Finally, we achieve a text-driven image style transfer. Extensive experiments demonstrate that our approach outperforms the state-of-the-art methods in artistic style transfer. More importantly, it does not require online fine-tuning and can render a 512×512 image in 0.03 s.

langue originaleAnglais
titreComputer Vision – ECCV 2024 Workshops, Proceedings
rédacteurs en chefAlessio Del Bue, Cristian Canton, Jordi Pont-Tuset, Tatiana Tommasi
EditeurSpringer Science and Business Media Deutschland GmbH
Pages1-18
Nombre de pages18
ISBN (imprimé)9783031928079
Les DOIs
étatPublié - 1 janv. 2025
EvénementWorkshops that were held in conjunction with the 18th European Conference on Computer Vision, ECCV 2024 - Milan, Italie
Durée: 29 sept. 20244 oct. 2024

Série de publications

NomLecture Notes in Computer Science
Volume15627 LNCS
ISSN (imprimé)0302-9743
ISSN (Electronique)1611-3349

Une conférence

Une conférenceWorkshops that were held in conjunction with the 18th European Conference on Computer Vision, ECCV 2024
Pays/TerritoireItalie
La villeMilan
période29/09/244/10/24

Empreinte digitale

Examiner les sujets de recherche de « Bridging Text and Image for Artist Style Transfer via Contrastive Learning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation