Passer à la navigation principale Passer à la recherche Passer au contenu principal

NERAF: 3D SCENE INFUSED NEURAL RADIANCE AND ACOUSTIC FIELDS

  • Mines ParisTech

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Sound plays a major role in human perception. Along with vision, it provides essential information for understanding our surroundings. Despite advances in neural implicit representations, learning acoustics that align with visual scenes remains a challenge. We propose NeRAF, a method that jointly learns radiance and acoustic fields. NeRAF synthesizes both novel views and spatialized room impulse responses (RIR) at new positions by conditioning the acoustic field on 3D scene geometric and appearance priors from the radiance field. The generated RIR can be applied to auralize any audio signal. Each modality can be rendered independently and at spatially distinct positions, offering greater versatility. We demonstrate that NeRAF generates high-quality audio on SoundSpaces and RAF datasets, achieving significant performance improvements over prior methods while being more data-efficient. Additionally, NeRAF enhances novel view synthesis of complex scenes trained with sparse data through cross-modal learning. NeRAF is designed as a Nerfstudio module, providing convenient access to realistic audio-visual generation. Project page: https://amandinebtto.github.io/NeRAF.

langue originaleAnglais
titre13th International Conference on Learning Representations, ICLR 2025
EditeurInternational Conference on Learning Representations, ICLR
Pages61672-61695
Nombre de pages24
ISBN (Electronique)9798331320850
étatPublié - 1 janv. 2025
Modification externeOui
Evénement13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapour
Durée: 24 avr. 202528 avr. 2025

Série de publications

Nom13th International Conference on Learning Representations, ICLR 2025

Une conférence

Une conférence13th International Conference on Learning Representations, ICLR 2025
Pays/TerritoireSingapour
La villeSingapore
période24/04/2528/04/25

Empreinte digitale

Examiner les sujets de recherche de « NERAF: 3D SCENE INFUSED NEURAL RADIANCE AND ACOUSTIC FIELDS ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation