Passer à la navigation principale Passer à la recherche Passer au contenu principal

Structure-Informed Positional Encoding for Music Generation

  • Institut Polytechnique de Paris

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Music generated by deep learning methods often suffers from a lack of coherence and long-term organization. Yet, multi-scale hierarchical structure is a distinctive feature of music signals. To leverage this information, we propose a structure-informed positional encoding framework for music generation with Transformers. We design three variants in terms of absolute, relative and non-stationary positional information. We comprehensively test them on two symbolic music generation tasks: next-timestep prediction and accompaniment generation. As a comparison, we choose multiple baselines from the literature and demonstrate the merits of our methods using several musically-motivated evaluation metrics. In particular, our methods improve the melodic and structural consistency of the generated pieces.

langue originaleAnglais
Pages (de - à)951-955
Nombre de pages5
journalICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Les DOIs
étatPublié - 1 janv. 2024
Evénement2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024 - Seoul, Corée du Sud
Durée: 14 avr. 202419 avr. 2024

Empreinte digitale

Examiner les sujets de recherche de « Structure-Informed Positional Encoding for Music Generation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation