Passer à la navigation principale Passer à la recherche Passer au contenu principal

Probabilistic model for main melody extraction using constant-Q transform

  • CNRS LTCI

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Dimension reduction techniques such as Nonnegative Tensor Factorization are now classical for both source separation and estimation of multiple fundamental frequencies in audio mixtures. Still, few studies jointly addressed these tasks so far, mainly because separation is often based on the Short Term Fourier Transform (STFT) whereas recent music analysis algorithms are rather based on the Constant-Q Transform (CQT). The CQT is practical for pitch estimation because a pitch shift amounts to a translation of the CQT representation, whereas it produces a scaling of the STFT. Conversely, no simple inversion of the CQT was available until recently, preventing it from being used for source separation. Benefiting from advances both in the inversion of the CQT and in statistical modeling, we show how recent techniques designed for music analysis can also be used for source separation with encouraging results, thus opening the path to many crossovers between separation and analysis.

langue originaleAnglais
titre2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings
Pages5357-5360
Nombre de pages4
Les DOIs
étatPublié - 23 oct. 2012
Modification externeOui
Evénement2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Kyoto, Japon
Durée: 25 mars 201230 mars 2012

Série de publications

NomICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (imprimé)1520-6149

Une conférence

Une conférence2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
Pays/TerritoireJapon
La villeKyoto
période25/03/1230/03/12

Empreinte digitale

Examiner les sujets de recherche de « Probabilistic model for main melody extraction using constant-Q transform ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation