Passer à la navigation principale Passer à la recherche Passer au contenu principal

Voice transformation using PSOLA technique

Résultats de recherche: Contribution à un journalArticleRevue par des pairs

Résumé

In this contribution, a new system for voice conversion is described. The proposed architecture combines a PSOLA (Pitch Synchronous Overlap and Add)-derived synthesizer and a module for spectral transformation. The synthesizer based on the classical source-filter decomposition allows prosodic and spectral transformations to be performed independently. Prosodic modifications are applied on the excitation signal using the TD-PSOLA scheme; converted speech is then synthesized using the transformed spectral parameters. Two different approaches to derive spectral transformations, borrowed from the speech-recognition domain, are compared: Linear Multivariate Regression (LMR) and Dynamic Frequency Warping (DFW). Vector-quantization is carried out as a preliminary stage to render the spectral transformations dependent of the acoustical realization of sounds. A formal listening test shows that the synthesizer produces a satisfyingly natural "transformed" voice. LMR proves yet to allow a slightly better conversion than DFW. Still there is room for improvement in the spectral transformation stage.

langue originaleAnglais
Pages (de - à)175-187
Nombre de pages13
journalSpeech Communication
Volume11
Numéro de publication2-3
Les DOIs
étatPublié - 1 janv. 1992

Empreinte digitale

Examiner les sujets de recherche de « Voice transformation using PSOLA technique ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation