Abstract
Whereas speaker normalization and adaptation have received a lot of attention in speech recognition, few studies have been devoted to voice transformation for speech synthesis, despite the potential interest of such techniques. Converting voice individuality requires spectrum, glottal excitation and prosody modifications. This work focuses on spectral modifications, but some simple prosodic alterations are also taken into account. We combine two techniques to simulate a change of speaker. The first is the TD-PSOLA technique, which is very efficient at altering prosody. The second is a classical source-filter decomposition, which extracts from the signal a spectral representation on which spectral modifications are performed. Two approaches are suggested to transform the spectrum: the first is the well-known Linear Multivariate Regression; the second is Dynamic Frequency Warping.
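To make the second spectral approach more concrete, the following is a minimal sketch of the general idea behind dynamic frequency warping: a dynamic-programming alignment of the frequency bins of a source spectrum onto those of a target spectrum. It is not the authors' implementation. The abstract does not give the formulation, so the function name `dynamic_frequency_warping`, the absolute-difference distance and the three allowed local steps are all assumptions chosen for illustration.

```python
def dynamic_frequency_warping(source, target):
    """Align two spectra bin by bin (illustrative sketch, not the paper's method).

    source, target: sequences of spectral values (e.g. log magnitudes).
    Returns (total_cost, path), where path is a list of (source_bin, target_bin)
    pairs running monotonically from (0, 0) to the last bin of each spectrum.
    """
    n, m = len(source), len(target)
    inf = float("inf")
    # cost[i][j]: minimal cumulative distance aligning source[:i+1] with target[:j+1]
    cost = [[inf] * m for _ in range(n)]
    back = [[None] * m for _ in range(n)]

    for i in range(n):
        for j in range(m):
            dist = abs(source[i] - target[j])  # assumed local distance
            if i == 0 and j == 0:
                cost[i][j] = dist
                continue
            # Assumed local constraints: step in source, in target, or in both.
            candidates = []
            if i > 0:
                candidates.append((cost[i - 1][j], (i - 1, j)))
            if j > 0:
                candidates.append((cost[i][j - 1], (i, j - 1)))
            if i > 0 and j > 0:
                candidates.append((cost[i - 1][j - 1], (i - 1, j - 1)))
            best_cost, best_prev = min(candidates, key=lambda c: c[0])
            cost[i][j] = dist + best_cost
            back[i][j] = best_prev

    # Backtrack from the last pair of bins to recover the warping path.
    path = []
    node = (n - 1, m - 1)
    while node is not None:
        path.append(node)
        node = back[node[0]][node[1]]
    path.reverse()
    return cost[n - 1][m - 1], path


# Toy spectra: the target has the same peak shifted up by one bin.
cost, path = dynamic_frequency_warping([0, 1, 5, 1, 0, 0], [0, 0, 1, 5, 1, 0])
```

The recovered path can be read as a warping function of the frequency axis: each pair says which target bin a source bin is mapped to. A practical system would apply such a function to spectral envelopes estimated from aligned speech frames, which this sketch does not attempt.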
| Original language | English |
|---|---|
| Pages | 345-348 |
| Number of pages | 4 |
| Publication status | Published - 1 Jan 1991 |
| Event | 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991 - Genova, Italy. Duration: 24 Sept 1991 → 26 Sept 1991 |
Conference
| Conference | 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991 |
|---|---|
| Country/Territory | Italy |
| City | Genova |
| Period | 24/09/91 → 26/09/91 |
Paper title: 'Voice Transformation Using PSOLA Technique'