Passer à la navigation principale Passer à la recherche Passer au contenu principal

Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Achieving reliable performance for a speech recogniser is an important challenge, especially in the context of mobile telephony applications where the user can access telephone functions through voice. This paper adresses the problem of speaker-dependent discrete utterance recognition in noise. Special reference is made to the mismatch effects due to the fact that training and testing are made in different environments. This contribution extends recently published work[11] where a robust HMM training/recognition framework is proposed. The present contribution introduces several new aspects: use of enhanced NSS schemes, introduction of root-MFCC parameters, use of dynamic features, training of HMMs by a dynamic inference scheme (DIHMM). These enhancements are discussed from tests performed on band limited signals (200-3000 Hz). We show that these various optimisations allow a rise from 20 % to over 99 % in performance. A 93% recognition rate is already achievable on raw data using a weighted modified projection and a root-MFCC dynamic representation.

langue originaleAnglais
titreICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing
EditeurInstitute of Electrical and Electronics Engineers Inc.
Pages265-268
Nombre de pages4
ISBN (Electronique)0780305329
Les DOIs
étatPublié - 1 janv. 1992
Modification externeOui
Evénement1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, États-Unis
Durée: 23 mars 199226 mars 1992

Série de publications

NomICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
ISSN (imprimé)1520-6149

Une conférence

Une conférence1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992
Pays/TerritoireÉtats-Unis
La villeSan Francisco
période23/03/9226/03/92

Empreinte digitale

Examiner les sujets de recherche de « Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation