Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments

P. Lockwood, J. Boudy, M. Blanchet

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Achieving reliable performance for a speech recogniser is an important challenge, especially in the context of mobile telephony applications where the user can access telephone functions through voice. This paper adresses the problem of speaker-dependent discrete utterance recognition in noise. Special reference is made to the mismatch effects due to the fact that training and testing are made in different environments. This contribution extends recently published work[11] where a robust HMM training/recognition framework is proposed. The present contribution introduces several new aspects: use of enhanced NSS schemes, introduction of root-MFCC parameters, use of dynamic features, training of HMMs by a dynamic inference scheme (DIHMM). These enhancements are discussed from tests performed on band limited signals (200-3000 Hz). We show that these various optimisations allow a rise from 20 % to over 99 % in performance. A 93% recognition rate is already achievable on raw data using a weighted modified projection and a root-MFCC dynamic representation.

Original languageEnglish
Title of host publicationICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages265-268
Number of pages4
ISBN (Electronic)0780305329
DOIs
Publication statusPublished - 1 Jan 1992
Externally publishedYes
Event1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, United States
Duration: 23 Mar 199226 Mar 1992

Publication series

NameICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume1
ISSN (Print)1520-6149

Conference

Conference1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992
Country/TerritoryUnited States
CitySan Francisco
Period23/03/9226/03/92

Fingerprint

Dive into the research topics of 'Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise environments'. Together they form a unique fingerprint.

Cite this