Robust Speech Recognition of Noisy or Reverberated Data Using Multiple Recognizers in Different Energy Bands
2015. 100 S. 220 mm
Verlag/Jahr: AV AKADEMIKERVERLAG 2015
ISBN: 3-639-86663-0 (3639866630)
Neue ISBN: 978-3-639-86663-6 (9783639866636)
This book focuses on automatic speech recognition in clean and noisy or reverbrant environments. Therefore, a parallel speech recognition system using TempoRAL Patterns (TRAPs) is described. The TRAPs are computed over a rather long temporal context for each critical band in the signal´s spectrum. Then, the features of the different bands are combined. Thus recognition only in certain bands is possible. This is beneficial if noise only occurs in parts of the spectrum. In this manner multiple speech recognizers are trained which analyze disjoint parts of the frequency domain. Each of the speech recognizers extracts a different word chain from the audio signal. In the end the word chains are merged to form a single recognition result. As shown on different data sets the parallel speech recognition system is much more robust to noise and reverberation than the state-of-the-art baseline system.Andreas Maier was born on 26th of November 1980 in Erlangen. He studied Computer Science and graduated in 2005. Since October 2005 he is working at the Chair of Pattern Recognition at the Computer Science Department of the University Erlangen-Nuremberg. His major research subject is recognition of pathologic speech.