Robust Speech Recognition of Uncertain or Missing Data: Theory and Applications

Front Cover
Dorothea Kolossa, Reinhold Haeb-Umbach
Springer Science & Business Media, Jul 14, 2011 - Technology & Engineering - 380 pages
0 Reviews

Automatic speech recognition suffers from a lack of robustness with respect to noise, reverberation and interfering speech. The growing field of speech recognition in the presence of missing or uncertain input data seeks to ameliorate those problems by using not only a preprocessed speech signal but also an estimate of its reliability to selectively focus on those segments and features that are most reliable for recognition. This book presents the state of the art in recognition in the presence of uncertainty, offering examples that utilize uncertainty information for noise robustness, reverberation robustness, simultaneous recognition of multiple speech signals, and audiovisual speech recognition.

The book is appropriate for scientists and researchers in the field of speech recognition who will find an overview of the state of the art in robust speech recognition, professionals working in speech recognition who will find strategies for improving recognition results in various conditions of mismatch, and lecturers of advanced courses on speech processing or speech recognition who will find a reference and a comprehensive introduction to the field. The book assumes an understanding of the fundamentals of speech recognition using Hidden Markov Models.

 

What people are saying - Write a review

We haven't found any reviews in the usual places.

Contents

Introduction
1
Part I Theoretical Foundations
6
Noise Robustness
65
Reverberation Robustness
222
Multiple Speakers and Modalities
291
Index
376
Copyright

Other editions - View all

Common terms and phrases

About the author (2011)

Prof. Dr.-Ing. Dorothea Kolossa is a professor at the Institut für Kommunikationsakustik of the Ruhr-Universität Bochum, Germany; her research interests are automatic speech recognition, digital speech signal processing, and blind source separation.

Prof. Dr.-Ing. Reinhold Haeb-Umbach heads the Dept. of Communications Engineering of the University of Paderborn, Germany; his research interest are speech signal processing and automatic speech recognition, statistical learning and pattern recognition, and signal processing for digital communications.

Bibliographic information