This book provides a comprehensive overview of recent advances in automatic speech recognition, with a focus on deep learning models, including deep neural networks and many of their variants. It is the first automatic speech recognition book dedicated to the deep learning approach. In addition to a rigorous mathematical treatment of the subject, the book presents insights into, and the theoretical foundations of, a series of highly successful deep learning models.
The book reviews past and present work on discriminative and hierarchical models for both acoustic and language modeling. It also analyzes the research directions and trends pointing toward future-generation speech recognition.