Hardcover: 428 pages
Publisher: Wiley (May 4, 1999)
Product Dimensions: 6.8 x 1.2 x 10 inches
Shipping Weight: 2.2 pounds
Average Customer Review: 3.6 out of 5 stars (7 customer reviews)
Best Sellers Rank: #2,311,854 in Books; #56 in Books > Computers & Technology > Software > Voice Recognition; #786 in Books > Textbooks > Computer Science > Artificial Intelligence; #1031 in Books > Computers & Technology > Programming > Languages & Tools > C & C++ > C++
This book is very useful for anyone who wants to implement a speech recognition system based on hidden Markov models (HMMs). The authors provide the source code of a complete system. Each chapter is divided into two parts: theory and implementation. Some implementation issues are of interest only to those who develop code in C. I think this implementation-oriented book is a good complement to a theory-oriented book such as "Fundamentals of Speech Recognition" by Rabiner and Juang. I really couldn't understand the appendix on "Econometric", which mainly discusses HREH (?) and only mentions HMMs at the very end. The references for this appendix are mixed in with the speech references, which is annoying.
I appreciated the balance between theory and implementation in the book, and the content covers the most important topics. Unfortunately, the book contains numerous typos and confusing choices of symbols, and the errors often fall in the most critical places. Take the explanation of HMM theory, for example: 0 and 1 are used to represent both the white and black balls and the two different urns, all in the same diagram. A reader trying to sort out which is which will be further confused by blatant errors where a 0 should be a 1. I am afraid many new readers will find the theory sections frustrating. The choice of C++ and the inclusion of a CD-ROM with full source are a nice touch, however. Just be aware that the code is not geared toward real-time recognition.
This book is composed of two parts, theory and implementation. If you read only the theoretical part, it is OK, but many details are missing and it is not clearly written. However, if you study its C++ code, you will get everything you want to know about a recognition system. I spent 3-8 hours every day for 4 months going through the code line by line. The C++ code (30,000 lines in total) is very well written but has no comments. Many times I needed to figure out things not written in the book. I once spent 1 week on 200 lines of code. However, after 4 months, I truly understand the system. You will find this book useful only if you really spend time covering its C++ code line by line. If you want theory only, go read other books. I rate its theory 2 stars and its implementation 5 stars.
For studying speech recognition, this is not the right book to buy. It is hard to understand the theory using this book. The C++ code works, but there should be more comments to make it easier for readers to follow. There is a free toolkit named HTK, available for download from the Internet, that contains full C code and a free book on the same theory, so it is actually a waste of money to buy this book!