Series: The Springer International Series in Engineering and Computer Science (Book 502)
Hardcover: 196 pages
Publisher: Springer; 1999 edition (May 31, 1999)
Language: English
ISBN-10: 079238492X
ISBN-13: 978-0792384922
Product Dimensions: 7 x 0.5 x 10 inches
Shipping Weight: 1.3 pounds
Average Customer Review: 4.0 out of 5 stars See all reviews (1 customer review)
Best Sellers Rank: #7,371,157 in Books (See Top 100 in Books) #15 in Books > Computers & Technology > Software > Optical Character Recognition #1295 in Books > Computers & Technology > Graphics & Design > Computer Modelling > Imaging Systems #1798 in Books > Computers & Technology > Computer Science > AI & Machine Learning > Computer Vision & Pattern Recognition
This book is not about OCR per se but about the different kinds of errors OCR systems make and how to correct them using context. It is therefore a specialized book. This book was published in 1999 and is therefore a little bit outdated. However, as someone who works in the area I think that the book is still relevant on the topics it covers. Before I continue I should say that the price currently listed is outrageous and I bought my book for much less.Optical Character Recognition, or OCR for short, is the technology, or art, of turning images with text into machine encoded text. This is not as easy as it sounds. For one thing the OCR software needs to analyze the stream of pixels, identify the relevant text blocks, figure out where the text lines are, find the words, then the individual characters, and then determine from the often noisy image block corresponding to the character what character the fuzzy blob in the picture really corresponds to. There are hundreds of fonts, many font sizes, a near infinite repertoire of different kinds of noise, distortions, smear, etc.In general different OCR systems tend to make similar errors. As an example, the string “rn” is often turned into the character “m” for many fonts almost regardless of what OCR system you use. This book uses three different OCR systems and analyzes the output from these systems in a number of situations in which OCR systems typically fail and the book also suggests remedies when applicable. The book contains an introduction (chapter one), a conclusion (chapter six), and 17 categories of errors spread over the four chapters in between. The book only considers OCR for printed text (not handwriting).
Optical Character Recognition: An Illustrated Guide to the Frontier (The Springer International Series in Engineering and Computer Science) Face Image Analysis by Unsupervised Learning (The Kluwer International Series in Engineering and Computer Science, Volume 612) (The Springer International Series in Engineering and Computer Science) Web Caching and Its Applications (The Springer International Series in Engineering and Computer Science) Applications of Digital Signal Processing to Audio and Acoustics (The Springer International Series in Engineering and Computer Science) Radiowave Propagation and Smart Antennas for Wireless Communications (The Springer International Series in Engineering and Computer Science) Face Image Analysis by Unsupervised Learning (The Springer International Series in Engineering and Computer Science) Analog Design Essentials (The Springer International Series in Engineering and Computer Science) Low Power Design Methodologies (The Springer International Series in Engineering and Computer Science) Analog Design for CMOS VLSI Systems (The Springer International Series in Engineering and Computer Science) Handbook of Optical Fibers and Cables, Second Edition (Optical Science and Engineering) Computer Speech: Recognition, Compression, Synthesis (Springer Series in Information Sciences) HACKING: Beginner's Crash Course - Essential Guide to Practical: Computer Hacking, Hacking for Beginners, & Penetration Testing (Computer Systems, Computer Programming, Computer Science Book 1) Jis-Kanji Character Recognition: Featuring the Gaiji Method/Book and Disk (Vnr Computer Library) Dan Frontier and the New House (Dan Frontier Series) Early Knives & Beaded Sheaths of the American Frontier (Of the American Frontier Series) Selected Papers on Optical Pattern Recognition (SPIE Milestone Series Vol. MS156) Photonics Rules of Thumb: Optics, Electro-Optics, Fiber Optics, and Lasers (Optical and Electro-Optical Engineerirng Series) Photonics: Optical Electronics in Modern Communications (The Oxford Series in Electrical and Computer Engineering) Introductory Logic and Sets for Computer Scientists (International Computer Science Series) Character Recognition Technologies: 1-2 February 1993 San Jose, California (Proceedings of Spie)