Session
The Tesseract OCR Engine
Ray Smith, Staff Software Engineer, Google Inc
Track: Desktop Applications
Date: Thursday, July 26
Time: 11:35am
- 12:20pm
Location: D133
The Tesseract Optical Character Recognition (OCR) engine is described. The talk will cover its history, where it came from, how it works, and how it can be trained for additional languages. The new features of version 2.0 will be described, and some ideas for future development will be proposed.





















