Introduction
The LEADTOOLS OCR Module provides programming tools for quickly and easily adding document optical character recognition (OCR) technology into software applications. Using the LEADTOOLS OCR Module, programmers can perform character recognition on document images and output recognized text to over 20 file formats. The PDF OCR Plug-in extends the LEADTOOLS OCR Module to add PDF output support. Supported output formats include:
DOC, RTF, TXT, and XLS
Adobe PDF edited
HTML and XML
Open eBook 1.0
2G Type 2 and 2G Type 3
LEADTOOLS makes OCR development easier with auto-zone detection, manual zone creation, auto-orientation, document image clean up, and the use of preset values for common document images to improve recognition results. The LEADTOOLS OCR Module provides support for many languages, as well as output document options like document margins and paragraph options.
Key Features:
Add page(s) to the internal OCR list of pages.
Select the language to use in recognizing the OCR pages.
Recognize a variety of documents, including facsimiles, photocopies and documents with complex layouts.
Save the document in any of several text output formats.
Correct document characteristics such as noise, darkness, lightness to achieve the best possible character recognition.
Manually or automatically detect and select zones for recognition.
Use dictionaries for improving OCR results.
Display document pages, with or without their zones.
Additional Features:
Recognize text from 5 to 72 points in virtually any typeface.
Automatically detect available zones in the document pages.
Recognize multiple document pages at once and save recognition result to a single file.
Recognize multiple languages within one document.
Recognize and export text, choosing from a variety of text, word processing, database, or spreadsheet file formats.
Multiple specialized OCR recognition engines (modules) are supported: MOR, MTX, and FireWorX. Each document may contain multiple OCR zones, and each zone may use any of the OCR engines.
Supported Environments
The toolkit comes in Win32 and x64 editions that can support development of software applications for any of the following environments:
Windows Vista
Windows XP
Windows 2000
For more information, refer to:
An Overview of Recognition Modules
Programming with LEADTOOLS OCR
See Also: