
LEADTOOLS OCR SDK - Programming Tools

Programming tools for adding OCR technology to software applications quickly and easily.

LEADTOOLS OCR Module: Optical Character Recognition for Documents

Do you need to convert reams of paper into digital documents? Do you need to automatically extract text and other important information from your scanned documents?

The award-winning LEADTOOLS OCR Module gives programmers everything needed to develop robust, high-performance, and scalable document imaging solutions that support optical character recognition. LEAD has leveraged its 17+ years of experience in developing and supporting imaging SDKs to extend the best-of-breed OmniPage OCR engine published by Nuance Communications, Inc. This combination gives you everything you expect from a LEADTOOLS product (an elegant and easy-to-use set of APIs, excellent documentation, quality code samples in multiple programming languages, and the best technical support in the business) together with the unmatched speed, accuracy, language, and output format support offered by the OmniPage recognition engine. Reduce your time to market without compromising quality or dropping features.

LEAD's OCR tools include APIs, COM and .NET support. The PDF OCR Plug-in extends the LEADTOOLS OCR Module to add PDF output support.

LEADTOOLS OCR Features:

  • Extension. The OCR features extend the functionality of the LEADTOOLS Document Imaging SDK by providing OCR-specific properties, methods, and events for easily incorporating an optical character recognition engine into your applications.
  • Zones. You can add many zoned areas to the same page, and each zone has its own options, such as the recognition module, filter, and more.
  • Options. OCR development is made easy by pre-setting OCR options that work for most images by default.
  • Dictionaries. To improve recognition results, LEADTOOLS OCR supports custom dictionaries to recognize words that may only exist in the documents being recognized.
  • Languages. Support for over 20 languages, including English and all major European and Scandinavian languages (Danish, Dutch, Finnish, French, German, Italian, Norwegian, Portuguese, Russian, Spanish, and Swedish). Documents that contain multiple languages can also be recognized.
  • Output. Set output document options such as document margins, paragraph options, and more.
  • Fonts. LEADTOOLS supports a variety of different fonts, sizes (5 to 72 point) and styles.
  • Simplify. Recognize multiple documents with multiple pages and combine and save recognition results to a single file.
  • Text/graphics. Process both text and graphics. LEADTOOLS can distinguish graphics from text, which provides the basis for creating a compound document processing system.
  • Export. Recognized text can be exported to a variety of text, word processing, database, or spreadsheet file formats including MS Word, XML, PDF, WordPerfect and more.
  • Speed. LEADTOOLS provides superior OCR processing speed for use in form recognition and processing applications.
  • Accuracy.
  • Integration. Integrate OCR capabilities with other LEADTOOLS SDKs to complete your imaging development by gaining access to image processing, document cleanup, image file format support, image display, and image capture functionality.

Output formats supported

  • Microsoft Word DOC
  • Adobe PDF
  • Adobe PDF with image substitutes
  • Adobe PDF with image on text
  • Adobe PDF image only
  • Adobe PDF edited
  • RTF
  • HTML
  • TXT
  • XML
  • Microsoft Excel XLS
  • Open eBook 1.0
  • 2G Type 2 and 3
  • and more

OCR SDK benefits:

  • You can create your own form for a page using manual zones.
  • You can use LEADTOOLS Image Processing functions, such as the cleanup functions, to improve the page and get the best recognition results.
  • You can add registration marks using LEADTOOLS Image Processing functions to form zones, create zones, and recognize all form pages using the LEADTOOLS OCR toolkit.
  • When a page to be processed exceeds A3 page size, you can divide the page into smaller pages and perform the OCR processing on each.
  • The OCR engine works well with low-resolution pages, such as those produced when a document is printed on a dot-matrix printer.
  • Recognize a variety of documents, including facsimiles, photocopies, and documents with complex layouts.
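
The page-splitting idea in the list above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `split_page` is a hypothetical helper, not part of the LEADTOOLS API, and it takes the A3 figure (11.69" x 16.54") quoted for the engines as the tile limit. It returns pixel rectangles that an application could crop and send to OCR one at a time.

```python
import math

# A3 in inches, as quoted for the engines on this page (assumed tile limit).
A3_INCHES = (11.69, 16.54)


def split_page(width_px, height_px, dpi=300, max_inches=A3_INCHES):
    """Return (left, top, right, bottom) tiles, each no larger than A3 at `dpi`.

    Hypothetical helper for illustration; not a LEADTOOLS function.
    """
    max_w = round(max_inches[0] * dpi)
    max_h = round(max_inches[1] * dpi)
    # Treat wide pages as landscape so an A3 landscape page is not split.
    if width_px > height_px:
        max_w, max_h = max_h, max_w
    cols = math.ceil(width_px / max_w)
    rows = math.ceil(height_px / max_h)
    tiles = []
    for r in range(rows):
        for c in range(cols):
            left, top = c * max_w, r * max_h
            right = min(left + max_w, width_px)
            bottom = min(top + max_h, height_px)
            tiles.append((left, top, right, bottom))
    return tiles


# A page twice as wide as A3 portrait at 300 dpi is cut into several tiles.
for tile in split_page(7014, 4962):
    print(tile)
```

Cutting on a fixed grid can slice through a line of text, so a real application would probably overlap the tiles slightly or cut along white space; that refinement is left out of this sketch.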

Three specialized OCR recognition engines are supported:

MOR OCR Engine

This module recognizes machine printed text (i.e. from printed publications, laser or ink-jet printers and electric typewriters). Output from mechanical typewriters in good condition may also be acceptable. It should also be used for letter or near letter quality (LQ, NLQ) output from dot-matrix printers.

This module can safely handle A3 size (11.69" x 16.54") both portrait and landscape images with 300 dpi resolution.

  • Supports up to 500 zones on one image
  • Supports Omnifont, Draftdot24 and OCR-A filling methods
  • Provides 3 page-level accuracy and speed trade-off settings: Accurate, Balanced, and Fast
  • Provides Checking Subsystem based correction

MTX (Mtext) OCR Engine

This recognition module recognizes machine printed text (i.e. from printed publications, laser or ink-jet printers and electric typewriters). Output from mechanical typewriters in good condition may also be acceptable. It should also be used for letter or near letter quality (LQ, NLQ) output from dot-matrix printers, and can also be used for draft quality output. Only images within the following resolution ranges are supported: 90-110, 160-240, 280-320, 400, and 600 dpi. This module does not process images larger than 6600 pixels in either width or height; that is, it can safely handle A3 size (11.69" x 16.54") portrait and landscape images at 300 dpi resolution.

  • The fastest of the selectable OCR engines
  • Supports up to 64 zones on one image
  • Supports Omnifont, Draftdot9 and Draftdot24 filling methods
  • Provides 2 page-level accuracy and speed trade-off settings: a combined Accurate & Balanced setting and Fast
  • Provides Checking Subsystem based correction
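
The MTX limits listed above (resolution ranges, the 6600-pixel cap, and the 64-zone maximum) lend themselves to a pre-flight check before an image is handed to the engine. The following is a minimal sketch, assuming the resolution ranges are in dpi; `mtx_problems` and the constant names are hypothetical and are not LEADTOOLS identifiers.

```python
# Limits quoted on this page for the MTX engine (ranges assumed to be in dpi).
MTX_DPI_RANGES = [(90, 110), (160, 240), (280, 320), (400, 400), (600, 600)]
MTX_MAX_PIXELS = 6600
MTX_MAX_ZONES = 64


def mtx_problems(width_px, height_px, dpi, zone_count=0):
    """Return the reasons an image falls outside the MTX limits (empty if none).

    Hypothetical helper for illustration; not a LEADTOOLS function.
    """
    problems = []
    if not any(lo <= dpi <= hi for lo, hi in MTX_DPI_RANGES):
        problems.append(f"resolution {dpi} dpi is outside the supported ranges")
    if max(width_px, height_px) > MTX_MAX_PIXELS:
        problems.append(f"image exceeds {MTX_MAX_PIXELS} pixels in width or height")
    if zone_count > MTX_MAX_ZONES:
        problems.append(f"{zone_count} zones exceeds the limit of {MTX_MAX_ZONES}")
    return problems


# A3 portrait at 300 dpi: 11.69" x 16.54" is about 3507 x 4962 pixels.
a3 = (round(11.69 * 300), round(16.54 * 300))
print(a3, mtx_problems(*a3, dpi=300))
```

An A3 page at 300 dpi comes to roughly 3507 x 4962 pixels, which is under the 6600-pixel cap, so the check reports no problems for it. This matches the statement that the module can safely handle A3 at that resolution.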

FireWorX OCR Engine

This module recognizes machine printed text (i.e. from printed publications, laser or ink-jet printers and electric typewriters). Output from mechanical typewriters in good condition may also be acceptable. It should also be used for letter or near letter quality (NLQ, LQ) output from dot-matrix printers.

  • Optimized for speed
  • Supports up to 2,500 zones on one image
  • Supports Omnifont filling methods
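
The zone limits and filling methods of the three engines can be collected into a small lookup table, which makes it easier to see which engines fit a given job. This is a sketch only: the figures are copied from the engine descriptions above, while `EngineLimits`, `ENGINES`, and `candidate_engines` are hypothetical names that do not come from the LEADTOOLS API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EngineLimits:
    max_zones: int
    filling_methods: frozenset


# Figures copied from the three engine descriptions on this page.
ENGINES = {
    "MOR": EngineLimits(500, frozenset({"Omnifont", "Draftdot24", "OCR-A"})),
    "MTX": EngineLimits(64, frozenset({"Omnifont", "Draftdot9", "Draftdot24"})),
    "FireWorX": EngineLimits(2500, frozenset({"Omnifont"})),
}


def candidate_engines(zone_count, filling_method="Omnifont"):
    """Return the engines whose listed limits cover the request.

    Hypothetical helper for illustration; not a LEADTOOLS function.
    """
    return [
        name
        for name, limits in ENGINES.items()
        if zone_count <= limits.max_zones
        and filling_method in limits.filling_methods
    ]


print(candidate_engines(100))               # zone count rules out MTX
print(candidate_engines(50, "Draftdot9"))   # only MTX lists Draftdot9
```

The table covers only the limits stated here; speed and accuracy differences between the engines would still need to be weighed separately.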

Specialized Recognition Modules may be added on to the LEADTOOLS OCR Module:

OMR (Optical Mark Recognition) module:

This recognition module is used for recognizing optical marks (checkmarks).

Typical application areas are:

  • Questionnaires
  • Ballot papers
  • Educational tests
  • Reporting or ordering sheets

Documents to be processed are form-like and filled in by respondents, usually by hand.

ICR Recognition Modules:

  • This recognition module can be used to recognize hand-printed numerals (0-9) and four additional signs (+ - . ,).
  • It can also recognize hand-printed text.

MAT Matrix Matching Recognition Module:

This module is designed to read certain groups of fixed-font characters specially designed for OCR or imaging applications, in which no two characters have similar shapes. Each character group has its own filling method.

Application areas are in:

  • Banking
  • Check or waybill handling
  • Product distribution and document validation, where high accuracy can be vital.
  • Non-fixed print styles

Not all products may include all of the above functionality. Special notations have been added to help you determine what product you need. If you have any questions, contact sales@LEADTOOLS.com.

The OCR SDK and related products are available below.

Pricing Structure

  • LEADTOOLS OCR Module: $1995

The OCR plug-in is included in the following toolkit:

  • LEADTOOLS Document Imaging Suite: $3995

The OCR plug-in can be added to the following toolkits:

  • LEADTOOLS Medical Imaging SDK: $4495
  • LEADTOOLS Medical Imaging Suite: $7995

The OCR PDF plug-in can be added on to OCR:

  • *LEADTOOLS PDF OCR Plug-in: $1495

†Marked toolkits require runtime licensing based on the deployment of the application you develop. Several purchase options are available. For more information, please contact oemsales@leadtools.com or call a LEAD sales representative.

* LEADTOOLS PDF OCR Plug-in is required to output PDF.

LEADTOOLS Sales: 704-332-5532 | sales@leadtools.com
LEADTOOLS Support: 704-372-9681 | support@leadtools.com
