Help Version 19.0.6.21
The LEADTOOLS .NET OCR Toolkit supports languages and spell checking through two separate parts: the language environment and the spell-checking subsystem.
The language environment defines the character set(s) recognized by the OCR engine. For example, if you enable the English and German languages, the German characters (ä, Ä, é, ö, Ö, ü, Ü, ß) will be combined with the English characters to define the set recognized by the engine.
To set the character sets to use in the engine, use the IOcrLanguageManager.EnableLanguages method. To get the character sets supported by the engine, use the IOcrLanguageManager.GetSupportedLanguages and IOcrLanguageManager.IsLanguageSupported methods. You can enable as many character sets as required.
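As an illustration of checking support before enabling languages, here is a minimal C# sketch. `OcrLanguageHelper` and `SelectSupported` are hypothetical names introduced for this example and are not part of the toolkit. The `isSupported` delegate stands in for IOcrLanguageManager.IsLanguageSupported, which is assumed here to take a language code string and return a bool.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of LEADTOOLS.
public static class OcrLanguageHelper
{
    // Splits the requested language codes into those reported as supported
    // and those that are not. Order is preserved and repeated codes
    // (compared case-insensitively) are dropped.
    // `isSupported` stands in for IOcrLanguageManager.IsLanguageSupported
    // (assumed shape: bool IsLanguageSupported(string)).
    public static string[] SelectSupported(
        IEnumerable<string> requested,
        Func<string, bool> isSupported,
        out string[] rejected)
    {
        var accepted = new List<string>();
        var skipped = new List<string>();
        var seen = new HashSet<string>(StringComparer.OrdinalIgnoreCase);

        foreach (string code in requested)
        {
            if (!seen.Add(code))
                continue; // already handled this code

            if (isSupported(code))
                accepted.Add(code);
            else
                skipped.Add(code);
        }

        rejected = skipped.ToArray();
        return accepted.ToArray();
    }
}
```

With a started engine, the result could then be passed to EnableLanguages, for example `ocrEngine.LanguageManager.EnableLanguages(OcrLanguageHelper.SelectSupported(new[] { "en", "de" }, ocrEngine.LanguageManager.IsLanguageSupported, out rejected))`. The `ocrEngine.LanguageManager` accessor and the string-array parameter are assumptions; check them against the IOcrLanguageManager reference.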
The language environment does not automatically perform spell checking. To enable it, use the spell-checking subsystem.
The functionality of the checking subsystem consists of three separate parts:
LEADTOOLS OCR supports spell checking and correction through the use of external dictionaries. The value of IOcrSpellCheckManager.SpellCheckEngine acts as a global switch to use a particular spell checker or turn spell checking off.
When you set the IOcrSpellCheckManager.SpellCheckEngine property to a value other than None, the OCR engine automatically tries to load the requested spell checker and queries the language dictionaries found on your machine. You can change SpellCheckEngine at any time during the life of the IOcrEngine, depending on your application's needs. For example, you can disable spell checking while recognizing certain types of documents and then re-enable it for other types.
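One way to switch spell checking off for a single recognition pass and then restore the previous setting is a small scoped helper. This is a sketch only: `ScopedSetting` and `RunWith` are hypothetical names, not toolkit members. The helper is generic so it does not depend on any LEADTOOLS type.

```csharp
using System;

// Hypothetical helper, not part of LEADTOOLS.
public static class ScopedSetting
{
    // Reads the current value through `get`, applies `temporary` through `set`,
    // runs `body`, and then restores the previous value even if `body` throws.
    public static void RunWith<T>(Func<T> get, Action<T> set, T temporary, Action body)
    {
        T previous = get();
        set(temporary);
        try
        {
            body();
        }
        finally
        {
            set(previous);
        }
    }
}
```

In an application, `get` and `set` would read and write IOcrSpellCheckManager.SpellCheckEngine, `temporary` would be the None value, and `body` would run the recognition step. How the manager is reached from the engine (for example, an `ocrEngine.SpellCheckManager` property) is an assumption to confirm in the IOcrSpellCheckManager reference.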
To query the languages that support a dictionary in an engine, use IOcrSpellCheckManager.GetSupportedSpellLanguages. You can use one language dictionary at a time inside the engine.
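Because only one dictionary is active at a time, an application typically has to choose one from the list the engine reports. The sketch below picks the first preferred language that has a dictionary. `SpellLanguagePicker` and `PickFirstAvailable` are hypothetical names. The `available` argument stands in for the result of GetSupportedSpellLanguages, assumed here to be a collection of language code strings.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of LEADTOOLS.
public static class SpellLanguagePicker
{
    // Returns the first code in `preferred` that also appears in `available`
    // (compared case-insensitively), or null when none of them has a dictionary.
    public static string PickFirstAvailable(
        IEnumerable<string> preferred,
        IEnumerable<string> available)
    {
        var have = new HashSet<string>(available, StringComparer.OrdinalIgnoreCase);

        foreach (string code in preferred)
        {
            if (have.Contains(code))
                return code;
        }

        return null;
    }
}
```

A null result is a reasonable signal to leave spell checking off for that document. The member used to select the chosen dictionary is not named in this topic, so consult the IOcrSpellCheckManager reference for it.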
For more information, refer to IOcrLanguageManager.
The following Asian character sets are available with the Asian OCR Module:
Chinese Simplified (zh-Hans) | Chinese Traditional (zh-Hant) | Japanese (ja) | Korean (ko)
Arabic (ar)
For more information, refer to IOcrSpellCheckManager and OcrSpellCheckEngine.
This feature is not yet supported for the Arabic OCR engine.