Character set filter.
[SerializableAttribute()]
[FlagsAttribute()]
public enum OcrZoneCharacterFilters
<FlagsAttribute()>
<SerializableAttribute()>
Public Enum OcrZoneCharacterFilters
typedef NS_OPTIONS(NSUInteger, LTOcrZoneCharacterFilters)
public enum OcrZoneCharacterFilters
[FlagsAttribute()]
[SerializableAttribute()]
public enum class OcrZoneCharacterFilters
Value | Member | Description |
---|---|---|
0x00000000 | None | No character filters. |
0x00000001 | Digit | Recognition of numerals only. For example: "3" (Digit Three). |
0x00000002 | Uppercase | Recognition of uppercase letters only, including accented ones. For example: "A" (Capital A). |
0x00000004 | Lowercase | Recognition of lowercase letters only including accented ones. For example: "a" (Lowercase a). |
0x00000006 | Alpha | Upper and lowercase letters only. This is a combination of (Uppercase | Lowercase). |
0x00000008 | Punctuation | Recognition of punctuation signs only. For example: "!" (Exclamation Mark). |
0x00000010 | Miscellaneous | Recognition of other miscellaneous characters only. For example: "+" (Plus sign). |
0x0000001F | All | All characters. Since all elements are enabled, there is no filtering. This a combination of Digit (| Uppercase | Lowercase | Punctuation | Miscellaneous). |
0x00000020 | Plus | Enables the use of the "FilterPlus" characters. The FilterPlus characters are added after any filtering. For more information, refer to LEADTOOLS OCR Module - OmniPage Engine Settings. |
0x00000021 | Numbers | Digits plus the "FilterPlus" characters This is a combination of (Digit | Plus). |
This enumeration lists the available character set filter elements. The Language environment can be narrowed by specifying Character Set filters. The name of each filter element indicates which category of characters it validates. This enumeration is attributes with the FlagsAttribute and its members can be combined (OR-ed) together.
The filters can have an effect either at zone level (by specifying the zone's OcrZone.CharacterFilters property), or globally, at image level (defined by the "Recognition.DefaultCharacterFilter" setting).
The way to set no filtering is to give the value OcrZoneCharacterFilters.All.
Characters of the document that are not part of the specified character set will either be rejected or will be recognized as a validated character with a similar shape. For instance, if only the English language has been selected and the document contains a letter "Capital A with acute", then the recognized output will be a letter "Capital A"
Help Collections
Raster .NET | C API | C++ Class Library | HTML5 JavaScript
Document .NET | C API | C++ Class Library | HTML5 JavaScript
Medical .NET | C API | C++ Class Library | HTML5 JavaScript
Medical Web Viewer .NET
Multimedia
Direct Show .NET | C API | Filters
Media Foundation .NET | C API | Transforms
Supported Platforms
.NET, Java, Android, and iOS/macOS Assemblies
Imaging, Medical, and Document
C API/C++ Class Libraries
Imaging, Medical, and Document
HTML5 JavaScript Libraries
Imaging, Medical, and Document