Visual Basic (Declaration) | |
---|---|
<FlagsAttribute()> <SerializableAttribute()> Public Enum OcrZoneCharacterFilters Inherits Enum |
C# | |
---|---|
[FlagsAttribute()] [SerializableAttribute()] public enum OcrZoneCharacterFilters : Enum |
C++/CLI | |
---|---|
[FlagsAttribute()] [SerializableAttribute()] public enum class OcrZoneCharacterFilters : public Enum |
Member | Description |
---|---|
None | No character filters. |
Digit | Recognition of numerals only. For example: "3" (Digit Three). |
Uppercase | Recognition of uppercase letters only, including accented ones. For example: "A" (Capital A). |
Lowercase | Recognition of lowercase letters only, including accented ones. For example: "a" (Lowercase a). |
Punctuation | Recognition of punctuation signs only. For example: "!" (Exclamation Mark). |
Miscellaneous | Recognition of other miscellaneous characters only. For example: "+" (Plus sign). |
Plus | Enables the use of the "FilterPlus" characters. The FilterPlus characters are added after any filtering. For more information, refer to LEADTOOLS OCR Plus Engine Settings. |
All | All characters. Since all elements are enabled, there is no filtering. This is a combination of (Digit | Uppercase | Lowercase | Punctuation | Miscellaneous). |
Alpha | Upper and lowercase letters only. This is a combination of (Uppercase | Lowercase). |
Numbers | Digits plus the "FilterPlus" characters. This is a combination of (Digit | Plus). |
This enumeration lists the available character set filter elements. The language environment can be narrowed by specifying character set filters. The name of each filter element denotes which category of characters it validates. This enumeration is marked with the FlagsAttribute, so its members can be combined (OR-ed) together.
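The OR-combination described above can be sketched as follows. This is a minimal, self-contained illustration only: the local `CharFilters` enum is a hypothetical stand-in for OcrZoneCharacterFilters, and its numeric values are assumptions chosen so that the flags do not overlap. The actual values defined in Leadtools.Forms.Ocr are not stated on this page.

```csharp
using System;

// Hypothetical stand-in for OcrZoneCharacterFilters. The numeric values are
// assumptions for illustration, not the values used by Leadtools.Forms.Ocr.
[Flags]
public enum CharFilters
{
    None = 0,
    Digit = 1,
    Uppercase = 2,
    Lowercase = 4,
    Punctuation = 8,
    Miscellaneous = 16,
    Plus = 32,
    All = Digit | Uppercase | Lowercase | Punctuation | Miscellaneous,
    Alpha = Uppercase | Lowercase,
    Numbers = Digit | Plus
}

public static class FilterDemo
{
    // Returns true when every bit of 'flag' is set in 'filters'.
    public static bool Includes(CharFilters filters, CharFilters flag)
    {
        return (filters & flag) == flag;
    }

    public static void Main()
    {
        // Restrict recognition to digits and uppercase letters.
        CharFilters filters = CharFilters.Digit | CharFilters.Uppercase;

        Console.WriteLine(Includes(filters, CharFilters.Digit));
        Console.WriteLine(Includes(filters, CharFilters.Lowercase));
        Console.WriteLine(Includes(CharFilters.Alpha, CharFilters.Uppercase));
    }
}
```

With the real library, the same kind of combined value would be assigned to a zone's OcrZone.CharacterFilters property. The bitwise test is used here instead of Enum.HasFlag because that method is not available on .NET Framework 3.0.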
The filters can have an effect either at zone level (by specifying the zone's OcrZone.CharacterFilters property), or globally, at image level (defined by the "Recognition.DefaultCharacterFilter" setting).
To disable filtering, specify the value OcrZoneCharacterFilters.All.
Characters in the document that are not part of the specified character set will either be rejected or be recognized as a validated character with a similar shape. For instance, if only the English language has been selected and the document contains a letter "Capital A with acute", then the recognized output will be a letter "Capital A".
The recognition module selected for recognition can also impose restrictions, e.g. the OcrZoneRecognitionModule.IcrNumeral module is restricted to numerals and four other characters.
Not all recognition modules support all filter elements:
Recognition module | Character filters supported |
---|---|
OcrZoneRecognitionModule.OmniFontMText | OcrZoneCharacterFilters.All, OcrZoneCharacterFilters.Digit and OcrZoneCharacterFilters.Alpha |
OcrZoneRecognitionModule.OmniFontMor | All filters |
OcrZoneRecognitionModule.DotMatrix | All filters |
OcrZoneRecognitionModule.Omr | None (All ignored) |
OcrZoneRecognitionModule.IcrNumeral | OcrZoneCharacterFilters.All, OcrZoneCharacterFilters.Digit, OcrZoneCharacterFilters.Punctuation and OcrZoneCharacterFilters.Miscellaneous |
OcrZoneRecognitionModule.IcrCharacter | All filters |
OcrZoneRecognitionModule.MatrixMatching | All filters |
OcrZoneRecognitionModule.OmniFontPlus2WayVoting | All filters |
OcrZoneRecognitionModule.OmniFontFireWorx | All filters |
OcrZoneRecognitionModule.OmniFontPlus3WayVoting | All filters |
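The support table above can be encoded as a small lookup so that a filter value is checked before it is assigned to a zone. This is a sketch under stated assumptions: `CharFilters` and `RecognitionModule` are hypothetical local stand-ins for OcrZoneCharacterFilters and OcrZoneRecognitionModule with assumed numeric values, and the listed filters are interpreted as the exact values a restricted module accepts. The table does not say whether other combinations of those values are also accepted.

```csharp
using System;

// Hypothetical stand-in for OcrZoneCharacterFilters (assumed values).
[Flags]
public enum CharFilters
{
    None = 0,
    Digit = 1,
    Uppercase = 2,
    Lowercase = 4,
    Punctuation = 8,
    Miscellaneous = 16,
    Plus = 32,
    All = Digit | Uppercase | Lowercase | Punctuation | Miscellaneous,
    Alpha = Uppercase | Lowercase,
    Numbers = Digit | Plus
}

// Hypothetical stand-in for OcrZoneRecognitionModule (member names from the table).
public enum RecognitionModule
{
    OmniFontMText, OmniFontMor, DotMatrix, Omr, IcrNumeral, IcrCharacter,
    MatrixMatching, OmniFontPlus2WayVoting, OmniFontFireWorx, OmniFontPlus3WayVoting
}

public static class FilterSupport
{
    // Returns true when the table lists 'filters' as supported by 'module'.
    public static bool IsSupported(RecognitionModule module, CharFilters filters)
    {
        switch (module)
        {
            case RecognitionModule.Omr:
                // The table lists no supported filters; all are ignored.
                return false;
            case RecognitionModule.OmniFontMText:
                return filters == CharFilters.All
                    || filters == CharFilters.Digit
                    || filters == CharFilters.Alpha;
            case RecognitionModule.IcrNumeral:
                return filters == CharFilters.All
                    || filters == CharFilters.Digit
                    || filters == CharFilters.Punctuation
                    || filters == CharFilters.Miscellaneous;
            default:
                // Every other module in the table supports all filters.
                return true;
        }
    }

    public static void Main()
    {
        Console.WriteLine(IsSupported(RecognitionModule.OmniFontMText, CharFilters.Alpha));
        Console.WriteLine(IsSupported(RecognitionModule.OmniFontMText, CharFilters.Punctuation));
        Console.WriteLine(IsSupported(RecognitionModule.Omr, CharFilters.All));
    }
}
```

A check like this only mirrors the documentation. The engine's actual behavior when given an unsupported filter is not described on this page.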
System.Object
System.ValueType
System.Enum
Leadtools.Forms.Ocr.OcrZoneCharacterFilters
Target Platforms: Microsoft .NET Framework 3.0, Windows XP, Windows Server 2003 family, Windows Server 2008 family
Reference
Leadtools.Forms.Ocr Namespace
IOcrPage.AutoZone
OcrZoneRecognitionModule Enumeration
OcrZoneFillMethod Enumeration
OcrEngineManager Class
OcrEngineType Enumeration
IOcrPageCollection Interface
IOcrZoneCollection Interface
OcrZone Structure
Programming with Leadtools .NET OCR
LEADTOOLS OCR Plus Engine Settings