typedef enum
{
DOC2_ZONE_CHAR_FILTER_DEFAULT = 0x0000,
DOC2_ZONE_CHAR_FILTER_DIGIT = 0x0001,
DOC2_ZONE_CHAR_FILTER_UPPERCASE = 0x0002,
DOC2_ZONE_CHAR_FILTER_LOWERCASE = 0x0004,
DOC2_ZONE_CHAR_FILTER_PUNCTUATION = 0x0008,
DOC2_ZONE_CHAR_FILTER_MISCELLANEOUS = 0x0010,
DOC2_ZONE_CHAR_FILTER_PLUS = 0x0020,
DOC2_ZONE_CHAR_FILTER_ALL = 0x0040,
DOC2_ZONE_CHAR_FILTER_ALPHA = 0x0080,
DOC2_ZONE_CHAR_FILTER_NUMBERS = 0x0100,
DOC2_ZONE_CHAR_FILTER_OCRA = 0x0200,
DOC2_ZONE_CHAR_FILTER_USER_DICTIONARY = 0x0400,
} DOC2_CHAR_FILTER;
The DOC2_CHAR_FILTER enumerated type represents the possible character set filters for a zone.
Value | Meaning |
---|---|
DOC2_ZONE_CHAR_FILTER_DEFAULT | Have the zone handled globally. Do not combine this with any other filter. |
DOC2_ZONE_CHAR_FILTER_DIGIT | Recognize numerals only. Example: "3" (Digit Three).Note: Be sure to use this value whenever you are using the OmniPage engine to recognize ICR numeric characters. Other values are not recommended for this situation. |
DOC2_ZONE_CHAR_FILTER_UPPERCASE | Recognize only uppercase letters, including accented ones. Example: "A" (Capital A). |
DOC2_ZONE_CHAR_FILTER_LOWERCASE | Recognize only lowercase letters, including accented ones. Example: "a" (Lowercase a). |
DOC2_ZONE_CHAR_FILTER_PUNCTUATION | Recognize only punctuation signs. Example.: "!" (Exclamation Mark). |
DOC2_ZONE_CHAR_FILTER_MISCELLANEOUS | Recognize only miscellaneous characters. Example: "+" (Plus sign). |
DOC2_ZONE_CHAR_FILTER_PLUS | Enable the use of the Filter Plus characters specified by the pCHAROPTIONS2 structure. The Filter Plus characters are added to the language environment after any filtering. This filter can also be used to achieve a very limited character set. |
DOC2_ZONE_CHAR_FILTER_ALL | Enable all elements ( no filtering). |
DOC2_ZONE_CHAR_FILTER_ALPHA | Recognize only upper and lowercase letters. |
DOC2_ZONE_CHAR_FILTER_NUMBERS | Recognize digits, plus the Filter Plus characters set by the L_Doc2SetCharLangsOptions function. |
DOC2_ZONE_CHAR_FILTER_OCRA | Recognize the OCR-A characters with digit characters. |
DOC2_ZONE_CHAR_FILTER_USER_DICTIONARY | Recognize characters from the user dictionary section assigned to the zone. |
✎ NOTE User words and dictionaries are no longer supported in the LEADTOOLS OCR Module - OmniPage Engine. |
The enumeration is used by:
✎ NOTE
Characters in the document that are not part of the specified Character Set will either be rejected or will be recognized as a validated character with a similar shape.
The capabilities of the selected recognition module can also impose restrictions, e.g. the ASIAN module does not support character filters so these filters will not work with Asian recognition module while the Asian language is enabled.