Languages can be recognized if the OCR engine supports that language's character set. LEADTOOLS recognizes dozens of languages, enumerated in DOC2_LANGIDS.
In addition, many recognized languages also have spelling dictionaries, enabling spell-checking after character recognition is complete. Languages for which spell-checking is supported are listed below, along with their associated dictionary file.
All languages to be recognized can be selected using L_Doc2SelectLanguages / L_Doc2SelectLanguagesExt.
The language dictionary to use for spell-checking should be specified in the SpellLangId member of the pRecogOpts parameter passed to the L_Doc2Recognize / L_Doc2RecognizeExt function.
For more information on options for the recognition process, refer to RECOGNIZEOPTS2.
SPELL-CHECK LANGUAGE | LANGUAGE DICTIONARY FILE |
---|---|
Catalan | R_CAT.DAT |
Czech | R_CZH.DAT |
Danish | R_DAN.DAT |
Dutch | R_DUT.DAT |
English | R_ENG.DAT |
Finnish | R_FIN.DAT |
French | R_FRE.DAT |
German | R_GER.DAT |
Greek | R_GRE.DAT |
Hungarian | R_HUN.DAT |
Italian | R_ITA.DAT |
Norwegian | R_NOR.DAT |
Polish | R_POL.DAT |
Portuguese | R_POR.DAT |
Russian | R_RUS.DAT |
Slovenian | R_SLN.DMD |
Spanish | R_SPA.DAT |
Swedish | R_SWE.DAT |
✎ NOTE
Also requires the file ICHUNW32.DLL.
For more information, refer to: