typedef enum
{
DOC2_TEXT,
DOC2_UTEXT,
DOC2_FORMATTED_TEXT,
DOC2_UFORMATTED_TEXT,
DOC2_TEXT_LINEBREAKS,
DOC2_UTEXT_LINEBREAKS,
DOC2_TEXT_CSV,
DOC2_TEXT_UCSV,
DOC2_PDF,
DOC2_PDF_IMAGE_SUBSTITUTES,
DOC2_PDF_IMAGE_ON_TEXT,
DOC2_PDF_EDITED,
DOC2_XML,
DOC2_HTML_3_2,
DOC2_HTML_4_0,
DOC2_RTF_6,
DOC2_RTF_97,
DOC2_RTF_2000,
DOC2_RTF_WORD_2000,
DOC2_WORD_2000,
DOC2_WORD_97,
DOC2_EXCEL_97,
DOC2_EXCEL_2000,
DOC2_PPT_97,
DOC2_PUB_98,
DOC2_MICROSOFT_READER,
DOC2_WORDML,
DOC2_WORDPERFECT_8,
DOC2_WORDPERFECT_10,
DOC2_WORDPAD,
DOC2_INFOPATH,
DOC2_EBOOK,
DOC2_PDFA_IMAGE_ON_TEXT,
DOC2_PDFA_TEXT_ONLY,
DOC2_WORD_2007,
DOC2_EXCEL_2007,
} DOC2_FORMATTYPE;
The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.
Value |
Meaning |
DOC2_TEXT |
Simple text output format |
DOC2_UTEXT |
Unicode text output format |
DOC2_FORMATTED_TEXT |
Retain the layout of the page by inserting extra spaces |
DOC2_UFORMATTED_TEXT |
Same as Formatted Text, but using Unicode characters |
DOC2_TEXT_LINEBREAKS |
Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs |
DOC2_UTEXT_LINEBREAKS |
Same as text with line breaks, but using Unicode characters. |
DOC2_TEXT_CSV |
Write the recognized text as a table (Comma delimited text file) that can be read by Excel |
DOC2_TEXT_UCSV |
Same as Text CSV, but using Unicode characters |
DOC2_PDF |
Adobe PDF file. Text only. |
DOC2_PDF_IMAGE_SUBSTITUTES |
Adobe PDF file with image substitutes |
DOC2_PDF_IMAGE_ON_TEXT |
Adobe PDF with image on text |
DOC2_PDF_EDITED |
Adobe PDF edited |
DOC2_XML |
XML output format |
DOC2_HTML_3_2 |
HTML 3.2 output format |
DOC2_HTML_4_0 |
HTML 4.0 output format |
DOC2_RTF_6 |
RTF 6 |
DOC2_RTF_97 |
RTF that can only be interpreted by Microsoft Word 97 and up |
DOC2_RTF_2000 |
RTF that can only be interpreted by Microsoft Word 2000 and up |
DOC2_RTF_WORD_2000 |
RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up |
DOC2_WORD_2000 |
Word file that can only be interpreted by Microsoft Word 2000 and up |
DOC2_WORD_97 |
Word file that can only be interpreted by Microsoft Word 97 and up |
DOC2_EXCEL_97 |
Microsoft Excel 97 binary file |
DOC2_EXCEL_2000 |
Microsoft Excel 2000 binary file |
DOC2_PPT_97 |
Microsoft Power Point 97 |
DOC2_PUB_98 |
Microsoft Publisher 98 |
DOC2_MICROSOFT_READER |
Microsoft Reader convertor |
DOC2_WORDML |
Word ML convertor |
DOC2_WORDPERFECT_8 |
WordPerfect 8 convertor |
DOC2_WORDPERFECT_10 |
WordPerfect 10 convertor |
DOC2_WORDPAD |
Word Pad convertor |
DOC2_INFOPATH |
Info Path convertor |
DOC2_EBOOK |
eBook convertor |
DOC2_PDFA_IMAGE_ON_TEXT |
PDF/A Image on text. |
DOC2_PDFA_TEXT_ONLY |
PDF/A Text only. |
DOC2_WORD_2007 |
Microsoft Word Document Format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.). |
DOC2_EXCEL_2007 |
Microsoft Excel Spreadsheet Format (XLSX). |
This enumerated type is used by the following structure: