typedef enum
{
DOC2_TEXT,
DOC2_UTEXT,
DOC2_FORMATTED_TEXT,
DOC2_UFORMATTED_TEXT,
DOC2_TEXT_LINEBREAKS,
DOC2_UTEXT_LINEBREAKS,
DOC2_TEXT_CSV,
DOC2_TEXT_UCSV,
DOC2_PDF,
DOC2_PDF_IMAGE_SUBSTITUTES,
DOC2_PDF_IMAGE_ON_TEXT,
DOC2_PDF_EDITED,
DOC2_XML,
DOC2_HTML_3_2,
DOC2_HTML_4_0,
DOC2_RTF_6,
DOC2_RTF_97,
DOC2_RTF_2000,
DOC2_RTF_WORD_2000,
DOC2_WORD_2000,
DOC2_WORD_97,
DOC2_EXCEL_97,
DOC2_EXCEL_2000,
DOC2_PPT_97,
DOC2_PUB_98,
DOC2_MICROSOFT_READER,
DOC2_WORDML,
DOC2_WORDPERFECT_8,
DOC2_WORDPERFECT_10,
DOC2_WORDPAD,
DOC2_INFOPATH,
DOC2_EBOOK,
DOC2_PDFA_IMAGE_ON_TEXT,
DOC2_PDFA_TEXT_ONLY,
DOC2_WORD_2007,
DOC2_EXCEL_2007,
} DOC2_FORMATTYPE;
The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.
Value | Meaning |
---|---|
DOC2_TEXT | Simple text output format |
DOC2_UTEXT | Unicode text output format |
DOC2_FORMATTED_TEXT | Retain the layout of the page by inserting extra spaces |
DOC2_UFORMATTED_TEXT | Same as Formatted Text, but using Unicode characters |
DOC2_TEXT_LINEBREAKS | Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs |
DOC2_UTEXT_LINEBREAKS | Same as text with line breaks, but using Unicode characters. |
DOC2_TEXT_CSV | Write the recognized text as a table (Comma delimited text file) that can be read by Excel |
DOC2_TEXT_UCSV | Same as Text CSV, but using Unicode characters |
DOC2_PDF | Adobe PDF file. Text only. |
DOC2_PDF_IMAGE_SUBSTITUTES | Adobe PDF file with image substitutes |
DOC2_PDF_IMAGE_ON_TEXT | Adobe PDF with image on text |
DOC2_PDF_EDITED | Adobe PDF edited |
DOC2_XML | XML output format |
DOC2_HTML_3_2 | HTML 3.2 output format |
DOC2_HTML_4_0 | HTML 4.0 output format |
DOC2_RTF_6 | RTF 6 |
DOC2_RTF_97 | RTF that can only be interpreted by Microsoft Word 97 and up |
DOC2_RTF_2000 | RTF that can only be interpreted by Microsoft Word 2000 and up |
DOC2_RTF_WORD_2000 | RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up |
DOC2_WORD_2000 | Word file that can only be interpreted by Microsoft Word 2000 and up |
DOC2_WORD_97 | Word file that can only be interpreted by Microsoft Word 97 and up |
DOC2_EXCEL_97 | Microsoft Excel 97 binary file |
DOC2_EXCEL_2000 | Microsoft Excel 2000 binary file |
DOC2_PPT_97 | Microsoft Power Point 97 |
DOC2_PUB_98 | Microsoft Publisher 98 |
DOC2_MICROSOFT_READER | Microsoft Reader convertor |
DOC2_WORDML | Word ML convertor |
DOC2_WORDPERFECT_8 | WordPerfect 8 convertor |
DOC2_WORDPERFECT_10 | WordPerfect 10 convertor |
DOC2_WORDPAD | Word Pad convertor |
DOC2_INFOPATH | Info Path convertor |
DOC2_EBOOK | eBook convertor |
DOC2_PDFA_IMAGE_ON_TEXT | PDF/A Image on text. |
DOC2_PDFA_TEXT_ONLY | PDF/A Text only. |
DOC2_WORD_2007 | Microsoft Word Document Format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.). |
DOC2_EXCEL_2007 | Microsoft Excel Spreadsheet Format (XLSX). |
The enumeration is used by: