DOC2_FORMATTYPE

typedef enum
{
   DOC2_TEXT,
   DOC2_UTEXT,
   DOC2_FORMATTED_TEXT,
   DOC2_UFORMATTED_TEXT,
   DOC2_TEXT_LINEBREAKS,
   DOC2_UTEXT_LINEBREAKS,
   DOC2_TEXT_CSV,
   DOC2_TEXT_UCSV,
   DOC2_PDF,
   DOC2_PDF_IMAGE_SUBSTITUTES,
   DOC2_PDF_IMAGE_ON_TEXT,
   DOC2_PDF_EDITED,
   DOC2_XML,
   DOC2_HTML_3_2,
   DOC2_HTML_4_0,
   DOC2_RTF_6,
   DOC2_RTF_97,
   DOC2_RTF_2000,
   DOC2_RTF_WORD_2000,
   DOC2_WORD_2000,
   DOC2_WORD_97,
   DOC2_EXCEL_97,
   DOC2_EXCEL_2000,
   DOC2_PPT_97,
   DOC2_PUB_98,
   DOC2_MICROSOFT_READER,
   DOC2_WORDML,
   DOC2_WORDPERFECT_8,
   DOC2_WORDPERFECT_10,
   DOC2_WORDPAD,
   DOC2_INFOPATH,
   DOC2_EBOOK,
   DOC2_PDFA_IMAGE_ON_TEXT,
   DOC2_PDFA_TEXT_ONLY,
   DOC2_WORD_2007,
   DOC2_EXCEL_2007,
} DOC2_FORMATTYPE;

The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.

Value

Meaning

DOC2_TEXT

Simple text output format

DOC2_UTEXT

Unicode text output format

DOC2_FORMATTED_TEXT

Retain the layout of the page by inserting extra spaces

DOC2_UFORMATTED_TEXT

Same as Formatted Text, but using Unicode characters

DOC2_TEXT_LINEBREAKS

Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs

DOC2_UTEXT_LINEBREAKS

Same as text with line breaks, but using Unicode characters.

DOC2_TEXT_CSV

Write the recognized text as a table (Comma delimited text file) that can be read by Excel

DOC2_TEXT_UCSV

Same as Text CSV, but using Unicode characters

DOC2_PDF

Adobe PDF file. Text only.

DOC2_PDF_IMAGE_SUBSTITUTES

Adobe PDF file with image substitutes

DOC2_PDF_IMAGE_ON_TEXT

Adobe PDF with image on text

DOC2_PDF_EDITED

Adobe PDF edited

DOC2_XML

XML output format

DOC2_HTML_3_2

HTML 3.2 output format

DOC2_HTML_4_0

HTML 4.0 output format

DOC2_RTF_6

RTF 6

DOC2_RTF_97

RTF that can only be interpreted by Microsoft Word 97 and up

DOC2_RTF_2000

RTF that can only be interpreted by Microsoft Word 2000 and up

DOC2_RTF_WORD_2000

RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up

DOC2_WORD_2000

Word file that can only be interpreted by Microsoft Word 2000 and up

DOC2_WORD_97

Word file that can only be interpreted by Microsoft Word 97 and up

DOC2_EXCEL_97

Microsoft Excel 97 binary file

DOC2_EXCEL_2000

Microsoft Excel 2000 binary file

DOC2_PPT_97

Microsoft Power Point 97

DOC2_PUB_98

Microsoft Publisher 98

DOC2_MICROSOFT_READER

Microsoft Reader convertor

DOC2_WORDML

Word ML convertor

DOC2_WORDPERFECT_8

WordPerfect 8 convertor

DOC2_WORDPERFECT_10

WordPerfect 10 convertor

DOC2_WORDPAD

Word Pad convertor

DOC2_INFOPATH

Info Path convertor

DOC2_EBOOK

eBook convertor

DOC2_PDFA_IMAGE_ON_TEXT

PDF/A Image on text.

DOC2_PDFA_TEXT_ONLY

PDF/A Text only.

DOC2_WORD_2007

Microsoft Word 2007 document format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.).

DOC2_EXCEL_2007

Microsoft Excel 2007 format (XLSX).

Comments

This enumerated type is used by the following structure:

RESULTOPTIONS2