DOC2_FORMATTYPE

typedef enum 
{ 
   DOC2_TEXT, 
   DOC2_UTEXT, 
   DOC2_FORMATTED_TEXT, 
   DOC2_UFORMATTED_TEXT, 
   DOC2_TEXT_LINEBREAKS, 
   DOC2_UTEXT_LINEBREAKS, 
   DOC2_TEXT_CSV, 
   DOC2_TEXT_UCSV, 
   DOC2_PDF, 
   DOC2_PDF_IMAGE_SUBSTITUTES, 
   DOC2_PDF_IMAGE_ON_TEXT, 
   DOC2_PDF_EDITED, 
   DOC2_XML, 
   DOC2_HTML_3_2, 
   DOC2_HTML_4_0, 
   DOC2_RTF_6, 
   DOC2_RTF_97, 
   DOC2_RTF_2000, 
   DOC2_RTF_WORD_2000, 
   DOC2_WORD_2000, 
   DOC2_WORD_97, 
   DOC2_EXCEL_97, 
   DOC2_EXCEL_2000, 
   DOC2_PPT_97, 
   DOC2_PUB_98, 
   DOC2_MICROSOFT_READER, 
   DOC2_WORDML, 
   DOC2_WORDPERFECT_8, 
   DOC2_WORDPERFECT_10, 
   DOC2_WORDPAD, 
   DOC2_INFOPATH, 
   DOC2_EBOOK, 
   DOC2_PDFA_IMAGE_ON_TEXT, 
   DOC2_PDFA_TEXT_ONLY, 
   DOC2_WORD_2007, 
   DOC2_EXCEL_2007, 
} DOC2_FORMATTYPE; 

The DOC2_FORMATTYPE enumerated type lists the document format types that are possible.

Value

Meaning

DOC2_TEXT

Simple text output format

DOC2_UTEXT

Unicode text output format

DOC2_FORMATTED_TEXT

Retain the layout of the page by inserting extra spaces

DOC2_UFORMATTED_TEXT

Same as Formatted Text, but using Unicode characters

DOC2_TEXT_LINEBREAKS

Insert line breaks at the end of lines instead of only inserting them at the end of the paragraphs

DOC2_UTEXT_LINEBREAKS

Same as text with line breaks, but using Unicode characters.

DOC2_TEXT_CSV

Write the recognized text as a table (Comma delimited text file) that can be read by Excel

DOC2_TEXT_UCSV

Same as Text CSV, but using Unicode characters

DOC2_PDF

Adobe PDF file. Text only.

DOC2_PDF_IMAGE_SUBSTITUTES

Adobe PDF file with image substitutes

DOC2_PDF_IMAGE_ON_TEXT

Adobe PDF with image on text

DOC2_PDF_EDITED

Adobe PDF edited

DOC2_XML

XML output format

DOC2_HTML_3_2

HTML 3.2 output format

DOC2_HTML_4_0

HTML 4.0 output format

DOC2_RTF_6

RTF 6

DOC2_RTF_97

RTF that can only be interpreted by Microsoft Word 97 and up

DOC2_RTF_2000

RTF that can only be interpreted by Microsoft Word 2000 and up

DOC2_RTF_WORD_2000

RTF/ Word file that can only be interpreted by Microsoft Word 2000 and up

DOC2_WORD_2000

Word file that can only be interpreted by Microsoft Word 2000 and up

DOC2_WORD_97

Word file that can only be interpreted by Microsoft Word 97 and up

DOC2_EXCEL_97

Microsoft Excel 97 binary file

DOC2_EXCEL_2000

Microsoft Excel 2000 binary file

DOC2_PPT_97

Microsoft Power Point 97

DOC2_PUB_98

Microsoft Publisher 98

DOC2_MICROSOFT_READER

Microsoft Reader convertor

DOC2_WORDML

Word ML convertor

DOC2_WORDPERFECT_8

WordPerfect 8 convertor

DOC2_WORDPERFECT_10

WordPerfect 10 convertor

DOC2_WORDPAD

Word Pad convertor

DOC2_INFOPATH

Info Path convertor

DOC2_EBOOK

eBook convertor

DOC2_PDFA_IMAGE_ON_TEXT

PDF/A Image on text.

DOC2_PDFA_TEXT_ONLY

PDF/A Text only.

DOC2_WORD_2007

Microsoft Word Document Format (DOCX) (this format requires .NET Framework 3.0 and Microsoft Open XML Format SDK 1.0.).

DOC2_EXCEL_2007

Microsoft Excel Spreadsheet Format (XLSX).

Comments

This enumerated type is used by the following structure:

RESULTOPTIONS2

Help Version 19.0.2017.10.27
Products | Support | Contact Us | Copyright Notices
© 1991-2017 LEAD Technologies, Inc. All Rights Reserved.
LEADTOOLS Professional OCR C API Help