The OcrDocumentFormatType Enumeration is available in LEADTOOLS Document and Medical Imaging toolkits.
The document formats supported by the LEADTOOLS OCR toolkit.
Visual Basic (Declaration) | |
---|---|
<DataContractAttribute(Namespace="http://Leadtools.Services.Forms.DataContracts/2009/01", Name="OcrDocumentFormatType")> Public Enum OcrDocumentFormatType Inherits System.Enum Implements IComparable, IConvertible, IFormattable |
Visual Basic (Usage) | |
---|---|
Dim instance As OcrDocumentFormatType |
C# | |
---|---|
[DataContractAttribute(Namespace="http://Leadtools.Services.Forms.DataContracts/2009/01", Name="OcrDocumentFormatType")] public enum OcrDocumentFormatType : System.Enum, IComparable, IConvertible, IFormattable |
C++/CLI | |
---|---|
[DataContractAttribute(Namespace="http://Leadtools.Services.Forms.DataContracts/2009/01", Name="OcrDocumentFormatType")] public enum class OcrDocumentFormatType : public System.Enum, IComparable, IConvertible, IFormattable |
Member | Description |
---|---|
Doc | Microsoft Word 2003 document format (DOC). |
Docx | Microsoft Word 2007 document format (DOCX). |
Emf | Windows Enhanced Meta File (EMF). EMF format does not support multi-page documents. Therefore, only the last page will be used in the final document. |
Html | HTML output. HTML 4.0 can set the exact position and size of objects. Use this output format with full formatting. |
Pdf | The target document should be PDF v1.4. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. |
Pdf12 | The target document should be PDF v1.2. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. |
Pdf12ImageOverText | The target document should be PDF v1.2. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. The raster image is overlaid on top of the resulting PDF document. |
Pdf13 | The target document should be PDF v1.3. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. |
Pdf13ImageOverText | The target document should be PDF v1.3. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. The raster image is overlaid on top of the resulting PDF document. |
Pdf15 | The target document should be PDF v1.5. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. |
Pdf15ImageOverText | The target document should be PDF v1.5. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. The raster image is overlaid on top of the resulting PDF document. |
PdfA | The target document should be PDF/A. PDF/A is a subset of PDF obtained by leaving out PDF features not suited to long-term archiving. The resulting document is 100 percent self-contained: all of the information necessary for displaying the document in the same manner every time is embedded in the file. Saving with the PDF/A document type may result in larger output file sizes. |
PdfAImageOverText | The target document should be PDF/A. PDF/A is a subset of PDF obtained by leaving out PDF features not suited to long-term archiving. The resulting document is guaranteed to look exactly like the original version when viewed on the target machine. Saving with the PDF/A document type may result in larger output file sizes. The raster image is overlaid on top of the resulting PDF document. |
PdfImageOverText | The target document should be PDF v1.4. PDF is generally not suited for long-term preservation. The PDF format may contain resources (such as fonts) that may not exist on the viewing machine. Hence, font substitution may occur, resulting in a document that may not look exactly like the original version. The raster image is overlaid on top of the resulting PDF document. |
Rtf | Microsoft Rich Text Format (RTF). |
TextAnsi | The output text document type is ANSI (contains 8-bit ANSI characters only). |
TextUnicode | The output text document type is UNICODE (contains 16-bit UNICODE characters). |
Xls | Microsoft Excel 2003 document format (XLS). |
Xps | Microsoft XML Paper Specification (XPS). |
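The member descriptions above encode a few properties that client code may want to query: the PDF version, whether the raster image is overlaid, and the single-page limitation of EMF. The following is a minimal sketch of such lookups, not part of the LEADTOOLS API. The `OcrFormatInfo` class and its method names are hypothetical, and the enum is a local stand-in declared only so the sketch compiles without the LEADTOOLS assemblies; its numeric values are not taken from the toolkit.

```csharp
using System;

namespace OcrFormatSketch
{
    // Local stand-in for Leadtools.Services.Forms.DataContracts.OcrDocumentFormatType.
    // Member names come from the table above; the numeric values are NOT from LEADTOOLS.
    public enum OcrDocumentFormatType
    {
        Doc, Docx, Emf, Html,
        Pdf, Pdf12, Pdf12ImageOverText, Pdf13, Pdf13ImageOverText,
        Pdf15, Pdf15ImageOverText, PdfA, PdfAImageOverText, PdfImageOverText,
        Rtf, TextAnsi, TextUnicode, Xls, Xps
    }

    // Hypothetical helper class; every method only restates what the table says.
    public static class OcrFormatInfo
    {
        // Returns the PDF version stated in the table, or null when the table
        // states none (PDF/A members and all non-PDF members).
        public static string PdfVersion(OcrDocumentFormatType format)
        {
            switch (format)
            {
                case OcrDocumentFormatType.Pdf12:
                case OcrDocumentFormatType.Pdf12ImageOverText:
                    return "1.2";
                case OcrDocumentFormatType.Pdf13:
                case OcrDocumentFormatType.Pdf13ImageOverText:
                    return "1.3";
                case OcrDocumentFormatType.Pdf:
                case OcrDocumentFormatType.PdfImageOverText:
                    return "1.4";
                case OcrDocumentFormatType.Pdf15:
                case OcrDocumentFormatType.Pdf15ImageOverText:
                    return "1.5";
                default:
                    return null;
            }
        }

        // True for the members whose description mentions the overlaid raster image.
        public static bool IsImageOverText(OcrDocumentFormatType format)
        {
            switch (format)
            {
                case OcrDocumentFormatType.Pdf12ImageOverText:
                case OcrDocumentFormatType.Pdf13ImageOverText:
                case OcrDocumentFormatType.Pdf15ImageOverText:
                case OcrDocumentFormatType.PdfAImageOverText:
                case OcrDocumentFormatType.PdfImageOverText:
                    return true;
                default:
                    return false;
            }
        }

        // True only for Emf, the one member the table flags as not supporting
        // multi-page documents. The table makes no claim about other members.
        public static bool IsFlaggedSinglePage(OcrDocumentFormatType format)
        {
            return format == OcrDocumentFormatType.Emf;
        }
    }
}
```

In real code the stand-in enum would be replaced by the toolkit's own type. A caller could, for example, check `OcrFormatInfo.IsFlaggedSinglePage(format)` before sending a multi-page job, since only the last page survives in EMF output.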
The Leadtools.Services.Forms.ServiceContracts.IOcrService.Recognize method allows you to save the recognized page data to a final document format.
Some of the document formats require a special key to unlock. When using such formats, you must first unlock the corresponding support through the configuration files shipped with our services.
The following table lists the document formats and corresponding support types which must be unlocked in order to be used:
Document Format | Support Type |
---|---|
Pdf, PdfImageOverText, Pdf12, Pdf12ImageOverText, Pdf13, Pdf13ImageOverText, Pdf15, Pdf15ImageOverText, Xps | Set the value of OcrPlusPdfOutputKey in the Leadtools.Services.Forms.ServiceImplementations.dll.config file when using the OcrEngineType.Plus engine, or OcrProfessionalPdfOutputKey when using the OcrEngineType.Professional engine. |
PdfA | Set the value of OcrPlusPdfLeadOutputKey in the Leadtools.Services.Forms.ServiceImplementations.dll.config file when using the OcrEngineType.Plus engine, or OcrProfessionalPdfLeadOutputKey when using the OcrEngineType.Professional engine. |
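The table above can be read as a lookup from a format and engine pair to the name of the configuration key that must be set. The sketch below restates that lookup in code so a service host could, for instance, report which key is missing. It is only an illustration: the `OcrUnlockKeys` class and `GetRequiredKeyName` method are hypothetical, both enums are local stand-ins (the real `OcrEngineType` may have members other than `Plus` and `Professional`), and the method returns null for any format the table does not list.

```csharp
using System;

namespace OcrUnlockKeySketch
{
    // Local stand-ins so the sketch compiles without LEADTOOLS; numeric values are NOT from the toolkit.
    public enum OcrEngineType { Plus, Professional }

    public enum OcrDocumentFormatType
    {
        Doc, Docx, Emf, Html,
        Pdf, Pdf12, Pdf12ImageOverText, Pdf13, Pdf13ImageOverText,
        Pdf15, Pdf15ImageOverText, PdfA, PdfAImageOverText, PdfImageOverText,
        Rtf, TextAnsi, TextUnicode, Xls, Xps
    }

    // Hypothetical helper that mirrors the support-type table row by row.
    public static class OcrUnlockKeys
    {
        // Returns the name of the config key the table associates with the
        // given format and engine, or null if the table lists no key for it.
        public static string GetRequiredKeyName(OcrDocumentFormatType format, OcrEngineType engine)
        {
            bool plus = engine == OcrEngineType.Plus;
            switch (format)
            {
                // First table row: PDF v1.2 through v1.5 variants and XPS.
                case OcrDocumentFormatType.Pdf:
                case OcrDocumentFormatType.PdfImageOverText:
                case OcrDocumentFormatType.Pdf12:
                case OcrDocumentFormatType.Pdf12ImageOverText:
                case OcrDocumentFormatType.Pdf13:
                case OcrDocumentFormatType.Pdf13ImageOverText:
                case OcrDocumentFormatType.Pdf15:
                case OcrDocumentFormatType.Pdf15ImageOverText:
                case OcrDocumentFormatType.Xps:
                    return plus ? "OcrPlusPdfOutputKey" : "OcrProfessionalPdfOutputKey";

                // Second table row: PdfA only, exactly as the table lists it.
                case OcrDocumentFormatType.PdfA:
                    return plus ? "OcrPlusPdfLeadOutputKey" : "OcrProfessionalPdfLeadOutputKey";

                // Every other member, including PdfAImageOverText, is not in the table.
                default:
                    return null;
            }
        }
    }
}
```

A null result here means only that the table names no key for that format, not that the format is guaranteed to work without one.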
System.Object
System.ValueType
System.Enum
Leadtools.Services.Forms.DataContracts.OcrDocumentFormatType
Target Platforms: Microsoft .NET Framework 3.0, Windows 2000, Windows XP, Windows Server 2003 family, Windows Server 2008 family, Windows Vista, Windows 7