- options
- A combination of one or more OcrXmlOutputOptions enumeration members that specify the XML generation options.
Visual Basic (Declaration) | |
---|---|
Overloads Overridable Function SaveXml( _ ByVal options As OcrXmlOutputOptions _ ) As String |
Visual Basic (Usage) | Copy Code |
---|---|
|
C# | |
---|---|
virtual string SaveXml( OcrXmlOutputOptions options ) |
C++/CLI | |
---|---|
virtual String^ SaveXml( OcrXmlOutputOptions options ) |
Parameters
- options
- A combination of one or more OcrXmlOutputOptions enumeration members that specify the XML generation options.
Return Value
A String object containing the XML data.This example recognize a page then process the result XML data.
Visual Basic | Copy Code |
---|---|
Private Sub SaveAndProcessXmlExample() |
C# | Copy Code |
---|---|
private void SaveAndProcessXmlExample() |
To save the output document as XML to a disk file or a .NET stream, use IOcrDocument.SaveXml(string fileName, OcrXmlOutputOptions options) and IOcrDocument.SaveXml(Stream stream, OcrXmlOutputOptions options).
Each IOcrPage object in the Pages collection of this IOcrDocument object holds its recognition data internally. This data is used by this method to generate the final output document.
Typical OCR operation using the IOcrEngine involves starting up the engine. Creating a new IOcrDocument object using the IOcrDocumentManager.CreateDocument method before adding the pages into it and perform either automatic or manual zoning. Once this is done, you can use the IOcrPage.Recognize method of each page to collect the recognition data and store it internally in the page. After the recognition data is collected, you use the various IOcrDocument.Save methods to save the document to its final format as well as IOcrDocument.SaveXml to save as XML.
You can also use the IOcrPage.RecognizeText method to recognize and return the recognition data as a simple String object.
You can use IOcrDocument.SaveXml as many times as required to save the document to multiple formats. You can also continue to add and recognize pages (through the IOcrPage.Recognize method after you save the document.
For each IOcrPage that is not recognized (the user did not call IOcrPage.Recognize and the value of the page IOcrPage.IsRecognized is still false) the IOcrDocument will insert an empty page into the final document.
To get the low level recognition data including the recognized characters and their confidence, use IOcrPage.GetRecognizedCharacters instead.
The IOcrDocument interface implements IDisposable, hence you must dispose the IOcrDocument object as soon as you are finished using it. Disposing an IOcrDocument object will free all the pages stored inside its IOcrDocument.Pages collection.
Target Platforms: Microsoft .NET Framework 3.0, Windows XP, Windows Server 2003 family, Windows Server 2008 family
Reference
IOcrDocument InterfaceIOcrDocument Members
Overload List
DocumentFormat
IOcrDocumentManager Interface
IOcrDocument.Save
IOcrDocument.SaveXml
IOcrPage.Recognize
IOcrEngine Interface
OcrEngineManager Class
OcrEngineType Enumeration
Programming with Leadtools .NET OCR
Files to be Included with Your Application