←Select platform

AutoPreprocess(OcrAutoPreprocessPageCommand,OcrProgressCallback) Method

Summary
Perform auto image processing clean up on all the pages in the OCR document to enhance the quality of the pages before starting its recognition.
Syntax
C#
Objective-C
C++/CLI
Java
Python
- (BOOL)autoPreprocessPages:(LTOcrAutoPreprocessPageCommand)command 
                   progress:(nullable LTOcrProgressHandler)progressHandler 
                      error:(NSError **)error 
public void autoPreprocess(OcrAutoPreprocessPageCommand command, 
                           OcrProgressListener callback) 
def AutoPreprocess(self,callback): 

Parameters

command
The preprocessing command to perform.

callback
Optional callback to show operation progress.

Remarks

This method will iterate through all the pages in this OCR document and run IOcrPage.AutoPreprocess on each page.

You can use this method to deskew, rotate or invert the image according to  command. By performing auto pre-processing on the page, you can improve the image quality of draft mode faxes.

You can use the OcrProgressCallback to show the operation progress or to abort it. For more information and an example, refer to OcrProgressCallback.

You should call this method prior to calling Recognize.

To perform auto-preprocessing to a range of pages in the OCR document use AutoPreprocess(OcrAutoPreprocessPageCommand command, int firstPageIndex, int lastPageIndex, OcrProgressCallback callback).

This member only works with memory-based documents and will throw an exception otherwise. For more information, refer to IOcrDocumentManager.CreateDocument and Programming with the LEADTOOLS .NET OCR.

Example
C#
Java
using Leadtools; 
using Leadtools.Codecs; 
using Leadtools.Ocr; 
using Leadtools.Document.Writer; 
using Leadtools.Forms.Common; 
using Leadtools.ImageProcessing.Core; 
 
public void PageCollectionExamples() 
{ 
   // For this example, we need a multi-page TIF file. 
   // Create a muti-page TIF from Ocr1.tif, Ocr2.tif, Ocr3.tif and Ocr4.tif 
   string tifFileName = Path.Combine(LEAD_VARS.ImagesDir, "Ocr.tif"); 
   if (File.Exists(tifFileName)) 
      File.Delete(tifFileName); 
 
   using (RasterCodecs codecs = new RasterCodecs()) 
   { 
      for (int i = 0; i < 4; i++) 
      { 
         string pageFileName = Path.Combine(LEAD_VARS.ImagesDir, string.Format("Ocr{0}.tif", i + 1)); 
         using (RasterImage image = codecs.Load(pageFileName)) 
            codecs.Save(image, tifFileName, RasterImageFormat.CcittGroup4, 1, 1, 1, -1, CodecsSavePageMode.Append); 
      } 
   } 
 
   string pdfFileName = Path.Combine(LEAD_VARS.ImagesDir, "Ocr.pdf"); 
 
   // Create an instance of the engine 
   using (IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD)) 
   { 
      // Start the engine using default parameters 
      ocrEngine.Startup(null, null, null, LEAD_VARS.OcrLEADRuntimeDir); 
 
      // Create an OCR document 
      using (IOcrDocument ocrDocument = ocrEngine.DocumentManager.CreateDocument()) 
      { 
         // Load all the pages of the multi-page tif file we created into the form 
         ocrDocument.Pages.AddPages(tifFileName, 1, -1, null); 
         Console.WriteLine("{0} pages added to the document", ocrDocument.Pages.Count); 
 
         // Auto-zone 
         ocrDocument.Pages.AutoZone(null); 
 
         // Recognize 
         ocrDocument.Pages.Recognize(null); 
 
         // Save 
         ocrDocument.Save(pdfFileName, DocumentFormat.Pdf, null); 
      } 
 
      // Shutdown the engine 
      // Note: calling Dispose will also automatically shutdown the engine if it has been started 
      ocrEngine.Shutdown(); 
   } 
} 
 
static class LEAD_VARS 
{ 
   public const string ImagesDir = @"C:\LEADTOOLS23\Resources\Images"; 
   public const string OcrLEADRuntimeDir = @"C:\LEADTOOLS23\Bin\Common\OcrLEADRuntime"; 
} 
 
import java.io.File; 
import java.io.IOException; 
import java.net.URI; 
import java.net.URISyntaxException; 
 
import org.junit.*; 
import org.junit.runner.JUnitCore; 
import org.junit.runner.Result; 
import org.junit.runner.notification.Failure; 
import static org.junit.Assert.*; 
 
import leadtools.*; 
import leadtools.codecs.*; 
import leadtools.document.writer.DocumentFormat; 
import leadtools.imageprocessing.core.DeskewCommand; 
import leadtools.imageprocessing.core.DeskewCommandFlags; 
import leadtools.ocr.OcrDocument; 
import leadtools.ocr.OcrEngine; 
import leadtools.ocr.OcrEngineManager; 
import leadtools.ocr.OcrEngineType; 
 
 
public void IOcrPageCollectionsPageCollectionExamples() { 
   final String LEAD_VARS_IMAGES_DIR = "C:\\LEADTOOLS23\\Resources\\Images"; 
   final String LEAD_VARS_OCR_LEAD_RUNTIME_DIR = "C:\\LEADTOOLS23\\Bin\\Common\\OcrLEADRuntime"; 
 
   // For this example, we need a multi-page TIF file. 
   // Create a muti-page TIF from Ocr1.tif, Ocr2.tif, Ocr3.tif and Ocr4.tif 
   var tifFileName = combine(LEAD_VARS_IMAGES_DIR, "Ocr.tif"); 
   var file = new File(tifFileName); 
   if (file.exists()) 
      file.delete(); 
 
   RasterCodecs codecs = new RasterCodecs(); 
   for (var i = 0; i < 4; i++) { 
      var pageFileName = combine(LEAD_VARS_IMAGES_DIR, "Ocr" + (i + 1) + ".tif"); 
      var image = codecs.load(pageFileName); 
      codecs.save(image, tifFileName, RasterImageFormat.CCITT_GROUP4, 1, 1, 1, -1, CodecsSavePageMode.APPEND); 
   } 
   codecs = null; 
 
   var pdfFileName = combine(LEAD_VARS_IMAGES_DIR, "Ocr.pdf"); 
 
   codecs = new RasterCodecs(); 
   // Create an instance of the engine 
   OcrEngine ocrEngine = OcrEngineManager.createEngine(OcrEngineType.LEAD); 
 
   // Start the engine using default parameters 
   ocrEngine.startup(codecs, null, null, LEAD_VARS_OCR_LEAD_RUNTIME_DIR); 
   assertTrue("Engine unsuccessfully started", ocrEngine.isStarted()); 
 
   // Create an OCR document 
   OcrDocument ocrDocument = ocrEngine.getDocumentManager().createDocument(); 
 
   // Load all the pages of the multi-page tif file we created into the form 
   ocrDocument.getPages().addPages(codecs.load(tifFileName), 1, -1, null); 
   System.out.println(ocrDocument.getPages().size() + " pages added to the document"); 
 
   // Auto-zone 
   ocrDocument.getPages().autoZone(null); 
 
   // Recognize 
   ocrDocument.getPages().recognize(null); 
 
   // Save 
   ocrDocument.save(pdfFileName, DocumentFormat.PDF, null); 
   assertTrue("File unsuccessfully saved", (new File(tifFileName)).exists()); 
 
   // Shutdown the engine 
   // Note: calling Dispose will also automatically shutdown the engine if it has 
   // been started 
   ocrEngine.shutdown(); 
} 
Requirements

Target Platforms

Help Version 23.0.2024.4.19
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2024 LEAD Technologies, Inc. All Rights Reserved.

Leadtools.Ocr Assembly
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2023 LEAD Technologies, Inc. All Rights Reserved.