Programming with the LEADTOOLS Forms Recognition Processing Engine

The LEADTOOLS Forms Recognition and Processing engine provides developers with a comprehensive set of advanced tools to create form recognition and processing applications with minimal coding. It is fast, accurate, and reliable.

Overview
Forms Recognition
Forms Processing
Speed Processing
Low-Level Forms Recognition
Low-Level Forms Processing

Overview

In general, forms recognition and processing systems perform the following steps:

Creating form attributes
Comparing a form to a master form
Aligning the form
Processing the form

The LEADTOOLS AutoFormsEngine is an optimized implementation of the recognition and processing system. Built upon the Low-Level engine, it automatically creates form attributes, compares it with the Master Forms in the repository, and processes the form fields. LEADTOOLS AutoFormsEngine also gives you the option to run in multi-threaded mode to speed up recognition and processing, taking advantage of multi-core processors. The Leadtools.Forms.Auto namespace provides a rich set of classes, interfaces, and methods that will reduce the implementation time of actions such as:

Run: There are Run methods that can both recognize and process a form at the same time. With a single line of code you can run and get back recognition and processing result reports. Running methods, especially in multi-threaded mode, produce faster results than calling recognition and processing separately.
Recognize Forms: Form recognition is implemented by running comparisons to the Master Forms in a repository. The decision is based on the returned confidence value. When the confidence value is above the minimum confidence value needed, the search is stopped to save time with unnecessary form comparisons.
Recognize Pages: Pages can be searched to find which type of form it is a part of. Recognition creates a page's attributes and compares them with all Master Form pages in the repository to find the one with the confidence value above the minimum specified.
Process Forms: The AutoFormsEngine automatically aligns the recognized form and processes its fields. The fields will be updated with the process results.
Process Pages: The AutoFormsEngine automatically aligns the recognized page and processes its fields. The fields will be updated with the process results.
Calculate the Minimum Confidence Value The minimum confidence value is used to determine whether the form or page recognized corresponds to one of the Master Forms in the repository. Setting a value speeds up the search (recognition process) because there is no need to continue to compare forms once a confidence value above the minimum value is found.
Generate Master Form Attributes: Master Form attributes are generated in a manner consistent with the Object Managers being used.

While most ECM (Enterprise Content Management) systems may take advantage of both recognition and processing; each process, recognition, or processing, has a very specific task in a typical workflow.

Forms Recognition

Forms Recognition is the process of automatically identifying the name, type, and ID of any unknown form without human intervention. As long as a master form exists for the form being recognized, the LEADTOOLS Recognition Engine can quickly and accurately distinguish it from an unlimited number of predefined master forms. The engine uses an extremely accurate algorithm to extract the unique features (attributes) of each master form (single or multi-page) and stores them in an XML file. This file is portable and efficient so you no longer need to store all of the original images for your master forms, thus freeing up unnecessary disk space. After you have created master forms for all of the forms you expect to process, you will be able to fully automate the recognition process for all forms no matter which source (archival, scanner, etc.) or resolution is used; whether it is deformed or computer-generated, etc.

The LEADTOOLS Recognition Engine is the industry-leading recognition engine. With it, programmers can fine-tune the engine for the specific types of forms expected to be processed. There are many factors to consider when creating a master form's attributes, including the text, barcodes, and unique objects in the form.

LEADTOOLS has created unique sub-engines ("Object Managers" as referred to by the SDK), to handle all of these different factors. Object Managers make it posssible to choose the factors which should be considered when creating master form attributes. You can use a single Object Manager, or a group of them. Each manager has a unique purpose: hence, choosing the appropriate manager will increase the performance and accuracy of the forms recognition. For example, if all the forms are expected to have unique barcodes, most likely only the Barcode Manager would be needed. Other managers can be used as well, but the Barcode Manager would be all that is necessary (potentially saving the processing time spent using other, unnecessary engines).

In addition to automatically creating form attributes through the different Object Managers, the engine has an optional feature to highlight important information in the form, such as the company or form name. No matter which object manager is used, the engine provides you with comprehensive results about the recognition, including a confidence level for each form. The Forms Recognition Engine provides the following "Object Managers":

OCR Manager (requires a LEADTOOLS OCR Engine) - The OCR Manager uses OCR to extract the text features from a form to create the form's attributes. The OCR manager can be used with any OCR Engine LEADTOOLS provides such as the Advantage Engines. The OCR Manager is the optimal manager and is capable of recognizing forms which were scanned under several different conditions (resolution, alignment, etc.) from the conditions used to scan the master form. It uses an internal algorithm capable of calculating the amount of scale and shift in the unidentified form to provide a complete automatic alignment solution.
Barcode Manager (requires a LEADTOOLS Barcode Engine) - The Barcode Manager uses Barcode recognition technology to extract the barcode features from a form to create the form's attributes. This manager is capable of accurately recognizing forms in fractions of a second (even with larger sizes of images). The Barcode Manager can be used with any Barcode Engine LEADTOOLS provides, including both 1D (EAN/UPC, Code 128, etc.) and 2D (DataMatrix, PDF417, QR, etc.) barcodes. The Barcode Manager uses the image's resolution to calculate the alignment so it is ideal for recognizing forms which can have different resolutions, but similar scales and shifts. Since most forms already contain some type of unique barcode, the Barcode Manager is a perfect fit for most scenarios.
Default Manager (No add-on required) - The Default Manager extracts special object features such as lines and inverted text from a form to create the form's attributes. This manager is useful for simple forms which have unique lines and other objects. While accurate, the Barcode and OCR Managers should be used for optimal performance and accuracy. The Default Manager uses image resolution to calculate the alignment. Consequently, it is ideal for recognizing forms which are generated at different resolutions, but have similar scales and shifts.

Forms Recognition basically works by creating a FormRecognitionAttributes object for each Master Form and form you would like to recognize, and then comparing the attributes to see which Master Form matches each form with the highest confidence. The following is an outline of the general steps involved in performing Form Recognition on one or more pages.

Create the Master Forms Repository that points to the storage location of the Master Forms.

SetLicense(); 
                 
// Set the name of the folder that contains the Master Forms 
string root = @"C:\Users\Public\Documents\LEADTOOLS Images\Forms\MasterForm Sets\OCR\"; 
                 
RasterCodecs codecs = new RasterCodecs() 
DiskMasterFormsRepository repository = new DiskMasterFormsRepository(codecs, root);

Create the OCR and Barcode engines to be used in the Auto-Forms Engine.

// Create the OCR engine instance, and use LEADTOOLS Advantage OCR engine 
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.Advantage, false) 
                 
// Start up the OCR engine 
ocrEngine.Startup(null, null, null, @"C:\LEADTOOLS 19\Bin\Common\OcrAdvantageRuntime"); 
                 
// Create the Main BarcodeEngine instance 
BarcodeEngine barcodeEngine = new BarcodeEngine();

Create the Auto-Forms Engine using the AutoFormsEngine Class.

// Create the Main AutoFormsEngine instance 
AutoFormsEngine autoEngine = new AutoFormsEngine(repository, ocrEngine, barcodeEngine, 30, 80, true); 
                 
// Set up the recognition options 
autoEngine.RecognizeFirstPageOnly = true; 
autoEngine.MinimumConfidenceKnownForm = 40; 
                 
// Get a list of the files to process 
string[] files = Directory.GetFiles(@"C:\Users\Public\Documents\LEADTOOLS Images\Forms\Images\", "*.tif"); 
                 
// Call the function that contains the main recognition process 
ProcessFiles(autoEngine, files);

Call AutoEngine.Run to both

{                                                   }

recognize and process the form at the same time, or call AutoEngine.RecognizeForm to recognize only the form. The following code shows how to handle the AutoFormsEngine class in a multi-threading application: codeContainer Generic">private static void ProcessFiles(AutoFormsEngine autoEngine, string[] files)   Console.WriteLine("Started Processing Files ...");  // Get the number of files to process  int fileCount = files.Length;   // Event to notify us when all work is finished  using (AutoResetEvent finishedEvent = new AutoResetEvent(false))  {  // Loop through all Files in the given Folder  foreach (string file in files)  {  // Capture the file name here, since we are using an anonymous function  string fileToProcess = file;   // Process it in a thread  ThreadPool.QueueUserWorkItem((state) =>  {  try  {  // Show the name  string name = Path.GetFileName(fileToProcess);  Console.WriteLine("Processing {0}", name);   // Process it  AutoFormsRunResult result = autoEngine.Run(fileToProcess, null);   // Check results  if (result.FormFields != null && result.RecognitionResult.MasterForm != null)  Console.WriteLine(string.Format("  Master Form Found \"{0}\" for {1}", result.RecognitionResult.MasterForm.Name, name));  else  Console.WriteLine(string.Format("  No Master Form Found for {0}", name));  }  catch(Exception ex)  {  Console.WriteLine("Error {0}", ex.Message);  }  finally  {  if (Interlocked.Decrement(ref fileCount) == 0)  {  // We are done, inform the main thread  finishedEvent.Set();  }  }  });  }   // Wait till all operations are finished  finishedEvent.WaitOne();  Console.WriteLine("Finished Processing Files");  }

For a detailed outline to just recognize a form, refer to Steps To Recognize and Process a Form.
For a detailed outline to generate a Master Form, refer to Steps To Generate Master Form and Save it to a Master Repository.

The Leadtools.Forms.Auto namespace provides a set of classes and interfaces for automated forms recognition and processing using multi-threaded processing. Those who want to implement their own multi-threaded process can disable multi-threading in Auto Forms or use the Low-Level Forms design. The framework handles Form Categories using Repositories. Sample implementations for disk-based form repositories are provided in the toollkit. Users can inherit from the framework's interfaces (IMasterForm, IMasterFormsCategory, IMasterFormsRepository) and implement their own custom repository as well.

Forms Processing

Forms Processing is the process of extracting the filled-in data information from predefined fields in a form. Fields are defined per page, so fields for a form comprised of several pages can easily be created and its data extracted from the desired field or page. Each field has the following attributes associated with it:

Name or ID (usually the field name on the form)
Location on the actual form. The location can be specified in several types of units to accommodate different applications.
Type of field (text, check box, image)

Field information can be processed regardless of image resolution, scale, or other form generation characteristics. No matter which field type is being used, the engine provides comprehensive results of the processing, including a confidence value for each result. The Forms Processing Engine provides the following field types:

Text Field - Text Fields are used to read text characters from the form. The characters can be letters, numbers, punctuation, or symbols. Both handwritten and machine printed characters are supported. The exact level of support provided for text fields and languages depends on which OCR Module is being used. For example, handwritten fields would require the ICR Module while machine-printed fields would require an OCR Module.
Barcode Field - Barcode Fields are used to read barcode information from the form. The exact level of support for barcode fields depends on which Barcode Module is being used. For example, 1D barcode fields require the 1D Barcode Module while 2D (DataMatrix, PDF417, QR) barcode fields require the 2D Barcode Module.
Image Field - Image fields are used to extract specific images from the form. These images can be logos, stamps, finger prints, etc.
OMR Field - OMR fields are used to read check mark information (check box, radio button, etc.) from the form. The results indicate whether an area was checked or selected. OMR Fields require OMR support to be unlocked.
Table Field - Table fields are used to read invoice information from the form. Table fields can be any OcrFormField type, Text, or Omr. Table length cannot be fixed: table length is automatically extended during processing to detect all rows of the table.

In addition to the above predefined Field Types, the Processing Engine allows you to create your own custom fields for any unique needs you may have.

How to Speed up Forms Recognition and Processing

AutoFormsEngine

Low-Level Forms Recognition

Low-Level Forms Recognition makes it possible to design custom algorithms for recognition and forms comparisons. Whereas the High-Level Forms Recognition searches for the first form that meets or exceeds the Minimum Confidence value, the Low-Level Forms Recognition performs the search over all master forms in the repository to find the form with the highest confidence value. Note that this requires all the master forms in the repository to be loaded.

The general steps involved in performing Form Recognition on one or more pages are shown in the following outline.

FormRecognitionEngine

Forms Recognition Engine

RecognitionObjectsManager

Master Form attributes can be loaded and saved to disk using the GetData and SetData methods. In most cases, you should save all master form attributes to disk. Then, when recognizing filled forms, load each master form attributes file and compare it with the attributes of the form being recognized to see which returns the highest confidence value. Note that the search has to be performed over all master forms in the repository. For a simple tutorial on using Forms Recognition, refer to Recognizing Forms.

Low-Level Forms Processing

Low-Level Forms Processing makes it possible to customize alignment and processing. The general steps involved in performing Form Processing on one or more pages is shown in the following outline.

FormProcessingEngine.Pages.Add

Process

GetFormAlignment

Fields can be loaded and saved to disk using the LoadFields and SaveFields methods. In most cases, you should save all of the fields for each master form to disk. Then, when processing filled forms, load the appropriate form fields from file for use in the FormProcessingEngine. For a simple tutorial on using Forms Processing, refer to Processing Forms.

SDK Definitions

Attributes
Unique features of a Master Form used to identify filled forms in the forms recognition process.

Barcode Manager
An Object Manager which creates attributes based on barcode fields in the master form.

Confidence
Value from 0 to 100 representing the amount of confidence in the results. A value of "100" means complete confidence while a value of "0" means no confidence.

Default Manager
An Object Manager which creates attributes based on unique objects such as lines and inverted text in the master form.

Exclude region
An area which has no features or attributes necessary for form recognition.

Field
A pre-defined area on a recognized form from which text, barcode, check box, image, or custom data is extracted.

Filled Data
Any data a user adds to a form in a predefined field. Using the LEADTOOLS Forms Processing Engine, this data can be extracted from a recognized form.

Filled Form
A Master Form containing filled data. The Forms Recognition and Processing Engine is used to uniquely identify the forms and extract the data from its fields.

Form
A filled form which needs to be recognized and/or processed.

Form Alignment
Information necessary to align a recognized form with its corresponding master form.

Form Category
A collection or logical grouping of similar Master Forms in a Form Repository. A Form Category can contain Master Forms and/or sub categories.

Forms Processing
The process of extracting user-filled data from pre-defined fields in a recognized form.

Forms Recognition
The process of identifying a filled form with that of a Master Form.

Forms Repository
A storage system for Form Categories. This is the top level of the collection.

Include region
An area which has features or attributes necessary for form recognition.

Master Form / Template
An unfilled or blank form containing attributes unique to that form. Master Forms can be single or multi-page. Master Forms attributes are generated by the different Object Managers.

Object Manager
Unique sub-engines which generate attributes for a specific master form.

Ocr Manager
An Object Manager which created attributes based on text fields in the master form.

Page Alignment
Information necessary to align a recognized form page with the corresponding page from the master form.

Region of interest
An area which has very important attributes necessary for form recognition. These regions are used to highlight important features such as the company or form name.

Top ^