Detecting Registration Marks

When a user scans a form, perturbations or transformations are added to the image such as rotation, zooming, reduction, and translation (shifting in position). These effects make it difficult to identify the different parts of the form, complicating the detection and recognition phases.

Registration marks can be used to help the identify and localize the different parts of the form. Registration marks are predefined shapes that are added to the original form in specific locations. These marks can then be used to identify the transformations or deformation added to the original form, and then to correct a scanned form's orientation and resize it to its original size.

Currently, there is only one type of registration mark that has been defined for use with scanned forms (more are to be added later). The valid registration mark is a T-shaped figure rotated by 90 degrees counter clockwise. The line width should be greater than 2 pixels (preferably 3 pixels). There are no conditions on line lengths since you provide the values for uWidth, uHeight, uMaxScale, and uMinScale. Use the LBitmap::IsRegMark function to ensure the selected mark is a valid registration mark.

image\T_Shape_Fig.gif

Currently, to be used with the registration marks functions a document image must consist of white objects on a black background.

To use the registration functions, begin with creating the reference document that contains the registration marks in the exact positions on the form that will be used for scanning. The registration marks should be placed in different areas of the form far enough apart on the form that good triangulation can be performed -- but not where they would not always appear on the form in their entirety when the form is scanned. The registration functions are designed to use three designated registration marks as reference points.

After creating the reference document, scan it to obtain a document image. The reference document must be scanned in perfectly: any documents using the reference document for registration will be transformed to match its size, shifting, and rotation exactly.

The registration functions work by finding the (X, Y) positions of three labeled points and comparing them with the positions of these points on the scanned image. With that information it is possible to determine the shifting, rotation and scaling of the scanned image (5 equations with 5 unknowns). The unknowns are:

For each registration mark, designate the searching area for that mark, and specify the maximum allowable shift, rotation and scaling. Each searching area should contain only one registration mark and it is preferable that the mark be located in an otherwise blank portion of the form. This information is designated using a SEARCHMARKS structure and contains the following information:

Members

uType

RGS_T

uWidth

Registration mark's width, in pixels.

uHeight

Registration mark's height, in pixels.

uMinScale

Minimum registration mark percentage scaling to be detected.(for example, 90, 85, or 75, .etc)

uMaxScale

Maximum registration mark percentage scaling to be detected. (for example, 110, 120, or 125, .etc)

rcRect

Rectangular area to be searched for registration marks. Note that if you select too small of a search area, you run the risk the mark will not be detected because of shift, rotation or scaling. However, the larger the area, the more time that is consumed in the search.

uMarkSearchCount

Number of expected registration marks inside the search area. Set this to 1. (Since there must be one registration mark inside each area.)

pMarkDetectedPoints =

(POINT *) malloc(uNumOfMarks * sizeof(POINT)). This will be updated by LBitmap::SearchRegMarks function.

uMarkDetectedCount

Count of detected marks. Set this to 0. This will be updated by LBitmap::SearchRegMarks function.

With this information you can use the LBitmap::GetMarksCenterMass function to compute the center of mass for each of these marks. Save these points to an array.

Now you can scan forms and correct their orientation and size.

The next step is to search for the registration marks in the scanned image. Use the LBitmap::SearchRegMarks function on the scanned image to find the registration marks. If all the registration marks were detected (the uMarkDetectedCount parameter in each SEARCHMARKS structure equals 1), then use the LBitmap::GetMarksCenterMass function to compute the center of mass for each registration mark, and save those to an array of points.

The next step is to determine the shifting, scaling and rotation necessary to re-orient the scanned image. To do this, pass the array of reference points as well as the array of transformed points to the LBitmap::GetTransformationParameters function.

To re-orient and resize the scanned image so it is like the reference image, use the values from the LBitmap::GetTransformationParameters function and pass them to the LBitmap::ApplyTransformationParameters function. This is equivalent to shifting the image using LBitmapBase::Combine, rotating the image using LBitmapBase::Rotate(without resizing), and scaling the image using either LBitmapBase::Size or LBitmapBase::Resize or LBitmapBase::SizeInterpolate. (Note: It is important to do the operations in this order. You will get an incorrect result if you change the order.).

A demo has been prepared to illustrate the use of these functions. For a tutorial, refer to How to Use the Registration Marks Demo

Help Version 23.0.2024.2.29
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2024 LEAD Technologies, Inc. All Rights Reserved.

LEADTOOLS Raster Imaging C++ Class Library Help
Products | Support | Contact Us | Intellectual Property Notices
© 1991-2023 LEAD Technologies, Inc. All Rights Reserved.