Document Clean up

Scanned documents often contain undesired elements, such as specks, dots, black borders, lines, and hole-punch marks. LEADTOOLS provides powerful functions that can clean up documents by detecting and removing these elements. This class of LEAD functions each supply a "hook" into the API decision making process, allowing the programmer to obtain information about every detected element, and to use the information to tailor the function to meet his needs. Each function can also optionally return a region corresponding to any proposed changes, allowing for end-user "preview" capabilities. These new features, along with the deskew, dilation, erosion, min and max filters give LEADTOOLS Document Imaging SDK the power and flexibility you need when working with scanned documents.

The document clean-up functionality includes:

Border removal - Remove any or all image borders (black borders surrounding scanned document), with the ability to specify tolerance level for amount of noise, maximum border size, and variability of border size.

Dot removal - Remove dots, specks, and blobs from documents, specifying min and max dot size.

Hole Punch removal - Remove hole-punch artifacts, specifying min and max hole size and count.

Line removal - Remove horizontal and vertical lines from scanned images, with the ability to specify minimum line length, maximum line width, maximum gap for dashed or otherwise broken lines, tolerance values for line width variance, and other parameters to ensure that valid image obects are preserved.

Smoothing - Smooth the nicks and bumps on scanned text characters (or any black-on-white objects), specifying the minimum bump size.

Text Inversion - Areas on scanned image of white text (or line objects) on a black background are detected and inverted to black-on-white. You can specify minimum size and min and max percent-black of regions to be detected.

 

Supported Platforms

Document Clean up API

Document Clean up Class Library

Document Clean up .NET and COM