OCR and Extract Data from Scanned Invoices and Forms using C#

Posted on 2022-08-18 by Ryan Fritz

OCR

In our previous blog, we discussed the benefits of using the LEADTOOLS Document SDK to easily recognize hundreds of different forms and invoices. This post will build off of that, showcasing how to then automatically extract information and data from the recognized forms.

First, we will identify and then process the fields. The fields are the various locations on the form that were set when we created the master form (or template) for each invoice or specific form type. Our processing engine will look for data in those designated areas. There are many types of data fields including text, images, tables, OMR bubbles, and barcodes. Our engine loads and processes all the fields for the developer to then easily write how they want that information distributed. It is often good practice to check to see what type of field was processed and then write code for that type accordingly.

Continue Reading...

Automatically Recognize Invoices from Different Vendors using C#

Posted on 2022-08-16 by Ryan Fritz

Invoice

When working in a paperless office, businesses receive hundreds of different forms and invoices from different vendors. It is often a major pain point and bottleneck to manually find, extract, and store all the necessary information. Thankfully with the LEADTOOLS Document SDK and our patented Forms Recognition technology, everything can be easily automated to improve workflow productivity and efficiency.

With LEADTOOLS, users need only to create templates, also known as master forms, for each of the different invoices or form types. These master forms are then stored in a repository and used to automatically recognize which type of filled form is currently being processed.

Continue Reading...

Invoice Recognition and Processing Video

Posted on 2021-03-08 15:59:04 by Zac Ferraresi

Some things you just have to see to believe, and we think that LEADTOOLS V19's new Invoice Recognition and Processing SDK is one of them. Now you can in our latest YouTube video!

If you have any familiarity with Forms Recognition and/or OCR, you may understand that those technologies typically rely on the master template and the document you wish to recognize having the same layout and proportions. Tax forms and applications fit this nicely since as long as the documents can get cleaned up and aligned, the zones and fields on the form to recognize will easily be found. Invoices, bills and other unstructured documents need special processing which LEADTOOLS masterfully developed and exposed in this new programmer-friendly toolkit. We hope you enjoy the video!

Continue Reading...

Forms Processing API Tutorial: Recognize and Process a Form

Posted on 2021-01-12 14:13:10 by Zac Ferraresi

FormsRecognitionImage

Automate your data entry problems away with a state-of-the-art forms processing API. Whether you're working with customer surveys, tax documents, or billing records every industry uses forms daily to conduct business. Moving data from a paper to a digital medium can be a time consuming hassle. That's why LEADTOOLS has developed exclusive capabilities that extract text from images containing any combination of machine-printed text, handwritten text, MICR, MRZ, and OMR fields. LEADTOOLS will automatically detect and recognize everything! Below are the main steps to quickly and accurately process various form types regardless of how the data is formatted.

Continue Reading...

Tutorial: Auto Recognize and Process a Form

Posted on 2020-04-24 by Zac Ferrasi

Processing forms and invoices are a large part of many companies day-to-day workflow. When a copy of a form is filled out by a person and scanned back into the company, that information then needs to be extracted. Many OCR engines struggle to extract this information since the form could have been scanned in at a lower resolution than the original, could have noise introduced by the scanner, or the fields may be unstructured and dynamically generated. Thankfully, the LEADTOOLS Forms Recognition SDK takes care of all of that and eliminates the need for any additional manual processing. Powered by LEAD’s patented machine learning algorithms, these advanced forms recognition and OCR libraries handle both structured and unstructured forms and can help save companies valuable time and money.

Continue Reading...
LEADTOOLS Blog

LEADTOOLS Powered by Apryse,the Market Leading PDF SDK,All Rights Reserved