LEADTOOLS Support
General
General Questions
Leadtools Forms Recognition on vectorial PDF
#1
Posted
:
Tuesday, May 25, 2021 3:36:33 AM(UTC)
Groups: Registered
Posts: 40
Thanks: 5 times
Hi,
how Forms Recognition works on vectorial pdf files?
It rasterize the file and then try to read with ocr or try to read the text directly without using ocr engine?
#2
Posted
:
Tuesday, May 25, 2021 10:42:31 AM(UTC)
Groups: Registered, Tech Support, Administrators
Posts: 163
Was thanked: 9 time(s) in 9 post(s)
We have more information regarding the Forms Recognition toolkit and its workflow available in our online documentation.
https://www.leadtools.co...processing-workflow.htmlNote part of the forms recognition process is comparing the input document against a repository of templates to determine the appropriate template to use. This process requires rasterization of the input image without regard to its original format - this process discards any SVG information present in the document. Once the document is matched against the corresponding form, OCR is used to extract the areas of interest defined in the template.
If you're explicitly using SVG documents which will all have the same layout, consider using the GetText() method. This method retrieves the SVG content from the specified boundary (if present) but applies OCR otherwise.
https://www.leadtools.com/help/sdk/v21/dh/dox/documentpage-gettext().htmlEdited by user Wednesday, May 26, 2021 7:41:18 AM(UTC)
| Reason: Fixing link
Nick Crook
Developer Support Engineer
LEAD Technologies, Inc.
1 user thanked Nick for this useful post.
LEADTOOLS Support
General
General Questions
Leadtools Forms Recognition on vectorial PDF
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.