Please Help me

This topic and its replies were posted before the current version of LEADTOOLS was released and may no longer be applicable.

#1 Posted : Thursday, June 21, 2007 5:18:18 AM(UTC)

hemantbkulkarni

Groups: Registered
Posts: 9

I am sending you an image.

Many small images are embedded in this single image.

I want to identify all of these small images and store them in a database, each under a name taken from the text on that small image.

That is, I want to save each small image as a separate image and tag it with its name.

Is it possible to identify these images and save them separately using the LEADTOOLS OCR Document Imaging tool?

If so, please guide me on how to proceed, or send me some examples.

Thanks in advance.

With Regards

Hemant BK


#2 Posted : Thursday, June 21, 2007 11:06:48 AM(UTC)

GregR

Groups: Registered, Tech Support, Administrators
Posts: 764

Your post did not have any files attached. Please attach the image so I can better understand your problem.

NOTE: If you are attaching a project or file to the forums...

1.) Zip the file(s) up.

2.) Make sure it is less than 10 MB. If it is larger, please send an email to support@leadtools.com or give us your email address and we'll send you FTP instructions.

3.) Remove ANY AND ALL unlock codes and LEADTOOLS DLLs.

4.) Do not click the preview button; if you do, the attachment will not show up when you post.

If you do not wish to post your file(s) on the forum since they are publicly viewable, please send an email to support@leadtools.com and make sure that you include a link to this forum post.

#3 Posted : Thursday, June 21, 2007 8:12:23 PM(UTC)

hemantbkulkarni

Groups: Registered
Posts: 9

I am sending the image as a ZIP file and as a RAR file. The file size is 4.66 MB.

Still, when I post it, it does not get posted.

What could be the problem?

Thanks in advance

With Regards

Hemant BK

#4 Posted : Thursday, June 21, 2007 8:12:25 PM(UTC)

hemantbkulkarni

Groups: Registered
Posts: 9

I am sending the image again.

Thanks in advance

with regards

Hemant BK

File Attachment(s):

Image.rar (2,885kb) downloaded 30 time(s).

#5 Posted : Friday, June 22, 2007 8:08:35 AM(UTC)

GregR

Groups: Registered, Tech Support, Administrators
Posts: 764

The OCR engine will not be able to separate the images or aid in separating them because it deals with finding text.

If you want to separate the small images (the photo IDs, I presume), you will need to write a search algorithm that can detect where the IDs are located and then crop them into their own images. Unfortunately, LEAD does not have anything that can do this, since all of the IDs are different.

My first idea would be to scan horizontally and/or vertically and look for the thick, black frame around the ID photos. The frame is pretty distinguishable and looks to be the same size on each ID, so you should be able to make some pretty good guesses as to what is frame and what isn't, based on the DPI and on the number of consecutive pixels of the same color (or color range) in a row.

Probably your biggest difficulty with this kind of algorithm is that your image is grayscale, so the frame will not always be perfectly black. The second biggest will be any ID pictures that are themselves dark.
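
As a rough illustration of the scanning idea above, here is a minimal Python sketch. It is not LEADTOOLS code. It assumes the page has already been loaded into a plain list of rows of grayscale values (0 = black, 255 = white), and the names `dark_runs`, `frame_rows`, `frame_columns` and `frame_box`, as well as the default `threshold` and `min_run` values, are made up for the example. A threshold is used instead of an exact match on black because the frame in a grayscale scan is rarely pure black.

```python
from typing import List, Optional, Sequence, Tuple

Image = List[List[int]]  # rows of grayscale pixels, 0 = black, 255 = white


def dark_runs(line: Sequence[int], threshold: int = 80,
              min_run: int = 5) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, for every run of at least
    min_run consecutive pixels whose value is <= threshold."""
    runs = []
    start = None
    for i, value in enumerate(line):
        if value <= threshold:
            if start is None:
                start = i  # a dark run begins here
        else:
            if start is not None and i - start >= min_run:
                runs.append((start, i))
            start = None
    # Close a run that reaches the end of the line.
    if start is not None and len(line) - start >= min_run:
        runs.append((start, len(line)))
    return runs


def frame_rows(image: Image, threshold: int = 80, min_run: int = 5) -> List[int]:
    """Indices of rows containing a long dark horizontal run
    (candidates for the top or bottom edge of a frame)."""
    return [y for y, row in enumerate(image) if dark_runs(row, threshold, min_run)]


def frame_columns(image: Image, threshold: int = 80, min_run: int = 5) -> List[int]:
    """Indices of columns containing a long dark vertical run
    (candidates for the left or right edge of a frame)."""
    columns = zip(*image)  # transpose so each column can be scanned like a row
    return [x for x, col in enumerate(columns) if dark_runs(col, threshold, min_run)]


def frame_box(image: Image, threshold: int = 80,
              min_run: int = 5) -> Optional[Tuple[int, int, int, int]]:
    """Outer extent (left, top, right, bottom), right/bottom exclusive,
    of all candidate edges, or None if no edge was found."""
    rows = frame_rows(image, threshold, min_run)
    cols = frame_columns(image, threshold, min_run)
    if not rows or not cols:
        return None
    return (min(cols), min(rows), max(cols) + 1, max(rows) + 1)
```

`frame_box` only reports the extent of everything it finds, so on a page with several IDs the candidate rows and columns would still have to be grouped into one box per ID. In practice `min_run` would be derived from the DPI and the expected frame size, and dark photos will produce false candidates that need to be filtered out.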

#6 Posted : Sunday, June 24, 2007 7:50:58 PM(UTC)

hemantbkulkarni

Groups: Registered
Posts: 9

Thanks for your suggestions.

May I ask you for a few more suggestions?

My requirement is this.

I have a big image (the one I sent) that comprises several small images, each with text above it giving the name of the image.

I want to save these small images as separate images, each named with the text that appears above it in the sent image.

There are two possibilities.

1. While scanning, I scan these images as separate images by defining the regions in a scanning template.

Then I define the regions and read the text on the images using OCR/ICR tools.

Finally, I associate the images with the names read as text (using ICR or OCR) and save them in the database.

2. If not during scanning, then I process the scanned image afterwards, i.e. define regions, separate the images, read the text on each image using OCR or ICR, associate the names with the images, and store them in the database.

In short, the requirement can be divided into three parts:

1. Extracting the Images. (Small images)

2. Reading the text on the images by defining the regions.

3. Associate these images with the names read and store them in the database.

The first part is carried out either while scanning, by defining a scanning template in which I can define regions for these images, or after scanning, by processing the image to extract the small images.

In your opinion, which one is easier?

Should I save these as separate images while scanning by defining a template (in which I can define regions)? If so, how should I proceed? Does LEADTOOLS support this type of scanning?

Can you suggest any image processing tools that can help me separate these images?

Is it possible to read the text above the images using the LEADTOOLS OCX?

Please throw some light on this issue.

With Regards

Hemant BK

#7 Posted : Monday, June 25, 2007 4:29:56 AM(UTC)

GregR

Groups: Registered, Tech Support, Administrators
Posts: 764

I would think that using your scanner to "cut" your large document up would be easiest. That way you only have one photo ID and one set of information to worry about. However, not all scanners will have this ability. You will need to use the ICAP_FRAMES capability and the TW_IMAGELAYOUT structure. For more information on these and how they are used, you will need to check the TWAIN 1.9 documentation (http://twain.org/downloads.shtm). Within LEADTOOLS, you will be able to access the scanner by using the GetCapability and SetCapability methods.

There is no OCX interface for OCR, but you can use the OCR COM interface. The LEADRaster.Bitmap property should be interchangeable with the LEAD Main Control's Bitmap property.
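
Whichever way the regions are obtained, the remaining steps from the thread (crop each photo, read its name, store the pair in a database) can be sketched roughly as follows. This is a hedged Python illustration, not LEADTOOLS code: the image is again assumed to be a plain list of grayscale rows, `read_name` is a hypothetical placeholder for whatever OCR/ICR call is actually used, and the `id_photos` table and its columns are invented for the example.

```python
import sqlite3
from typing import Callable, Iterable, List, Tuple

Image = List[List[int]]          # rows of grayscale pixels, 0..255
Box = Tuple[int, int, int, int]  # left, top, right, bottom (right/bottom exclusive)


def crop(image: Image, box: Box) -> Image:
    """Copy the pixels inside box into a new image."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def to_bytes(image: Image) -> bytes:
    """Flatten an image row by row into raw 8-bit pixel data."""
    return bytes(value for row in image for value in row)


def store_id_photos(conn: sqlite3.Connection, image: Image,
                    regions: Iterable[Tuple[Box, Box]],
                    read_name: Callable[[Image], str]) -> int:
    """For each (photo_box, text_box) pair, crop the photo, read the name
    from the text region, and insert both into the database.
    read_name stands in for the real OCR/ICR step (hypothetical)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS id_photos "
        "(name TEXT, width INTEGER, height INTEGER, pixels BLOB)"
    )
    count = 0
    for photo_box, text_box in regions:
        photo = crop(image, photo_box)
        name = read_name(crop(image, text_box))
        width = len(photo[0]) if photo else 0
        conn.execute(
            "INSERT INTO id_photos (name, width, height, pixels) VALUES (?, ?, ?, ?)",
            (name, width, len(photo), to_bytes(photo)),
        )
        count += 1
    conn.commit()
    return count
```

Passing the OCR step in as a function keeps the cropping and storage logic independent of the OCR engine, so the same loop works whether the regions come from a scanning template or from a frame search on the full page.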
