This topic and its replies were posted before the current version of LEADTOOLS was released and may no longer be applicable.
#1
Posted
:
Sunday, January 1, 2012 6:18:46 AM(UTC)
Groups: Registered
Posts: 7
Apologies if this seems to be an easy one. I'd like to know what is the best way to determine if two PDF files are the same in terms of content (images and text).
I've tried PDFDocument and DocumentReader, but I'm not sure if there are others out there that are just as effective.
Any help will be greatly appreciated. Thanks!
#2
Posted
:
Monday, January 2, 2012 12:11:16 AM(UTC)
Groups: Registered, Tech Support
Posts: 1,326
Was thanked: 1 time(s) in 1 post(s)
Regarding comparing images, I would recommend looking at the CorrelationCommand class. For more information, please see the following page:
http://www.leadtools.com/help/leadtools/v175/dh/po/leadtools.imageprocessing.core~leadtools.imageprocessing.core.correlationcommand.html
Note that to use the CorrelationCommand class, you need to load the PDF files as raster images using the RasterCodecs.Load() method, and then pass the images to the CorrelationCommand class.
Also, the other option is using the RasterImage.GetRow or GetRowColumn methods, and comparing sections of pixels within the image to see if they're the same.
Thanks,
Maen Badwan
LEADTOOLS Technical Support
#3
Posted
:
Wednesday, January 4, 2012 12:35:52 AM(UTC)
Groups: Registered
Posts: 7
Thanks for the reply Maen! That sounds like a great idea! I'll try it out and then I'll let you know how it does.
You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.