VoyForums
[ Show ]
Support VoyForums
[ Shrink ]
VoyForums Announcement: Programming and providing support for this service has been a labor of love since 1997. We are one of the few services online who values our users' privacy, and have never sold your information. We have even fought hard to defend your privacy in legal cases; however, we've done it with almost no financial support -- paying out of pocket to continue providing the service. Due to the issues imposed on us by advertisers, we also stopped hosting most ads on the forums many years ago. We hope you appreciate our efforts.

Show your support by donating any amount. (Note: We are still technically a for-profit company, so your contribution is not tax-deductible.) PayPal Acct: Feedback:

Donate to VoyForums (PayPal):

Login ] [ Contact Forum Admin ] [ Main index ] [ Post a new message ] [ Search | Check update time | Archives: [1] ]
Subject: Identify Non-text-searchable PDFs


Author:
Paul Wright
[ Next Thread | Previous Thread | Next Message | Previous Message ]
Date Posted: 12:14:14 04/18/03 Fri

PDF Explorer is a wonderful program but, like the other PDF programs I have found so far, doesn't do the one thing I need to have done. I am hoping you can either easily add to a new version or can suggest a solution otherwise.

We have many PDFs stored on our server. Some are text searchable and some are just images. There appears to be no easy way to distinguish between them in mass and mark the non-searchables for further processing. The main way to identify them is to open each document and then attempt to extract text. If that fails, it is non-searchable. It has been suggested that we can do a text search for the word Font to identify searchable PDFs. While it seems to work, we have found no automatic way to mark them to provide differentiation. (We could move them to other folders but that would create a certain amount of chaos in itself.) We are working in a Windows 2000 environment.

I had hoped that PDF Explorer would provide some indication but find that PDFs that show no data in PDF Explorer may still contain text data.

Anyhow, do you have any suggestions for me?

Thanks again for a powerful and excellent program.

-----Paul-----

[ Next Thread | Previous Thread | Next Message | Previous Message ]

Replies:
Subject Author Date
Re: Identify Non-text-searchable PDFsRTT15:09:08 04/18/03 Fri


[ Contact Forum Admin ]


Forum timezone: GMT-8
VF Version: 3.00b, ConfDB:
Before posting please read our privacy policy.
VoyForums(tm) is a Free Service from Voyager Info-Systems.
Copyright © 1998-2019 Voyager Info-Systems. All Rights Reserved.