VoyForums
[ Show ]
Support VoyForums
[ Shrink ]
VoyForums Announcement: Programming and providing support for this service has been a labor of love since 1997. We are one of the few services online who values our users' privacy, and have never sold your information. We have even fought hard to defend your privacy in legal cases; however, we've done it with almost no financial support -- paying out of pocket to continue providing the service. Due to the issues imposed on us by advertisers, we also stopped hosting most ads on the forums many years ago. We hope you appreciate our efforts.

Show your support by donating any amount. (Note: We are still technically a for-profit company, so your contribution is not tax-deductible.) PayPal Acct: Feedback:

Donate to VoyForums (PayPal):

Login ] [ Contact Forum Admin ] [ Main index ] [ Post a new message ] [ Search | Check update time | Archives: 1 ]
Subject: Re: Suggestion about text in image format


Author:
RTT
[ Next Thread | Previous Thread | Next Message | Previous Message ]
Date Posted: 06:59:15 03/16/06 Thu
In reply to: Nuno Brás 's message, "Suggestion about text in image format" on 02:23:43 03/16/06 Thu

Is already in the TODO list but the achievement of that feature is not a easy job. What for your is a non text pdf file? There are many documents made using image scan processes that have some text added, like page numbers or other foot and header text content. Also, to classify this way the files we need to scan all the pdf pages to see if they contain text or not. The scanning of all the pages is time consuming so making this in the current DiskTree scan process is problematic.
Meanwhile you can use the results of the IndexTextWords batch tool to identify these files. Probably I'm also go to use this batch tool to get that flag ;-)


>Hello,
>
>firstly, VERY GOOD JOB!
>
>Now the suggestion:
>
>I think it is possible to identify the files that are
>partially not explored because they have text in image
>format, probably originated from a bad or old scan
>process. I propose you to make a flag that warns up
>the user that this kind of text was not "explored". Is
>it possible?
>
>Regards,
>
>Nuno Brás

[ Next Thread | Previous Thread | Next Message | Previous Message ]


[ Contact Forum Admin ]


Forum timezone: GMT-8
VF Version: 3.00b, ConfDB:
Before posting please read our privacy policy.
VoyForums(tm) is a Free Service from Voyager Info-Systems.
Copyright © 1998-2019 Voyager Info-Systems. All Rights Reserved.