VoyForums
[ Show ]
Support VoyForums
[ Shrink ]
VoyForums Announcement: Programming and providing support for this service has been a labor of love since 1997. We are one of the few services online who values our users' privacy, and have never sold your information. We have even fought hard to defend your privacy in legal cases; however, we've done it with almost no financial support -- paying out of pocket to continue providing the service. Due to the issues imposed on us by advertisers, we also stopped hosting most ads on the forums many years ago. We hope you appreciate our efforts.

Show your support by donating any amount. (Note: We are still technically a for-profit company, so your contribution is not tax-deductible.) PayPal Acct: Feedback:

Donate to VoyForums (PayPal):

Login ] [ Contact Forum Admin ] [ Main index ] [ Post a new message ] [ Search | Check update time | Archives: 1 ]
Subject: Re: Search function not working


Author:
RTT
[ Next Thread | Previous Thread | Next Message | Previous Message ]
Date Posted: 16:22:08 04/25/06 Tue
In reply to: SR 's message, "Search function not working" on 20:38:37 04/24/06 Mon

The current PDFE text extraction routines are very weak, alpha stage, and for some pdf's can not extract text correctly. You can check that using the "text viewer" in PDF View mode. Obvious I'm going to try to improve the text extraction in upcoming versions, but right now I’m focusing my coding time doing other improvements.

Also note that the best way to search for text content is to use the DBSearch scan mode, IndexedContents option, after index the pdf's text using the IndexTextWords batch tool. You are not going to increase the text extraction accuracy, but speed up subsequent full text search operations.

The best way to search for pdf is, after spending some time in QuickInfoEdit mode (Edit>QuickInfoEdit) entering meaningfull information into pdf Infofields, to use the DBSearch scan mode, InfoFields mode. You get search results more fast and more accurate.

>has anyone else had a problem where PDF explorer runs
>a search through the contents of multiple PDF files
>and comes up with nothing. but then when you open a
>PDF file the phrase you were looking for was there the
>whole time!?
>
>this is happening to me. i use foxit pdf reader to
>view pdf files. when the pdf i need it opened in foxit
>and i search for the phrase i want, it comes up no
>problem. but in pdf explorer when running a "content"
>search, it comes up with nothing.
>
>the only thing i use pdf explorer for is to batch
>search pdf files for phrases. but now its proven
>itself inaccurate and ultimately useless

[ Next Thread | Previous Thread | Next Message | Previous Message ]


[ Contact Forum Admin ]


Forum timezone: GMT-8
VF Version: 3.00b, ConfDB:
Before posting please read our privacy policy.
VoyForums(tm) is a Free Service from Voyager Info-Systems.
Copyright © 1998-2019 Voyager Info-Systems. All Rights Reserved.