Error Fetching URL when Indexing PDF Files

ID: Q188957


The information in this article applies to:

This problem occurs when you use Microsoft Site Server version 3.0 and the Adobe PDF IFilter version 1.1 (beta) and prior.

The third-party products discussed here are manufactured by vendors independent of Microsoft; we make no warranty, implied or otherwise, regarding these products' performance or reliability.

REFERENCES

Microsoft Site Server Search documentation.


SYMPTOMS

When you are indexing .pdf files with Site Server Search, the gatherer log may report the following error message for one or more files:

Error fetching URL.


CAUSE

The IFilter provided by Adobe is limited to access by a single thread. By default, Site Server Search requests and processes multiple documents simultaneously. This may result in errors if the Portable Document Format file (PDF) filter installed is version 1.1 beta or earlier.


WORKAROUND

To work around this problem, Site Server Search can be configured to request a single document at a time. This slows down the indexing of a site but has the desired effect of limiting access to the IFilter to a single thread at a time. You can accomplish this as follows:

  1. Create a virtual directory and place your PDF files in it. Be sure to allow directory browsing on this directory.


  2. In the properties for the Catalog Builder, select the Timing tab and limit the site that contains the virtual directory to one document at a time.


  3. Create a Search project that will do a Web crawl of the PDF virtual directory. Set the crawl to one (1) page hop allowed.


  4. Build the catalog.


Additional query words: kbnokeyword


Keywords          : prodsitesrv3 prodsrch 
Version           : WINNT:3.0
Platform          : winnt 
Issue type        : kbprb 

Last Reviewed: July 14, 1999