Wednesday, February 27, 2008

Indexing PDF content in MOSS Sharepoint 2007

Out of the box, MOSS only indexes the meta data of a PDF and not the Text inside it if it is available. To index the full content you need an iFilter which is explained well here Steven Van de Craen's Blog.

We have used Foxit's ifilter as it has an x64 version which our MOSS servers are running on.

After getting this installed I was still having trouble with some files not indexing and found that out of the box MOSS will only index 16Mb files. You can up this using a registry setting on the server - documented here. This works fine except you should be mindful of possible server index time out errors. Mindsharp say that you can increase the timeout value here, but I havnt managed to find this setting in MOSS enterprise yet.

(There is an interesting article on custom Sharepoint Searches here)

No comments: