Select the search type
  • Site
  • Web
Search
You are here:  Support/Forums
Support

Bring2mind Forums

Searching PDF content - disappear over time
Last Post 07/06/2008 11:41 AM by Peter Donker. 5 Replies.
Sort:
PrevPrev NextNext
You are not authorized to post a reply.
Author Messages
ant s
New Member
New Member
Posts:3


--
06/26/2008 4:46 PM

Hi there,

The PDF content in our site seems to disappear over time. Once we have re-index the content, we can search for the PDF content. However, one day later, the same keyword will not return the PDF files.

Have anyone come across this issue?

Additionally, we found that the PDF content indexing only works if we run the indexing on the whole application. It does not work if we index only the portal.

Thanks,

Peter Donker
Veteran Member
Veteran Member
Posts:4536


--
06/27/2008 6:54 PM
Hi Ant,
No this is new to me. PDFs are 'scanned using Adobe's own iFilter. Whether you're using Lucene or Indexing Service provider for indexing. Having said that: you could try to switch to Indexing Service to see if this improves it.
Peter
ant s
New Member
New Member
Posts:3


--
07/01/2008 1:11 AM
Thanks for the suggestion Peter - I might give it a try later.

It seems really strange that I have to re-index the whole installation to pick up the PDF content - especially considering I only have 1 portal on the installation.

I'm suspecting this is what causing the content disappear from the index - someone or a scheduler kick off the portal re-indexing, and this make the content gone missing from the index.
Peter Donker
Veteran Member
Veteran Member
Posts:4536


--
07/03/2008 9:46 AM
What version DMX are you using? There was such an issue in an older version I remember now.
Peter
ant s
New Member
New Member
Posts:3


--
07/04/2008 12:34 AM

Hi Peter,

It's reported as 4.2.3 from the Module Definition. I know one of the developer had previously installed the older version and the module had since updated.

Could this be the case as well? If so, how do I check?

Thanks

Peter Donker
Veteran Member
Veteran Member
Posts:4536


--
07/06/2008 11:41 AM
What is in the Module Definitions should be accurate. So you're working on the latest version. Then I have no explanation for the Lucene index getting corrupted. I cannot replicate that.
It'll get harder to find something. Maybe you can try the following: recycle the application pool in IIS and delete the DMX/Lucene drectories under your portal directory. Now you've really removed all Lucene stuff. Go to your website and make it reindex the complete installation. The various Lucene directories should be recreated now with brand new files in them.
Peter
You are not authorized to post a reply.