When I crawl MS word docs from Filenet, the "field codes" from the table of contents are also crawled and appear in the snippet in ICA search results. I see tags like "toc12345678" in the Enterprise Search results. Is there anyway to exclude these field codes from the crawl? Thanks.
Pinned topic remove Word "field codes" in crawl?
dschoppmann 270002P2U58 Posts
Re: remove Word "field codes" in crawl?2013-12-12T15:44:54ZThis is the accepted answer. This is the accepted answer.
Have you installed the latest OmniFind fix pack? There were fixes in the text extraction component which perhaps avoid the tags you mentioned. Latest fix pack for OmniFind 9.1 is 5.
JohnMiedema 270002S8RA3 Posts
Re: remove Word "field codes" in crawl?2013-12-12T15:58:35ZThis is the accepted answer. This is the accepted answer.
- dschoppmann 270002P2U5
Thanks. I am using the Enterprise Search in ICA, so I probably should have posted this question in the ICA community. But your response suggests to me a better question. These Word tags should not appear if ES/ICA is working correctly, right? I.e., this is a _fix_, i.e., not an expected behavior? I have applied all the ICA fix packs except the recently released Fix Pack 4. If FP4 does not fix it, I should open a PMR. Does this sound right?