3 replies. Latest post: 2013-12-12T16:08:17Z by dschoppmann
JohnMiedema

remove Word "field codes" in crawl?

2013-11-15T14:27:25Z

When I crawl MS Word documents from FileNet, the "field codes" from the table of contents are also crawled and appear in the snippets in ICA search results. I see tags like "toc12345678" in the Enterprise Search results. Is there any way to exclude these field codes from the crawl? Thanks.

Updated on 2013-11-15T14:28:37Z by JohnMiedema
  • dschoppmann

    Re: remove Word "field codes" in crawl?

    2013-12-12T15:44:54Z in response to JohnMiedema

    Have you installed the latest OmniFind fix pack? The text extraction component received fixes that may prevent the tags you mentioned from appearing. The latest fix pack for OmniFind 9.1 is Fix Pack 5.

    http://www-01.ibm.com/support/docview.wss?uid=swg24035824

    • JohnMiedema

      Re: remove Word "field codes" in crawl?

      2013-12-12T15:58:35Z in response to dschoppmann

      Thanks. I am using Enterprise Search in ICA, so I probably should have posted this question in the ICA community. But your response suggests a better question: these Word tags should not appear if ES/ICA is working correctly, right? In other words, this is a defect to be fixed, not expected behavior? I have applied all the ICA fix packs except the recently released Fix Pack 4. If FP4 does not fix it, I should open a PMR. Does this sound right?

      • dschoppmann

        Re: remove Word "field codes" in crawl?

        2013-12-12T16:08:17Z in response to JohnMiedema

        Yes, please install FP4 first. If that does not help, please open a PMR.
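
For readers who hit the same symptom and need a stopgap while waiting on a fix pack or a PMR, here is a minimal sketch of stripping the stray tokens from already-extracted text. It is not an ICA or OmniFind feature, and nothing in this thread confirms where such a filter could be plugged into the crawl. It assumes the "toc12345678" tags are Word table-of-contents bookmark names ("_Toc" followed by digits) and the PAGEREF field instructions that reference them. The function name and the six-digit minimum are hypothetical choices made for illustration.

```python
import re

# Assumption: Word TOC entries reference bookmarks through field instructions
# such as: PAGEREF _Toc384721905 \h
PAGEREF_FIELD = re.compile(r"\bPAGEREF\s+_Toc\d+(?:\s+\\[a-z])*", re.IGNORECASE)

# Assumption: leftover bare bookmark names look like "_Toc12345678" or
# "toc12345678". The six-digit minimum is a guess to limit false positives.
TOC_BOOKMARK = re.compile(r"\b_?toc\d{6,}\b", re.IGNORECASE)


def strip_toc_field_codes(text: str) -> str:
    """Remove TOC field-code residue from extracted text (hypothetical helper)."""
    text = PAGEREF_FIELD.sub(" ", text)   # drop whole PAGEREF instructions first
    text = TOC_BOOKMARK.sub(" ", text)    # then drop any bare bookmark names
    return re.sub(r"\s+", " ", text).strip()  # collapse the gaps left behind


if __name__ == "__main__":
    print(strip_toc_field_codes("Introduction toc12345678 Overview"))
```

The pattern requires a word boundary before "toc" and digits after it, so ordinary words such as "protocol" are left alone. If the tags still appear after FP4, the sample tokens this matches may also be useful evidence to attach to the PMR.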