Example Configurations

For a small collection that you'd like high recall for, you might use 4 streams that differ only in their stemmers: none (boost for exact match), case (boost for exact characters), delanguage (boost for only being pluralized/accented differently), english (get better recall by pulling in other words).

Note: This information does not apply to lexical analysis language streams. See Lexical Analysis Streams for more information.

To set a global stream configuration, use the graphical interface on the Configuration > Indexing pane.

To set an entire document to use a different stream definition than the global, add the stream definition at the document level:

<document>
  <vse-index-stream segmenter="japanese" kb="japanese-indexing" stem="depluralize"/>
  ...content in Japanese...
</document>

To set a content to use a different stream definition than the document or global level configuration, add the stream definition at the content level:

<content name="full-name">
  <vse-index-stream segmenter="literal" />
  Joe Developer
</content>