Document truncation
A text search server limits the number of Unicode characters that it indexes for each text document. When a large text document exceeds this limit, the document is truncated in the text search index.
The default value for the number of Unicode characters that are allowed for each text document is 10 million. For a rich text document, this limit is applied after the document is transformed to plain text.
If a text document is truncated during the parsing stage, the document is partially indexed and you receive a warning that some documents were not processed correctly or completely. Details about the warning are written to the event table that is created for the text search index. Text that follows the point at which the limit is reached is not indexed and cannot be searched.
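The effect of the limit can be illustrated with a short Python sketch. It is not the server's implementation: the names MAX_CHARS, would_truncate, and split_at_limit are hypothetical, the value 10,000,000 is the default stated above, and the sketch assumes that one Unicode character corresponds to one code point, which is how Python counts string length. The input is assumed to be the plain text that remains after any rich text transformation.

```python
# Hypothetical illustration only; the real limit is enforced by the text search server.
MAX_CHARS = 10_000_000  # default per-document limit described above


def would_truncate(plain_text, limit=MAX_CHARS):
    """Return True if the plain text has more characters than the limit."""
    # len() on a Python str counts Unicode code points (an assumption about
    # how the server counts characters).
    return len(plain_text) > limit


def split_at_limit(plain_text, limit=MAX_CHARS):
    """Split plain text into the part that would be indexed and the part
    that would be dropped and therefore not searchable."""
    return plain_text[:limit], plain_text[limit:]


if __name__ == "__main__":
    # A small limit is used here so that the example stays readable.
    indexed, dropped = split_at_limit("alpha beta gamma", limit=10)
    print(repr(indexed))
    print(repr(dropped))
    print(would_truncate("alpha beta gamma", limit=10))
```

A check of this kind, run before documents are loaded, could help identify documents whose trailing content will not be searchable, so that they can be split or shortened in advance.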