To run FileNet® P8 components in a non-English environment, certain conditions must be met. Review the following considerations and tasks, organized by administrator role, if you plan to run FileNet P8 in a non-English environment.
By default, Content Platform Engine uses Oracle Outside In Search Export for text extraction on PDF documents. For right-to-left language PDF documents, you can optionally use Apache PDFBox technology for text extraction. To use PDFBox, you set a JVM property on Content Platform Engine. For more information, see the topics in .
For information on how IBM® Content Search Services extracts text from documents that are sent to it by IBM Content Collector, see .