Agent for Windows file systems crawler - configuration properties
The Agent for Windows file systems crawler crawls remote Microsoft Windows file systems.
To crawl a Microsoft Windows file system, you must first install an agent server on the remote Microsoft Windows server. The agent server is common with IBM Watson® Explorer Content Analytics. Use the install image Agent for Windows File Systems. For more information, see Agent for Windows file systems crawlers.
The Create crawler: Agent for Windows file systems screen is where you enter the configuration parameters for this crawler.
Crawler Properties
- Crawler name
- The name of the crawler. Alphanumeric characters, hyphens, underscores, and spaces are allowed.
- Crawler description
- A description of the crawler.
- Advanced options
-
- Time to wait between retrieval requests
- The time is expressed in milliseconds.
- Maximum number of active crawler threads
- The maximum number of active crawler threads.
- Maximum document size
- The maximum size expressed in kilobytes. The maximum value is 131,071 kilobytes.
- When the crawler session is started
- Specifies which content to crawl.
Data Source Properties
- Host name
- The host name of the remote Microsoft Windows server.
- Port for authentication
- The port for authenticating which is configured when the agent server is installed.
- Port for data transfer
- The port for transferring data which is configured when the agent server is installed.
- User name
- The user name to connect the agent server which is configured when the agent server is installed.
- Password
- The password of the specified user.
Crawl space Properties
You can find and add multiple crawl spaces for a file system. For instructions, see Finding and adding crawl spaces in a Windows file system.
Crawler plug-in
Data source crawler plug-ins are Java™ applications that can change the content or metadata of crawled documents. You can configure a data source crawler plug-in for all non-web crawler types. For more information, see Crawler plug-ins.
- Enable the crawler plug-in
- Enable this option when you use the crawler plug-in.
- Plug-in class name
- The class name for the crawler plug-in.
- Plug-in class path
- The JAR file location of the crawler plug-in. The folder that contains the JAR file must be mounted so it is available. For more information, see Providing access to the local filesystem from Watson Explorer oneWEX.