Once you have added the Form-Based Authentication crawler condition to the search collection that is associated with the resource that you want to crawl, enter the following information in the configuration for that crawler condition:
INSTALL-DIR/data/search-collections/*/COLLECTION-NAME/crawlNUMBER/tmp
The debugging file in this directory has the prefix viv_cvo.
Watson Explorer Engine will automatically handle HTTP redirects of various sorts, but requires explicit information about how to provide the data that is required to log in on a remote resource. You must provide this information in the Form-Based Authentication Steps field for the crawler condition that you are defining.
In order to provide this information in the Form-Based Authentication Steps field, click the xml button. An edit dialog displays in which you can enter form definitions in the following format:
<form xpath="XPATH"> <parameter name="NAME" value="VALUE" /> ... <parameter name="NAME" value="VALUE" /> </form>
The elements and associated attributes in a form definition have the following meaning and requirements:
In order to provide an initial value for the contents of the Form-Based Authentication Steps field, examine the transcript that you captured in Capturing HTTP Transactions. Beginning at the end of the transcript, search backward for the credentials that you had to provide in order to authenticate. You should find these values inside POST data. Enter an equivalent <form>, with any mandatory parameters, in the Form-Based Authentication Steps field, and click OK to save your changes and close the crawler condition that you have just defined.
As an example, suppose that the final form in your transcript has a unique ID of openid_login and that the value that you entered when authenticating are being passed as fields named username and password. The form element that you would specify in the Form-Based Authentication Steps field would look like this:
<form xpath="//form[@id='openid_login']" /> <parameter name="username" value="joe-user" /> <parameter name="password" value="joes-password" /> </form>
The next section explains how to test the form-handler that you just defined.