Tables
When Watson™ Explorer Engine gathers reporting data, it inserts it into a large SQL database, in multiple tables. The tables that it inserts data into will vary according to which reporting options have been enabled in the Reporting tab of a project. Here is an overview of the tables that may exist in your reporting database.
Global values (one value per "session"):
- reporting - this table consists of the primary key and the timestamp of the event.
- application - a simple string specified at the project level to differentiate multiple sets of projects.
- query - the query entered by the user, normalized for spaces and capitalization.
- pageid - the value of the "page" or "id" cgi parameter.
- tb - toolbar usage tracking (records the "tb" CGI parameter).
- gut - (general user tracking) the value can be manipulated in the project's main macro.
- user - if the query was performed by a user logged into Watson Explorer Engine, this is the name of the user.
- project - the name of the project.
- ip - the user's IP address (value of REMOTE_ADDR CGI environment variable).
- remotehost - the value of the REMOTE_HOST CGI environment variable.
- referrer - the value of the HTTP_REFERER CGI environment variable.
- servername - the value of the SERVER_NAME CGI environment variable.
User-Agent parsing:
- ua - the full user-agent string.
- uaplatform - the parsed platform (operating system) from the user-agent, if recognized.
- uabrowser - the parsed browser name from the user-agent, if recognized.
- uabrowserfull - the parsed full browser name (name with platform) from the user-agent, if recognized.
- uabrowserversion - the parsed browser version number from the user-agent, if recognized.
- ualanguage - the language specified by the user-agent, if recognized.
- ualocale - the locale specified by the user-agent, if recognized.
Other global values:
- term - an individual term in the user's query (there may be multiple values per session).
- cluster - the descriptor of a cluster in the output (when track-clusters is enabled).
- spelling - the suggested spelling correction, if presented to the user.
- boost - the number of boosts for this query.
Per-source values (the "key" column for each entry is the source name):
- responsetime - the amount of time elapsed between the start of processing and the end of the last fetch (per source).
- fetchingtime - the amount of time elapsed between the start of the first fetch and the end of the last fetch to complete (per source).
- parsingtime - the total amount of time spent parsing (per source).
- numresults - the number of results returned (per source).
Per-result-click values (the "key" column for each entry is a unique identifier generated for that click):
- url - the url of the result that was clicked by the user.
- rank - the rank (from the source) of the result click by the user.
- paid - the value "1" if the clicked result was a paid listing.
- resultsource - the name of the source from which this result was fetched.
- resultlabel - the descriptor of the cluster containing the clicked result, if it is in a cluster that was expanded by the user.
- resultcluster - the rank of the cluster containing the clicked result, if it is in a cluster that was expanded by the user.
- clusterposition - the rank of the cluster that was clicked by the user.
- descriptor - the descriptor of the cluster that has been clicked by the user.