URL Normalizations

You can choose to apply case-normalization to different components of the URL. These options are in the URL normalization sub-section of the General Settings for a search collection.

Windows servers, for example, are case insensitive and it is useful to identify them and mark the URLs as case-insensitive. For specific scripts, you may have knowledge that allows you to enable other case normalizations.

  • Case normalization - This will convert all upper-case letters to lower-case. Available URL components are the following:
    • url - All components of the URL except for the query-string.
    • cgi-name - The name portion of all values in the query-string.
    • cgi-value - The value portion of all values in the query-string.
  • CGI string to add - URLs may need to have CGI parameters added to them. This option will correctly append a valid CGI string.
  • CGI parameters to remove - URLs often contain CGI parameters that encode state or display information. This list takes one CGI parameter per line. Each CGI parameter must be written in its URL-escaped form.

If these options do not allow you to normalize the URL in some specific way, all sub-conditions (but not the global options) allow regular expression search-and-replace expressions. Add a custom conditional setting and in the Advanced sub-section, you can use the Replacements option by putting a regular expression in the Replace this... field and the literal to use instead in the With that... field. Click on the plus sign to the right of the fields to add another rule. Global search-and-replaces could be added by adding a sub-condition that matches the URL * and adding search-and-replace expressions to this rule.