Extended ASCII characters
Extended ASCII characters are handled several different ways in the full-text filter. Some are special characters while others are treated as spaces. Some are treated as letters that depend on the spaces around the characters.
Special characters or punctuation are not indexed and cannot be searched for in filters. The following table describes what behavior applies to these characters in the filter.
| Extended ASCII Character Syntax | Description |
|---|---|
| Special characters | Some special characters generate syntax errors
because they cannot be processed. Either remove the character from
the filter or surround the filter with single quotation marks. For
example, a filter for hello! without quotations
generates an error. However, if the filter is 'hello!' (with
single quotation marks), the query can be processed. In general,
special characters are treated as a space in a filter and they are
not indexed. For example, Most special characters can also be searched
for with quotation marks. For example, These
special characters generate errors unless surrounded by quotation
marks: |
| Period (.) | A period with no space on either side is treated
like a letter. For example, a filter for 172.16.0.46 finds
the whole number of 172.16.0.46, not partial numbers
of 172, 16, 0, 46.A
period with a space on either side is treated as a space and ignored.
For example, a filter for |
| At (@) | An @ symbol is indexed like
a letter when there are no spaces on either side. For example, a filter
for jdoe@yourcompany.com finds the entire email address.
An @ symbol with a space on either side or one side
is treated as a space and ignored. For example, @home finds
home in the data object. |
| Hyphen (-) | A hyphen is indexed like a letter when there
are no spaces on either side. For example, a filter for 012-44-5668 finds
the entire number, not 012, 44, 5668 as
separate numbers. A hyphen with a space on either side is treated
as a space and ignored. |
| Slash (/) | A slash is indexed like a letter when there
are no spaces on either side. For example, a filter for 10/30/1940 finds
the entire expression 10/30/1940 (and not 10
30 1940) as separate numbers. |
| Ampersand (&) | An ampersand is indexed like a letter when there
are no spaces on either side. For example, a filter for S&P finds S&P in
the data objects. However, a filter for Proctor & Gamble without
quotation marks finds either Proctor or Gamble.
A filter with quotation marks "Proctor & Gamble" finds
the phrase Proctor Gamble only. |
| Colon (:) | A colon is considered a special character. It
is treated as a space in filter and is not indexed. For example, 21:35:15 is
indexed as three consecutive words, that is, 21, 35,
and 15. Searching for '21:35:15' returns
zero object. To search for '21 35 15', data object
that has the word 21:35:15 or that has three consecutive
words as 21, 35, 15 is
returned. Specifying terms with colons without the single quotation
marks results in an invalid syntax error. |
| Apostrophe (') | The apostrophe in the term is indexed as a whole word. For example, a filter for patient's returns results for patient's but not for patient. A filter for the name O’Hara returns results for O’Hara. An apostrophe is treated as a single quotation mark, so it must always be quoted with double quotation marks. |
| Comma (,) | Commas are left in when surrounded by numerals,
and those numerals make a legitimate number. For example, "1,000" is
indexed as 1,000. To search this term, specify 1,000 in
single quotation marks or quotation marks. Otherwise, it results in
an invalid syntax error. For the term "10,00", it
is indexed as 10 and 00. To search
for this term, specify "10 00". |
| Number sign (#), dollar sign ($), percent (%), equal sign (=), underscore (_) | These characters are treated as white space
in the filter. For example, a search for $100 returns
results for the number 100 whether 100 refers
to dollars, a percentage, or a whole number. The term 'JAVA_HOME'
is indexed as 'JAVA HOME'. To search for this term, specify JAVA HOME
in quotation marks. |