Topic
  • 4 replies
  • Latest Post - ‏2012-02-25T21:30:22Z by pradicon
SystemAdmin
SystemAdmin
7754 Posts

Pinned topic Writing XML with UTF-8

‏2012-02-25T20:40:38Z |
We've run into an issue where the XML stage (DS 8.1) appears not to be writing characters above 127 correctly according to XML standards.

For example: Smith & Jones is correctly encoded as Smith & Jones

Second example: The split vertical bar (*not* | which is a different character) appears to be output as-is instead of ¦ where 166 is the unicode decimal number.

This caused an issue downstream as the result was not a valid XML message.

Q: What settings are needed to ensure symbols like this, or non-english characters are represented correctly in the XML output?

Thanks!
Updated on 2012-02-25T21:30:22Z at 2012-02-25T21:30:22Z by pradicon
  • SystemAdmin
    SystemAdmin
    7754 Posts

    Re: Writing XML with UTF-8i

    ‏2012-02-25T20:45:08Z  
    In the above example the & character is correctly output as the string & a m p ; (I've added spaces so the browser doesn't encode back to &).
  • SystemAdmin
    SystemAdmin
    7754 Posts

    Re: Writing XML with UTF-8i

    ‏2012-02-25T20:46:32Z  
    In the above example the & character is correctly output as the string & a m p ; (I've added spaces so the browser doesn't encode back to &).
    Arrgh - ¦ should be encoded as & # 1 6 6 ;
  • pradicon
    pradicon
    3 Posts

    Re: Writing XML with UTF-8i

    ‏2012-02-25T21:28:01Z  
    Arrgh - ¦ should be encoded as & # 1 6 6 ;
    Hi
    I am sorry but i didn't understood your second post.
    Anyway if you have problem with charachter conversion, i guess u need change the map table
  • pradicon
    pradicon
    3 Posts

    Re: Writing XML with UTF-8i

    ‏2012-02-25T21:30:22Z  
    • pradicon
    • ‏2012-02-25T21:28:01Z
    Hi
    I am sorry but i didn't understood your second post.
    Anyway if you have problem with charachter conversion, i guess u need change the map table
    The other option is to locate the problmatic char in the transformer and change it's the hex or Dec value