Customizing the configuration file
You can edit the xmlsimp.xml configuration file to suit your requirements. For example, if you process Latin-1 characters, you might add the eacute (é) character to your Exclude list as shown:
<character_restrictions name="latin1">
<elements>
<character_restriction exclude="'" reference_string="&apos;"/>
<character_restriction exclude="'" reference_string="&#39;"/>
<character_restriction exclude=""" reference_string="&quot;"/>
<character_restriction exclude=""" reference_string="&#34;"/>
<character_restriction exclude=">" reference_string="&gt;"/>
<character_restriction exclude=">" reference_string="&#62;"/>
<character_restriction exclude="<" reference_string="&lt;"/>
<character_restriction exclude="<" reference_string="&#60;"/>
<character_restriction exclude="é" reference_string="&eacute;"/>
<character_restriction exclude="é" reference_string="&#233;"/>
<character_restriction exclude="é" reference_string="&#xE9;"/>
<character_restriction exclude="é" reference_string="é"/>
<character_restriction exclude="&" reference_string="&amp;"/>
<character_restriction exclude="&" reference_string="&#38;"/>
<character_restriction exclude="'" reference_string="'"/>
<character_restriction exclude=""" reference_string="""/>
<character_restriction exclude=">" reference_string=">"/></elements>
<attributes>
<character_restriction exclude="'" reference_string="&apos;"/>
<character_restriction exclude="'" reference_string="&#39;"/>
<character_restriction exclude=""" reference_string="&quot;"/>
<character_restriction exclude=""" reference_string="&#34;"/>
<character_restriction exclude=">" reference_string="&gt;"/>
<character_restriction exclude=">" reference_string="&#62;"/>
<character_restriction exclude="<" reference_string="&lt;"/>
<character_restriction exclude="<" reference_string="&#60;"/>
<character_restriction exclude="&" reference_string="&amp;"/>
<character_restriction exclude="&" reference_string="&#38;"/>
</attributes>
</character_restrictions>
You must specify the proper encoding in the XML prolog in the configuration file. For example, if the character é is encoded in the configuration file as Latin-1 (ISO-8859-1) byte value 0xE9, the XML prolog of the configuration file must specify that the ISO-8859-1 encoding is used, instead of the default UTF-8.