ENCODING Subcommand (GET STATA command)
The ENCODING
subcommand specifies
the character encoding of the Stata data file.
- The encoding must be correctly identified or the file cannot be read.
- The subcommand is followed by an optional equals sign and a quoted encoding value.
- The quoted value can be any of the values in the Encoding column in the Character Encoding table.
- The default encoding is "Locale", which is the encoding of the current IBM® SPSS® Statistics locale. See the topic LOCALE Subcommand (SET command) for more information.
Example
GET STATA FILE='/data/empl.dta'
/ENCODING='Windows-1252'.
Character Set | Encoding |
---|---|
IBM SPSS Statistics Locale | Locale |
Operating System Locale | System |
Western | ISO-8859-1 |
Western | ISO-8859-15 |
Western | IBM850 |
Western | Windows-1252 |
Celtic | ISO-8859-14 |
Greek | ISO-8859-7 |
Greek | Windows-1253 |
Nordic | ISO-8859-10 |
Baltic | Windows-1257 |
Central European | IBM852 |
Central European | ISO-8859-2 |
Cyrillic | IBM855 |
Cyrillic | ISO-8859-5 |
Cyrillic | Windows-1251 |
Cyrillic/Russian | CP-866 |
Chinese Simplified | GBK |
Chinese Simplified | ISO-2022-CN |
Chinese Traditional | Big5 |
Chinese Traditional | EUC-TW |
Japanese | EUC-JP |
Japanese | ISO-2022-JP |
Japanese | Shift-JIS |
Korean | EUC-KR |
Thai | Windows-874 |
Turkish | IBM857 |
Turkish | ISO-8859-9 |
Arabic | Windows-1256 |
Arabic | IBM864 |
Hebrew | ISO-8859-8 |
Hebrew | Windows-1255 |
Hebrew | IBM862 |