Character sets are currently supported for Czech, Cyrillic, Greek, Hebrew, Turkish, Chinese, Korean, Japanese, and other languages. The detailed list of supported character sets follows.
Language      Character Set          Encoding
-----------------------------------------------------
...           ISO 8859-1 Latin-1     8-bit
Czech,...     ISO 8859-2 Latin-2     8-bit
...           ISO 8859-3 Latin-3     8-bit
...           ISO 8859-4 Latin-4     8-bit
Cyrillic      ISO 8859-5             8-bit
Cyrillic      KOI-8                  8-bit
Greek         ISO 8859-7             8-bit
Hebrew        ISO 8859-8             8-bit
Turkish       ISO 8859-9 Latin-5     8-bit
Chinese       GB 2312                GB/HZ encoding
Chinese       Big5                   Big5
Korean        KSC 5601               EUC,ISO-2022-KR
Japanese      JIS X 0208             EUC,ISO-2022-JP
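As a rough illustration of why the Encoding column matters, the sketch below encodes the same text under two encodings listed for one character set and checks whether the result is 7-bit clean. It uses Python's standard codecs purely for demonstration. The codec names, the `CODECS` table, and the helper functions are assumptions of this sketch and are not part of the software described here.

```python
# Hypothetical mapping from entries in the list above to Python codec names.
# The codec names are Python's, chosen for this sketch only.
CODECS = {
    "ISO 8859-2": "iso8859_2",
    "KOI-8": "koi8_r",            # assumption: the KOI8-R variant
    "GB 2312 (HZ)": "hz",
    "Big5": "big5",
    "KSC 5601 (EUC)": "euc_kr",
    "KSC 5601 (ISO-2022-KR)": "iso2022_kr",
    "JIS X 0208 (EUC)": "euc_jp",
    "JIS X 0208 (ISO-2022-JP)": "iso2022_jp",
}


def encoded_forms(text, names):
    """Encode text once per named entry and return {name: bytes}."""
    return {name: text.encode(CODECS[name]) for name in names}


def is_seven_bit(data):
    """Return True if every byte has its high bit clear."""
    return all(byte < 0x80 for byte in data)


if __name__ == "__main__":
    sample = "\u65e5\u672c"  # two JIS X 0208 characters
    forms = encoded_forms(
        sample, ["JIS X 0208 (EUC)", "JIS X 0208 (ISO-2022-JP)"]
    )
    for name, data in forms.items():
        print(name, data.hex(" "), "7-bit" if is_seven_bit(data) else "8-bit")
```

EUC sets the high bit on every byte of a multi-byte character, while ISO-2022-JP stays within 7 bits and switches character sets with escape sequences, so the same text produces different byte sequences depending on the encoding chosen.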
We also have a short note on Japanese encoding methods and a related problem. It may be helpful when considering how to handle multi-byte characters on the WWW. See that note if you are interested.