UTF-8化を行うには、Tera Termの「Setup[設定]」メニューから「General[全般]」
を選び、「Language[言語]」を "UTF-8", "Japanese" および "Korean"
のいずれかへ変更します。
Languageに "Japanese" と "Korean" のいずれかを選んだ場合、次に「Setup[全般]」
メニューから「Terminal[端末]」を選択すると出る "Terminal setup[端末の設定]"
ダイアログの「Kanji(receive)[漢字-受信]」および「Kanji(transmit)[漢字-送信]」
にて、"UTF-8" を選んでください。Tera Termの再起動は必要ありません。
コマンドラインの「/KT」および「/KR」オプションにおいて、"UTF8" を指定すると、
送信および受信コードに UTF-8 を設定することができます。
現状のTera Termは内部設計がUnicode対応になっておらず、 以下に示すように文字コードは二段変換になっています。
UTF-8 <-----> Unicode(UTF-16LE) <-----> MBCS
(1) (2)
(1)において、UTF-8は3バイトまでしか変換していないため、
サロゲートペア(surrogate pair)や結合文字(combining character)などには対応していません。
(2)において、UnicodeとMBCS(Multiple Byte Character Set)の相互変換を行うために、
ロケール(言語指定)を正しく設定する必要があります。
Unicodeによるローカライズ言語を扱えるようにするためには、teraterm.iniファイルにロケールおよびコードページの設定が必要となります。下記にサンプルを示します。
; Locale for Unicode Locale = japanese ; CodePage for Unicode CodePage = 932
この設定は、"Terminal setup[端末の設定]" ダイアログの 「Locale[ロケール]」および 「CodePage[言語コード]」でも設定できます。
ロケールおよびコードページに設定できる内容については、下記のサイトを参照してください。
; Locale for Unicode Locale = american ; CodePage for Unicode CodePage = 65001
また、MBCSを表示する為にフォントを正しく設定する必要があります。
「Setup[設定]」メニューの「Font[フォント]」で、"フォント"に使用する言語の文字が
扱えるフォントを、"文字セット" に使用する言語の文字セットを指定してください。
ロケールおよびフォントが正しく設定されていれば、英語版Windowsで日本語を表示したり、
日本語版Windowsで韓国語を表示したりする事も可能です。
ただし多国語対応していないので、一つのTera Termウィンドウで日本語と韓国語を同時に
表示したりする事は出来ません。
"UTF-8m" はMac OS X(HFS+)向けのエンコーディング指定です。受信のみのサポートです。
「/KR」コマンドラインでは"UTF8m"を指定することができます。
| Primary language | Sublanguage | Language string |
|---|---|---|
| Chinese | Chinese | "chinese" |
| Chinese | Chinese (simplified) | "chinese-simplified" or "chs" |
| Chinese | Chinese (traditional) | "chinese-traditional" or "cht" |
| Czech | Czech | "csy" or "czech" |
| Danish | Danish | "dan" or "danish" |
| Dutch | Dutch (default) | "dutch" or "nld" |
| Dutch | Dutch (Belgium) | "belgian", "dutch-belgian", or "nlb" |
| English | English (default) | "english" |
| English | English (Australia) | "australian", "ena", or "english-aus" |
| English | English (Canada) | "canadian", "enc", or "english-can" |
| English | English (New Zealand) | "english-nz" or "enz" |
| English | English (United Kingdom) | "eng", "english-uk", or "uk" |
| English | English (United States) | "american", "american english", "american-english", "english-american", "english-us", "english-usa", "enu", "us", or "usa" |
| Finnish | Finnish | "fin" or "finnish" |
| French | French (default) | "fra" or "french" |
| French | French (Belgium) | "frb" or "french-belgian" |
| French | French (Canada) | "frc" or "french-canadian" |
| French | French (Switzerland) | "french-swiss" or "frs" |
| German | German (default) | "deu" or "german" |
| German | German (Austria) | "dea" or "german-austrian" |
| German | German (Switzerland) | "des", "german-swiss", or "swiss" |
| Greek | Greek | "ell" or "greek" |
| Hungarian | Hungarian | "hun" or "hungarian" |
| Icelandic | Icelandic | "icelandic" or "isl" |
| Italian | Italian (default) | "ita" or "italian" |
| Italian | Italian (Switzerland) | "italian-swiss" or "its" |
| Japanese | Japanese | "japanese" or "jpn" |
| Korean | Korean | "kor" or "korean" |
| Norwegian | Norwegian (default) | "norwegian" |
| Norwegian | Norwegian (Bokmal) | "nor" or "norwegian-bokmal" |
| Norwegian | Norwegian (Nynorsk) | "non" or "norwegian-nynorsk" |
| Polish | Polish | "plk" or "polish" |
| Portuguese | Portuguese (default) | "portuguese" or "ptg" |
| Portuguese | Portuguese (Brazil) | "portuguese-brazilian" or "ptb" |
| Russian | Russian (default) | "rus" or "russian" |
| Slovak | Slovak | "sky" or "slovak" |
| Spanish | Spanish (default) | "esp" or "spanish" |
| Spanish | Spanish (Mexico) | "esm" or "spanish-mexican" |
| Spanish | Spanish (Modern) | "esn" or "spanish-modern" |
| Swedish | Swedish | "sve" or "swedish" |
| Turkish | Turkish | "trk" or "turkish" |
| Identifier | Name |
|---|---|
| 037 | IBM EBCDIC US-Canada |
| 437 | OEM United States |
| 500 | IBM EBCDIC International |
| 708 | Arabic (ASMO 708) |
| 709 | Arabic (ASMO-449+, BCON V4) |
| 710 | Arabic - Transparent Arabic |
| 720 | Arabic (Transparent ASMO); Arabic (DOS) |
| 737 | OEM Greek (formerly 437G); Greek (DOS) |
| 775 | OEM Baltic; Baltic (DOS) |
| 850 | OEM Multilingual Latin 1; Western European (DOS) |
| 852 | OEM Latin 2; Central European (DOS) |
| 855 | OEM Cyrillic (primarily Russian) |
| 857 | OEM Turkish; Turkish (DOS) |
| 858 | OEM Multilingual Latin 1 + Euro symbol |
| 860 | OEM Portuguese; Portuguese (DOS) |
| 861 | OEM Icelandic; Icelandic (DOS) |
| 862 | OEM Hebrew; Hebrew (DOS) |
| 863 | OEM French Canadian; French Canadian (DOS) |
| 864 | OEM Arabic; Arabic (864) |
| 865 | OEM Nordic; Nordic (DOS) |
| 866 | OEM Russian; Cyrillic (DOS) |
| 869 | OEM Modern Greek; Greek, Modern (DOS) |
| 870 | IBM EBCDIC Multilingual/ROECE (Latin 2); IBM EBCDIC Multilingual Latin 2 |
| 874 | ANSI/OEM Thai (same as 28605, ISO 8859-15); Thai (Windows) |
| 875 | IBM EBCDIC Greek Modern |
| 932 | ANSI/OEM Japanese; Japanese (Shift-JIS) |
| 936 | ANSI/OEM Simplified Chinese (PRC, Singapore); Chinese Simplified (GB2312) |
| 949 | ANSI/OEM Korean (Unified Hangul Code) |
| 950 | ANSI/OEM Traditional Chinese (Taiwan; Hong Kong SAR, PRC); Chinese Traditional (Big5) |
| 1026 | IBM EBCDIC Turkish (Latin 5) |
| 1047 | IBM EBCDIC Latin 1/Open System |
| 1140 | IBM EBCDIC US-Canada (037 + Euro symbol); IBM EBCDIC (US-Canada-Euro) |
| 1141 | IBM EBCDIC Germany (20273 + Euro symbol); IBM EBCDIC (Germany-Euro) |
| 1142 | IBM EBCDIC Denmark-Norway (20277 + Euro symbol); IBM EBCDIC (Denmark-Norway-Euro) |
| 1143 | IBM EBCDIC Finland-Sweden (20278 + Euro symbol); IBM EBCDIC (Finland-Sweden-Euro) |
| 1144 | IBM EBCDIC Italy (20280 + Euro symbol); IBM EBCDIC (Italy-Euro) |
| 1145 | IBM EBCDIC Latin America-Spain (20284 + Euro symbol); IBM EBCDIC (Spain-Euro) |
| 1146 | IBM EBCDIC United Kingdom (20285 + Euro symbol); IBM EBCDIC (UK-Euro) |
| 1147 | IBM EBCDIC France (20297 + Euro symbol); IBM EBCDIC (France-Euro) |
| 1148 | IBM EBCDIC International (500 + Euro symbol); IBM EBCDIC (International-Euro) |
| 1149 | IBM EBCDIC Icelandic (20871 + Euro symbol); IBM EBCDIC (Icelandic-Euro) |
| 1200 | Unicode UTF-16, little endian byte order (BMP of ISO 10646); available only to managed applications |
| 1201 | Unicode UTF-16, big endian byte order; available only to managed applications |
| 1250 | ANSI Central European; Central European (Windows) |
| 1251 | ANSI Cyrillic; Cyrillic (Windows) |
| 1252 | ANSI Latin 1; Western European (Windows) |
| 1253 | ANSI Greek; Greek (Windows) |
| 1254 | ANSI Turkish; Turkish (Windows) |
| 1255 | ANSI Hebrew; Hebrew (Windows) |
| 1256 | ANSI Arabic; Arabic (Windows) |
| 1257 | ANSI Baltic; Baltic (Windows) |
| 1258 | ANSI/OEM Vietnamese; Vietnamese (Windows) |
| 1361 | Korean (Johab) |
| 10000 | MAC Roman; Western European (Mac) |
| 10001 | Japanese (Mac) |
| 10002 | MAC Traditional Chinese (Big5); Chinese Traditional (Mac) |
| 10003 | Korean (Mac) |
| 10004 | Arabic (Mac) |
| 10005 | Hebrew (Mac) |
| 10006 | Greek (Mac) |
| 10007 | Cyrillic (Mac) |
| 10008 | MAC Simplified Chinese (GB 2312); Chinese Simplified (Mac) |
| 10010 | Romanian (Mac) |
| 10017 | Ukrainian (Mac) |
| 10021 | Thai (Mac) |
| 10029 | MAC Latin 2; Central European (Mac) |
| 10079 | Icelandic (Mac) |
| 10081 | Turkish (Mac) |
| 10082 | Croatian (Mac) |
| 12000 | Unicode UTF-32, little endian byte order; available only to managed applications |
| 12001 | Unicode UTF-32, big endian byte order; available only to managed applications |
| 20000 | CNS Taiwan; Chinese Traditional (CNS) |
| 20001 | TCA Taiwan |
| 20002 | Eten Taiwan; Chinese Traditional (Eten) |
| 20003 | IBM5550 Taiwan |
| 20004 | TeleText Taiwan |
| 20005 | Wang Taiwan |
| 20105 | IA5 (IRV International Alphabet No. 5, 7-bit); Western European (IA5) |
| 20106 | IA5 German (7-bit) |
| 20107 | IA5 Swedish (7-bit) |
| 20108 | IA5 Norwegian (7-bit) |
| 20127 | US-ASCII (7-bit) |
| 20261 | T.61 |
| 20269 | ISO 6937 Non-Spacing Accent |
| 20273 | IBM EBCDIC Germany |
| 20277 | IBM EBCDIC Denmark-Norway |
| 20278 | IBM EBCDIC Finland-Sweden |
| 20280 | IBM EBCDIC Italy |
| 20284 | IBM EBCDIC Latin America-Spain |
| 20285 | IBM EBCDIC United Kingdom |
| 20290 | IBM EBCDIC Japanese Katakana Extended |
| 20297 | IBM EBCDIC France |
| 20420 | IBM EBCDIC Arabic |
| 20423 | IBM EBCDIC Greek |
| 20424 | IBM EBCDIC Hebrew |
| 20833 | IBM EBCDIC Korean Extended |
| 20838 | IBM EBCDIC Thai |
| 20866 | Russian (KOI8-R); Cyrillic (KOI8-R) |
| 20871 | IBM EBCDIC Icelandic |
| 20880 | IBM EBCDIC Cyrillic Russian |
| 20905 | IBM EBCDIC Turkish |
| 20924 | IBM EBCDIC Latin 1/Open System (1047 + Euro symbol) |
| 20932 | Japanese (JIS 0208-1990 and 0121-1990) |
| 20936 | Simplified Chinese (GB2312); Chinese Simplified (GB2312-80) |
| 20949 | Korean Wansung |
| 21025 | IBM EBCDIC Cyrillic Serbian-Bulgarian |
| 21027 | (deprecated) |
| 21866 | Ukrainian (KOI8-U); Cyrillic (KOI8-U) |
| 28591 | ISO 8859-1 Latin 1; Western European (ISO) |
| 28592 | ISO 8859-2 Central European; Central European (ISO) |
| 28593 | ISO 8859-3 Latin 3 |
| 28594 | ISO 8859-4 Baltic |
| 28595 | ISO 8859-5 Cyrillic |
| 28596 | ISO 8859-6 Arabic |
| 28597 | ISO 8859-7 Greek |
| 28598 | ISO 8859-8 Hebrew; Hebrew (ISO-Visual) |
| 28599 | ISO 8859-9 Turkish |
| 28603 | ISO 8859-13 Estonian |
| 28605 | ISO 8859-15 Latin 9 |
| 29001 | Europa 3 |
| 38598 | ISO 8859-8 Hebrew; Hebrew (ISO-Logical) |
| 50220 | ISO 2022 Japanese with no halfwidth Katakana; Japanese (JIS) |
| 50221 | ISO 2022 Japanese with halfwidth Katakana; Japanese (JIS-Allow 1 byte Kana) |
| 50222 | ISO 2022 Japanese JIS X 0201-1989; Japanese (JIS-Allow 1 byte Kana - SO/SI) |
| 50225 | ISO 2022 Korean |
| 50227 | ISO 2022 Simplified Chinese; Chinese Simplified (ISO 2022) |
| 50229 | ISO 2022 Traditional Chinese |
| 50930 | EBCDIC Japanese (Katakana) Extended |
| 50931 | EBCDIC US-Canada and Japanese |
| 50933 | EBCDIC Korean Extended and Korean |
| 50935 | EBCDIC Simplified Chinese Extended and Simplified Chinese |
| 50936 | EBCDIC Simplified Chinese |
| 50937 | EBCDIC US-Canada and Traditional Chinese |
| 50939 | EBCDIC Japanese (Latin) Extended and Japanese |
| 51932 | EUC Japanese |
| 51936 | EUC Simplified Chinese; Chinese Simplified (EUC) |
| 51949 | EUC Korean |
| 51950 | EUC Traditional Chinese |
| 52936 | HZ-GB2312 Simplified Chinese; Chinese Simplified (HZ) |
| 54936 | Windows XP and later: GB18030 Simplified Chinese (4 byte); Chinese Simplified (GB18030) |
| 57002 | ISCII Devanagari |
| 57003 | ISCII Bengali |
| 57004 | ISCII Tamil |
| 57005 | ISCII Telugu |
| 57006 | ISCII Assamese |
| 57007 | ISCII Oriya |
| 57008 | ISCII Kannada |
| 57009 | ISCII Malayalam |
| 57010 | ISCII Gujarati |
| 57011 | ISCII Punjabi |
| 65000 | Unicode (UTF-7) |
| 65001 | Unicode (UTF-8) |