diff options
author | Daniel Baumann <daniel.baumann@progress-linux.org> | 2024-04-15 19:43:11 +0000 |
---|---|---|
committer | Daniel Baumann <daniel.baumann@progress-linux.org> | 2024-04-15 19:43:11 +0000 |
commit | fc22b3d6507c6745911b9dfcc68f1e665ae13dbc (patch) | |
tree | ce1e3bce06471410239a6f41282e328770aa404a /templates/man7/charsets.7.pot | |
parent | Initial commit. (diff) | |
download | manpages-l10n-fc22b3d6507c6745911b9dfcc68f1e665ae13dbc.tar.xz manpages-l10n-fc22b3d6507c6745911b9dfcc68f1e665ae13dbc.zip |
Adding upstream version 4.22.0.upstream/4.22.0
Signed-off-by: Daniel Baumann <daniel.baumann@progress-linux.org>
Diffstat (limited to 'templates/man7/charsets.7.pot')
-rw-r--r-- | templates/man7/charsets.7.pot | 1024 |
1 files changed, 1024 insertions, 0 deletions
diff --git a/templates/man7/charsets.7.pot b/templates/man7/charsets.7.pot new file mode 100644 index 00000000..3de69690 --- /dev/null +++ b/templates/man7/charsets.7.pot @@ -0,0 +1,1024 @@ +# SOME DESCRIPTIVE TITLE +# Copyright (C) YEAR Free Software Foundation, Inc. +# This file is distributed under the same license as the PACKAGE package. +# FIRST AUTHOR <EMAIL@ADDRESS>, YEAR. +# +#, fuzzy +msgid "" +msgstr "" +"Project-Id-Version: PACKAGE VERSION\n" +"POT-Creation-Date: 2024-03-01 16:53+0100\n" +"PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n" +"Last-Translator: FULL NAME <EMAIL@ADDRESS>\n" +"Language-Team: LANGUAGE <LL@li.org>\n" +"Language: \n" +"MIME-Version: 1.0\n" +"Content-Type: text/plain; charset=UTF-8\n" +"Content-Transfer-Encoding: 8bit\n" + +#. type: TH +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "charsets" +msgstr "" + +#. type: TH +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "2024-01-28" +msgstr "" + +#. type: TH +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "Linux man-pages 6.06" +msgstr "" + +#. type: SH +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "NAME" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "charsets - character set standards and internationalization" +msgstr "" + +#. type: SH +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "DESCRIPTION" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"This manual page gives an overview on different character set standards and " +"how they were used on Linux before Unicode became ubiquitous. Some of this " +"information is still helpful for people working with legacy systems and " +"documents." +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Standards discussed include such as ASCII, GB 2312, ISO/IEC\\~8859, JIS, " +"KOI8-R, KS, and Unicode." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"The primary emphasis is on character sets that were actually used by locale " +"character sets, not the myriad others that could be found in data from other " +"systems." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "ASCII" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"ASCII (American Standard Code For Information Interchange) is the original 7-" +"bit character set, originally designed for American English. Also known as " +"US-ASCII. It is currently described by the ISO/IEC\\~646:1991 IRV " +"(International Reference Version) standard." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Various ASCII variants replacing the dollar sign with other currency symbols " +"and replacing punctuation with non-English alphabetic characters to cover " +"German, French, Spanish, and others in 7 bits emerged. All are deprecated; " +"glibc does not support locales whose character sets are not true supersets " +"of ASCII." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"As Unicode, when using UTF-8, is ASCII-compatible, plain ASCII text still " +"renders properly on modern UTF-8 using systems." +msgstr "" + +#. type: SS +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"ISO/IEC\\~8859 is a series of 15 8-bit character sets, all of which have " +"ASCII in their low (7-bit) half, invisible control characters in positions " +"128 to 159, and 96 fixed-width graphics in positions 160\\[en]255." +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Of these, the most important is ISO/IEC\\~8859-1 (\"Latin Alphabet No. 1\" / " +"Latin-1). It was widely adopted and supported by different systems, and is " +"gradually being replaced with Unicode. The ISO/IEC\\~8859-1 characters are " +"also the first 256 characters of Unicode." +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Console support for the other ISO/IEC\\~8859 character sets is available " +"under Linux through user-mode utilities (such as B<setfont>(8)) that modify " +"keyboard bindings and the EGA graphics table and employ the \"user mapping\" " +"font table in the console driver." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "Here are brief descriptions of each character set:" +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-1 (Latin-1)" +msgstr "" + +#. type: Plain text +#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron +#: opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-1 covers many European languages such as Albanian, Basque, Danish, " +"English, Faroese, Galician, Icelandic, Irish, Italian, Norwegian, " +"Portuguese, Spanish, and Swedish. The lack of the ligatures Dutch IJ/ij, " +"French œ, and „German“ quotation marks was considered tolerable." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-2 (Latin-2)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-2 supports many Latin-written Central and East European languages such " +"as Bosnian, Croatian, Czech, German, Hungarian, Polish, Slovak, and " +"Slovene. Replacing Romanian ș/ț with ş/ţ was considered tolerable." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-3 (Latin-3)" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Latin-3 was designed to cover of Esperanto, Maltese, and Turkish, but ISO/" +"IEC\\~8859-9 later superseded it for Turkish." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-4 (Latin-4)" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Latin-4 introduced letters for North European languages such as Estonian, " +"Latvian, and Lithuanian, but was superseded by ISO/IEC\\~8859-10 and ISO/" +"IEC\\~8859-13." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-5" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Cyrillic letters supporting Bulgarian, Byelorussian, Macedonian, Russian, " +"Serbian, and (almost completely) Ukrainian. It was never widely used, see " +"the discussion of KOI8-R/KOI8-U below." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-6" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Was created for Arabic. The ISO/IEC\\~8859-6 glyph table is a fixed font of " +"separate letter forms, but a proper display engine should combine these " +"using the proper initial, medial, and final forms." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-7" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "Was created for Modern Greek in 1987, updated in 2003." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-8" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Supports Modern Hebrew without niqud (punctuation signs). Niqud and full-" +"fledged Biblical Hebrew were outside the scope of this character set." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-9 (Latin-5)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"This is a variant of Latin-1 that replaces Icelandic letters with Turkish " +"ones." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-10 (Latin-6)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-6 added the Inuit (Greenlandic) and Sami (Lappish) letters that were " +"missing in Latin-4 to cover the entire Nordic area." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-11" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Supports the Thai alphabet and is nearly identical to the TIS-620 standard." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-12" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "This character set does not exist." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-13 (Latin-7)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Supports the Baltic Rim languages; in particular, it includes Latvian " +"characters not found in Latin-4." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-14 (Latin-8)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"This is the Celtic character set, covering Old Irish, Manx, Gaelic, Welsh, " +"Cornish, and Breton." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-15 (Latin-9)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-9 is similar to the widely used Latin-1 but replaces some less common " +"symbols with the Euro sign and French and Finnish letters that were missing " +"in Latin-1." +msgstr "" + +#. type: TP +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~8859-16 (Latin-10)" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"This character set covers many Southeast European languages, and most " +"importantly supports Romanian more completely than Latin-2." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "KOI8-R / KOI8-U" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"KOI8-R is a non-ISO character set popular in Russia before Unicode. The " +"lower half is ASCII; the upper is a Cyrillic character set somewhat better " +"designed than ISO/IEC\\~8859-5. KOI8-U, based on KOI8-R, has better support " +"for Ukrainian. Neither of these sets are ISO/IEC\\~2022 compatible, unlike " +"the ISO/IEC\\~8859 series." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Console support for KOI8-R is available under Linux through user-mode " +"utilities that modify keyboard bindings and the EGA graphics table, and " +"employ the \"user mapping\" font table in the console driver." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "GB 2312" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"GB 2312 is a mainland Chinese national standard character set used to " +"express simplified Chinese. Just like JIS X 0208, characters are mapped " +"into a 94x94 two-byte matrix used to construct EUC-CN. EUC-CN is the most " +"important encoding for Linux and includes ASCII and GB 2312. Note that EUC-" +"CN is often called as GB, GB 2312, or CN-GB." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "Big5" +msgstr "" + +#. Thanks to Tomohiro KUBOTA for the following sections about +#. national standards. +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"Big5 was a popular character set in Taiwan to express traditional Chinese. " +"(Big5 is both a character set and an encoding.) It is a superset of ASCII. " +"Non-ASCII characters are expressed in two bytes. Bytes 0xa1\\[en]0xfe are " +"used as leading bytes for two-byte characters. Big5 and its extension were " +"widely used in Taiwan and Hong Kong. It is not ISO/IEC\\~2022 compliant." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "JIS X 0208" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"JIS X 0208 is a Japanese national standard character set. Though there are " +"some more Japanese national standard character sets (like JIS X 0201, JIS X " +"0212, and JIS X 0213), this is the most important one. Characters are " +"mapped into a 94x94 two-byte matrix, whose each byte is in the range " +"0x21\\[en]0x7e. Note that JIS X 0208 is a character set, not an encoding. " +"This means that JIS X 0208 itself is not used for expressing text data. JIS " +"X 0208 is used as a component to construct encodings such as EUC-JP, " +"Shift_JIS, and ISO/IEC\\~2022-JP. EUC-JP is the most important encoding for " +"Linux and includes ASCII and JIS X 0208. In EUC-JP, JIS X 0208 characters " +"are expressed in two bytes, each of which is the JIS X 0208 code plus 0x80." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "KS X 1001" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"KS X 1001 is a Korean national standard character set. Just as JIS X 0208, " +"characters are mapped into a 94x94 two-byte matrix. KS X 1001 is used like " +"JIS X 0208, as a component to construct encodings such as EUC-KR, Johab, and " +"ISO/IEC\\~2022-KR. EUC-KR is the most important encoding for Linux and " +"includes ASCII and KS X 1001. KS C 5601 is an older name for KS X 1001." +msgstr "" + +#. type: SS +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +#, no-wrap +msgid "ISO/IEC\\~2022 and ISO/IEC\\~4873" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"The ISO/IEC\\~2022 and ISO/IEC\\~4873 standards describe a font-control " +"model based on VT100 practice. This model is (partially) supported by the " +"Linux kernel and by B<xterm>(1). Several ISO/IEC\\~2022-based character " +"encodings have been defined, especially for Japanese." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"There are 4 graphic character sets, called G0, G1, G2, and G3, and one of " +"them is the current character set for codes with high bit zero (initially " +"G0), and one of them is the current character set for codes with high bit " +"one (initially G1). Each graphic character set has 94 or 96 characters, and " +"is essentially a 7-bit character set. It uses codes either 040\\[en]0177 " +"(041\\[en]0176) or 0240\\[en]0377 (0241\\[en]0376). G0 always has size 94 " +"and uses codes 041\\[en]0176." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Switching between character sets is done using the shift functions " +"B<\\[ha]N> (SO or LS1), B<\\[ha]O> (SI or LS0), ESC n (LS2), ESC o (LS3), " +"ESC N (SS2), ESC O (SS3), ESC \\[ti] (LS1R), ESC } (LS2R), ESC | (LS3R). " +"The function LSI<n> makes character set GI<n> the current one for codes with " +"high bit zero. The function LSI<n>R makes character set GI<n> the current " +"one for codes with high bit one. The function SSI<n> makes character set " +"GI<n> (I<n>=2 or 3) the current one for the next character only (regardless " +"of the value of its high order bit)." +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"A 94-character set is designated as GI<n> character set by an escape " +"sequence ESC ( xx (for G0), ESC ) xx (for G1), ESC * xx (for G2), ESC + xx " +"(for G3), where xx is a symbol or a pair of symbols found in the ISO/" +"IEC\\~2375 International Register of Coded Character Sets. For example, ESC " +"( @ selects the ISO/IEC\\~646 character set as G0, ESC ( A selects the UK " +"standard character set (with pound instead of number sign), ESC ( B selects " +"ASCII (with dollar instead of currency sign), ESC ( M selects a character " +"set for African languages, ESC ( ! A selects the Cuban character set, and so " +"on." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"A 96-character set is designated as GI<n> character set by an escape " +"sequence ESC - xx (for G1), ESC . xx (for G2) or ESC / xx (for G3). For " +"example, ESC - G selects the Hebrew alphabet as G1." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"A multibyte character set is designated as GI<n> character set by an escape " +"sequence ESC $ xx or ESC $ ( xx (for G0), ESC $ ) xx (for G1), ESC $ * xx " +"(for G2), ESC $ + xx (for G3). For example, ESC $ ( C selects the Korean " +"character set for G0. The Japanese character set selected by ESC $ B has a " +"more recent version selected by ESC & @ ESC $ B." +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"ISO/IEC\\~4873 stipulates a narrower use of character sets, where G0 is " +"fixed (always ASCII), so that G1, G2, and G3 can be invoked only for codes " +"with the high order bit set. In particular, B<\\[ha]N> and B<\\[ha]O> are " +"not used anymore, ESC ( xx can be used only with xx=B, and ESC ) xx, ESC * " +"xx, ESC + xx are equivalent to ESC - xx, ESC . xx, ESC / xx, respectively." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "TIS-620" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"TIS-620 is a Thai national standard character set and a superset of ASCII. " +"In the same fashion as the ISO/IEC\\~8859 series, Thai characters are mapped " +"into 0xa1\\[en]0xfe." +msgstr "" + +#. type: SS +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "Unicode" +msgstr "" + +#. type: Plain text +#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron +#: opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Unicode (ISO/IEC 10646) is a standard which aims to unambiguously represent " +"every character in every human language. Unicode's structure permits 20.1 " +"bits to encode every character. Since most computers don't include 20.1-bit " +"integers, Unicode is usually encoded as 32-bit integers internally and " +"either a series of 16-bit integers (UTF-16) (needing two 16-bit integers " +"only when encoding certain rare characters) or a series of 8-bit bytes " +"(UTF-8)." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Linux represents Unicode using the 8-bit Unicode Transformation Format " +"(UTF-8). UTF-8 is a variable length encoding of Unicode. It uses 1 byte to " +"code 7 bits, 2 bytes for 11 bits, 3 bytes for 16 bits, 4 bytes for 21 bits, " +"5 bytes for 26 bits, 6 bytes for 31 bits." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Let 0,1,x stand for a zero, one, or arbitrary bit. A byte 0xxxxxxx stands " +"for the Unicode 00000000 0xxxxxxx which codes the same symbol as the ASCII " +"0xxxxxxx. Thus, ASCII goes unchanged into UTF-8, and people using only " +"ASCII do not notice any change: not in code, and not in file size." +msgstr "" + +#. type: Plain text +#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron +#: opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"A byte 110xxxxx is the start of a 2-byte code, and 110xxxxx 10yyyyyy is " +"assembled into 00000xxx xxyyyyyy. A byte 1110xxxx is the start of a 3-byte " +"code, and 1110xxxx 10yyyyyy 10zzzzzz is assembled into xxxxyyyy yyzzzzzz. " +"(When UTF-8 is used to code the 31-bit ISO/IEC 10646 then this progression " +"continues up to 6-byte codes.)" +msgstr "" + +#. type: Plain text +#: archlinux fedora-40 fedora-rawhide mageia-cauldron +msgid "" +"For most texts in ISO/IEC\\~8859 character sets, this means that the " +"characters outside of ASCII are now coded with two bytes. This tends to " +"expand ordinary text files by only one or two percent. For Russian or Greek " +"texts, this expands ordinary text files by 100%, since text in those " +"languages is mostly outside of ASCII. For Japanese users this means that " +"the 16-bit codes now in common use will take three bytes. While there are " +"algorithmic conversions from some character sets (especially ISO/" +"IEC\\~8859-1) to Unicode, general conversion requires carrying around " +"conversion tables, which can be quite large for 16-bit codes." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Note that UTF-8 is self-synchronizing: 10xxxxxx is a tail, any other byte is " +"the head of a code. Note that the only way ASCII bytes occur in a UTF-8 " +"stream, is as themselves. In particular, there are no embedded NULs " +"(\\[aq]\\e0\\[aq]) or \\[aq]/\\[aq]s that form part of some larger code." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Since ASCII, and, in particular, NUL and \\[aq]/\\[aq], are unchanged, the " +"kernel does not notice that UTF-8 is being used. It does not care at all " +"what the bytes it is handling stand for." +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Rendering of Unicode data streams is typically handled through \"subfont\" " +"tables which map a subset of Unicode to glyphs. Internally the kernel uses " +"Unicode to describe the subfont loaded in video RAM. This means that in the " +"Linux console in UTF-8 mode, one can use a character set with 512 different " +"symbols. This is not enough for Japanese, Chinese, and Korean, but it is " +"enough for most other purposes." +msgstr "" + +#. type: SH +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "SEE ALSO" +msgstr "" + +#. type: Plain text +#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide +#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed +msgid "B<iconv>(1), B<ascii>(7), B<iso_8859-1>(7), B<unicode>(7), B<utf-8>(7)" +msgstr "" + +#. type: TH +#: debian-bookworm +#, no-wrap +msgid "2023-02-05" +msgstr "" + +#. type: TH +#: debian-bookworm +#, no-wrap +msgid "Linux man-pages 6.03" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Standards discussed include such as ASCII, GB 2312, ISO 8859, JIS, KOI8-R, " +"KS, and Unicode." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"ASCII (American Standard Code For Information Interchange) is the original 7-" +"bit character set, originally designed for American English. Also known as " +"US-ASCII. It is currently described by the ISO 646:1991 IRV (International " +"Reference Version) standard." +msgstr "" + +#. type: SS +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "ISO 8859" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"ISO 8859 is a series of 15 8-bit character sets, all of which have ASCII in " +"their low (7-bit) half, invisible control characters in positions 128 to " +"159, and 96 fixed-width graphics in positions 160\\[en]255." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Of these, the most important is ISO 8859-1 (\"Latin Alphabet No. 1\" / " +"Latin-1). It was widely adopted and supported by different systems, and is " +"gradually being replaced with Unicode. The ISO 8859-1 characters are also " +"the first 256 characters of Unicode." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Console support for the other 8859 character sets is available under Linux " +"through user-mode utilities (such as B<setfont>(8)) that modify keyboard " +"bindings and the EGA graphics table and employ the \"user mapping\" font " +"table in the console driver." +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-1 (Latin-1)" +msgstr "" + +#. type: Plain text +#: debian-bookworm +msgid "" +"Latin-1 covers many European languages such as Albanian, Basque, Danish, " +"English, Faroese, Galician, Icelandic, Irish, Italian, Norwegian, " +"Portuguese, Spanish, and Swedish. The lack of the ligatures Dutch IJ/ij, " +"French œ, and old-style „German“ quotation marks was considered tolerable." +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-2 (Latin-2)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-3 (Latin-3)" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-3 was designed to cover of Esperanto, Maltese, and Turkish, but 8859-9 " +"later superseded it for Turkish." +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-4 (Latin-4)" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Latin-4 introduced letters for North European languages such as Estonian, " +"Latvian, and Lithuanian, but was superseded by 8859-10 and 8859-13." +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-5" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-6" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Was created for Arabic. The 8859-6 glyph table is a fixed font of separate " +"letter forms, but a proper display engine should combine these using the " +"proper initial, medial, and final forms." +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-7" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-8" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-9 (Latin-5)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-10 (Latin-6)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-11" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-12" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-13 (Latin-7)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-14 (Latin-8)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-15 (Latin-9)" +msgstr "" + +#. type: TP +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "8859-16 (Latin-10)" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"KOI8-R is a non-ISO character set popular in Russia before Unicode. The " +"lower half is ASCII; the upper is a Cyrillic character set somewhat better " +"designed than ISO 8859-5. KOI8-U, based on KOI8-R, has better support for " +"Ukrainian. Neither of these sets are ISO-2022 compatible, unlike the ISO " +"8859 series." +msgstr "" + +#. Thanks to Tomohiro KUBOTA for the following sections about +#. national standards. +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"Big5 was a popular character set in Taiwan to express traditional Chinese. " +"(Big5 is both a character set and an encoding.) It is a superset of ASCII. " +"Non-ASCII characters are expressed in two bytes. Bytes 0xa1\\[en]0xfe are " +"used as leading bytes for two-byte characters. Big5 and its extension were " +"widely used in Taiwan and Hong Kong. It is not ISO 2022 compliant." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"JIS X 0208 is a Japanese national standard character set. Though there are " +"some more Japanese national standard character sets (like JIS X 0201, JIS X " +"0212, and JIS X 0213), this is the most important one. Characters are " +"mapped into a 94x94 two-byte matrix, whose each byte is in the range " +"0x21\\[en]0x7e. Note that JIS X 0208 is a character set, not an encoding. " +"This means that JIS X 0208 itself is not used for expressing text data. JIS " +"X 0208 is used as a component to construct encodings such as EUC-JP, " +"Shift_JIS, and ISO-2022-JP. EUC-JP is the most important encoding for Linux " +"and includes ASCII and JIS X 0208. In EUC-JP, JIS X 0208 characters are " +"expressed in two bytes, each of which is the JIS X 0208 code plus 0x80." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"KS X 1001 is a Korean national standard character set. Just as JIS X 0208, " +"characters are mapped into a 94x94 two-byte matrix. KS X 1001 is used like " +"JIS X 0208, as a component to construct encodings such as EUC-KR, Johab, and " +"ISO-2022-KR. EUC-KR is the most important encoding for Linux and includes " +"ASCII and KS X 1001. KS C 5601 is an older name for KS X 1001." +msgstr "" + +#. type: SS +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "ISO 2022 and ISO 4873" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"The ISO 2022 and 4873 standards describe a font-control model based on VT100 " +"practice. This model is (partially) supported by the Linux kernel and by " +"B<xterm>(1). Several ISO 2022-based character encodings have been defined, " +"especially for Japanese." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"A 94-character set is designated as GI<n> character set by an escape " +"sequence ESC ( xx (for G0), ESC ) xx (for G1), ESC * xx (for G2), ESC + xx " +"(for G3), where xx is a symbol or a pair of symbols found in the ISO 2375 " +"International Register of Coded Character Sets. For example, ESC ( @ " +"selects the ISO 646 character set as G0, ESC ( A selects the UK standard " +"character set (with pound instead of number sign), ESC ( B selects ASCII " +"(with dollar instead of currency sign), ESC ( M selects a character set for " +"African languages, ESC ( ! A selects the Cuban character set, and so on." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"ISO 4873 stipulates a narrower use of character sets, where G0 is fixed " +"(always ASCII), so that G1, G2, and G3 can be invoked only for codes with " +"the high order bit set. In particular, B<\\[ha]N> and B<\\[ha]O> are not " +"used anymore, ESC ( xx can be used only with xx=B, and ESC ) xx, ESC * xx, " +"ESC + xx are equivalent to ESC - xx, ESC . xx, ESC / xx, respectively." +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"TIS-620 is a Thai national standard character set and a superset of ASCII. " +"In the same fashion as the ISO 8859 series, Thai characters are mapped into " +"0xa1\\[en]0xfe." +msgstr "" + +#. type: Plain text +#: debian-bookworm +msgid "" +"Unicode (ISO 10646) is a standard which aims to unambiguously represent " +"every character in every human language. Unicode's structure permits 20.1 " +"bits to encode every character. Since most computers don't include 20.1-bit " +"integers, Unicode is usually encoded as 32-bit integers internally and " +"either a series of 16-bit integers (UTF-16) (needing two 16-bit integers " +"only when encoding certain rare characters) or a series of 8-bit bytes " +"(UTF-8)." +msgstr "" + +#. type: Plain text +#: debian-bookworm +msgid "" +"A byte 110xxxxx is the start of a 2-byte code, and 110xxxxx 10yyyyyy is " +"assembled into 00000xxx xxyyyyyy. A byte 1110xxxx is the start of a 3-byte " +"code, and 1110xxxx 10yyyyyy 10zzzzzz is assembled into xxxxyyyy yyzzzzzz. " +"(When UTF-8 is used to code the 31-bit ISO 10646 then this progression " +"continues up to 6-byte codes.)" +msgstr "" + +#. type: Plain text +#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +msgid "" +"For most texts in ISO 8859 character sets, this means that the characters " +"outside of ASCII are now coded with two bytes. This tends to expand " +"ordinary text files by only one or two percent. For Russian or Greek texts, " +"this expands ordinary text files by 100%, since text in those languages is " +"mostly outside of ASCII. For Japanese users this means that the 16-bit " +"codes now in common use will take three bytes. While there are algorithmic " +"conversions from some character sets (especially ISO 8859-1) to Unicode, " +"general conversion requires carrying around conversion tables, which can be " +"quite large for 16-bit codes." +msgstr "" + +#. type: TH +#: debian-unstable opensuse-leap-15-6 opensuse-tumbleweed +#, no-wrap +msgid "2023-03-12" +msgstr "" + +#. type: TH +#: debian-unstable opensuse-tumbleweed +#, no-wrap +msgid "Linux man-pages 6.05.01" +msgstr "" + +#. type: TH +#: opensuse-leap-15-6 +#, no-wrap +msgid "Linux man-pages 6.04" +msgstr "" |