summaryrefslogtreecommitdiffstats
path: root/templates/man7/charsets.7.pot
diff options
context:
space:
mode:
authorDaniel Baumann <daniel.baumann@progress-linux.org>2024-04-15 19:43:11 +0000
committerDaniel Baumann <daniel.baumann@progress-linux.org>2024-04-15 19:43:11 +0000
commitfc22b3d6507c6745911b9dfcc68f1e665ae13dbc (patch)
treece1e3bce06471410239a6f41282e328770aa404a /templates/man7/charsets.7.pot
parentInitial commit. (diff)
downloadmanpages-l10n-fc22b3d6507c6745911b9dfcc68f1e665ae13dbc.tar.xz
manpages-l10n-fc22b3d6507c6745911b9dfcc68f1e665ae13dbc.zip
Adding upstream version 4.22.0.upstream/4.22.0
Signed-off-by: Daniel Baumann <daniel.baumann@progress-linux.org>
Diffstat (limited to 'templates/man7/charsets.7.pot')
-rw-r--r--templates/man7/charsets.7.pot1024
1 files changed, 1024 insertions, 0 deletions
diff --git a/templates/man7/charsets.7.pot b/templates/man7/charsets.7.pot
new file mode 100644
index 00000000..3de69690
--- /dev/null
+++ b/templates/man7/charsets.7.pot
@@ -0,0 +1,1024 @@
+# SOME DESCRIPTIVE TITLE
+# Copyright (C) YEAR Free Software Foundation, Inc.
+# This file is distributed under the same license as the PACKAGE package.
+# FIRST AUTHOR <EMAIL@ADDRESS>, YEAR.
+#
+#, fuzzy
+msgid ""
+msgstr ""
+"Project-Id-Version: PACKAGE VERSION\n"
+"POT-Creation-Date: 2024-03-01 16:53+0100\n"
+"PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n"
+"Last-Translator: FULL NAME <EMAIL@ADDRESS>\n"
+"Language-Team: LANGUAGE <LL@li.org>\n"
+"Language: \n"
+"MIME-Version: 1.0\n"
+"Content-Type: text/plain; charset=UTF-8\n"
+"Content-Transfer-Encoding: 8bit\n"
+
+#. type: TH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "charsets"
+msgstr ""
+
+#. type: TH
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "2024-01-28"
+msgstr ""
+
+#. type: TH
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "Linux man-pages 6.06"
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "NAME"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "charsets - character set standards and internationalization"
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "DESCRIPTION"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"This manual page gives an overview on different character set standards and "
+"how they were used on Linux before Unicode became ubiquitous. Some of this "
+"information is still helpful for people working with legacy systems and "
+"documents."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Standards discussed include such as ASCII, GB 2312, ISO/IEC\\~8859, JIS, "
+"KOI8-R, KS, and Unicode."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The primary emphasis is on character sets that were actually used by locale "
+"character sets, not the myriad others that could be found in data from other "
+"systems."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "ASCII"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"ASCII (American Standard Code For Information Interchange) is the original 7-"
+"bit character set, originally designed for American English. Also known as "
+"US-ASCII. It is currently described by the ISO/IEC\\~646:1991 IRV "
+"(International Reference Version) standard."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Various ASCII variants replacing the dollar sign with other currency symbols "
+"and replacing punctuation with non-English alphabetic characters to cover "
+"German, French, Spanish, and others in 7 bits emerged. All are deprecated; "
+"glibc does not support locales whose character sets are not true supersets "
+"of ASCII."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"As Unicode, when using UTF-8, is ASCII-compatible, plain ASCII text still "
+"renders properly on modern UTF-8 using systems."
+msgstr ""
+
+#. type: SS
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"ISO/IEC\\~8859 is a series of 15 8-bit character sets, all of which have "
+"ASCII in their low (7-bit) half, invisible control characters in positions "
+"128 to 159, and 96 fixed-width graphics in positions 160\\[en]255."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Of these, the most important is ISO/IEC\\~8859-1 (\"Latin Alphabet No. 1\" / "
+"Latin-1). It was widely adopted and supported by different systems, and is "
+"gradually being replaced with Unicode. The ISO/IEC\\~8859-1 characters are "
+"also the first 256 characters of Unicode."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Console support for the other ISO/IEC\\~8859 character sets is available "
+"under Linux through user-mode utilities (such as B<setfont>(8)) that modify "
+"keyboard bindings and the EGA graphics table and employ the \"user mapping\" "
+"font table in the console driver."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "Here are brief descriptions of each character set:"
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-1 (Latin-1)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron
+#: opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-1 covers many European languages such as Albanian, Basque, Danish, "
+"English, Faroese, Galician, Icelandic, Irish, Italian, Norwegian, "
+"Portuguese, Spanish, and Swedish. The lack of the ligatures Dutch IJ/ij, "
+"French œ, and „German“ quotation marks was considered tolerable."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-2 (Latin-2)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-2 supports many Latin-written Central and East European languages such "
+"as Bosnian, Croatian, Czech, German, Hungarian, Polish, Slovak, and "
+"Slovene. Replacing Romanian ș/ț with ş/ţ was considered tolerable."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-3 (Latin-3)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Latin-3 was designed to cover of Esperanto, Maltese, and Turkish, but ISO/"
+"IEC\\~8859-9 later superseded it for Turkish."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-4 (Latin-4)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Latin-4 introduced letters for North European languages such as Estonian, "
+"Latvian, and Lithuanian, but was superseded by ISO/IEC\\~8859-10 and ISO/"
+"IEC\\~8859-13."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-5"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Cyrillic letters supporting Bulgarian, Byelorussian, Macedonian, Russian, "
+"Serbian, and (almost completely) Ukrainian. It was never widely used, see "
+"the discussion of KOI8-R/KOI8-U below."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-6"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Was created for Arabic. The ISO/IEC\\~8859-6 glyph table is a fixed font of "
+"separate letter forms, but a proper display engine should combine these "
+"using the proper initial, medial, and final forms."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-7"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "Was created for Modern Greek in 1987, updated in 2003."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-8"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Supports Modern Hebrew without niqud (punctuation signs). Niqud and full-"
+"fledged Biblical Hebrew were outside the scope of this character set."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-9 (Latin-5)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"This is a variant of Latin-1 that replaces Icelandic letters with Turkish "
+"ones."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-10 (Latin-6)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-6 added the Inuit (Greenlandic) and Sami (Lappish) letters that were "
+"missing in Latin-4 to cover the entire Nordic area."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-11"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Supports the Thai alphabet and is nearly identical to the TIS-620 standard."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-12"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "This character set does not exist."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-13 (Latin-7)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Supports the Baltic Rim languages; in particular, it includes Latvian "
+"characters not found in Latin-4."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-14 (Latin-8)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"This is the Celtic character set, covering Old Irish, Manx, Gaelic, Welsh, "
+"Cornish, and Breton."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-15 (Latin-9)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-9 is similar to the widely used Latin-1 but replaces some less common "
+"symbols with the Euro sign and French and Finnish letters that were missing "
+"in Latin-1."
+msgstr ""
+
+#. type: TP
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~8859-16 (Latin-10)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"This character set covers many Southeast European languages, and most "
+"importantly supports Romanian more completely than Latin-2."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "KOI8-R / KOI8-U"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"KOI8-R is a non-ISO character set popular in Russia before Unicode. The "
+"lower half is ASCII; the upper is a Cyrillic character set somewhat better "
+"designed than ISO/IEC\\~8859-5. KOI8-U, based on KOI8-R, has better support "
+"for Ukrainian. Neither of these sets are ISO/IEC\\~2022 compatible, unlike "
+"the ISO/IEC\\~8859 series."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Console support for KOI8-R is available under Linux through user-mode "
+"utilities that modify keyboard bindings and the EGA graphics table, and "
+"employ the \"user mapping\" font table in the console driver."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "GB 2312"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"GB 2312 is a mainland Chinese national standard character set used to "
+"express simplified Chinese. Just like JIS X 0208, characters are mapped "
+"into a 94x94 two-byte matrix used to construct EUC-CN. EUC-CN is the most "
+"important encoding for Linux and includes ASCII and GB 2312. Note that EUC-"
+"CN is often called as GB, GB 2312, or CN-GB."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Big5"
+msgstr ""
+
+#. Thanks to Tomohiro KUBOTA for the following sections about
+#. national standards.
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Big5 was a popular character set in Taiwan to express traditional Chinese. "
+"(Big5 is both a character set and an encoding.) It is a superset of ASCII. "
+"Non-ASCII characters are expressed in two bytes. Bytes 0xa1\\[en]0xfe are "
+"used as leading bytes for two-byte characters. Big5 and its extension were "
+"widely used in Taiwan and Hong Kong. It is not ISO/IEC\\~2022 compliant."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "JIS X 0208"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"JIS X 0208 is a Japanese national standard character set. Though there are "
+"some more Japanese national standard character sets (like JIS X 0201, JIS X "
+"0212, and JIS X 0213), this is the most important one. Characters are "
+"mapped into a 94x94 two-byte matrix, whose each byte is in the range "
+"0x21\\[en]0x7e. Note that JIS X 0208 is a character set, not an encoding. "
+"This means that JIS X 0208 itself is not used for expressing text data. JIS "
+"X 0208 is used as a component to construct encodings such as EUC-JP, "
+"Shift_JIS, and ISO/IEC\\~2022-JP. EUC-JP is the most important encoding for "
+"Linux and includes ASCII and JIS X 0208. In EUC-JP, JIS X 0208 characters "
+"are expressed in two bytes, each of which is the JIS X 0208 code plus 0x80."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "KS X 1001"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"KS X 1001 is a Korean national standard character set. Just as JIS X 0208, "
+"characters are mapped into a 94x94 two-byte matrix. KS X 1001 is used like "
+"JIS X 0208, as a component to construct encodings such as EUC-KR, Johab, and "
+"ISO/IEC\\~2022-KR. EUC-KR is the most important encoding for Linux and "
+"includes ASCII and KS X 1001. KS C 5601 is an older name for KS X 1001."
+msgstr ""
+
+#. type: SS
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "ISO/IEC\\~2022 and ISO/IEC\\~4873"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"The ISO/IEC\\~2022 and ISO/IEC\\~4873 standards describe a font-control "
+"model based on VT100 practice. This model is (partially) supported by the "
+"Linux kernel and by B<xterm>(1). Several ISO/IEC\\~2022-based character "
+"encodings have been defined, especially for Japanese."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"There are 4 graphic character sets, called G0, G1, G2, and G3, and one of "
+"them is the current character set for codes with high bit zero (initially "
+"G0), and one of them is the current character set for codes with high bit "
+"one (initially G1). Each graphic character set has 94 or 96 characters, and "
+"is essentially a 7-bit character set. It uses codes either 040\\[en]0177 "
+"(041\\[en]0176) or 0240\\[en]0377 (0241\\[en]0376). G0 always has size 94 "
+"and uses codes 041\\[en]0176."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Switching between character sets is done using the shift functions "
+"B<\\[ha]N> (SO or LS1), B<\\[ha]O> (SI or LS0), ESC n (LS2), ESC o (LS3), "
+"ESC N (SS2), ESC O (SS3), ESC \\[ti] (LS1R), ESC } (LS2R), ESC | (LS3R). "
+"The function LSI<n> makes character set GI<n> the current one for codes with "
+"high bit zero. The function LSI<n>R makes character set GI<n> the current "
+"one for codes with high bit one. The function SSI<n> makes character set "
+"GI<n> (I<n>=2 or 3) the current one for the next character only (regardless "
+"of the value of its high order bit)."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"A 94-character set is designated as GI<n> character set by an escape "
+"sequence ESC ( xx (for G0), ESC ) xx (for G1), ESC * xx (for G2), ESC + xx "
+"(for G3), where xx is a symbol or a pair of symbols found in the ISO/"
+"IEC\\~2375 International Register of Coded Character Sets. For example, ESC "
+"( @ selects the ISO/IEC\\~646 character set as G0, ESC ( A selects the UK "
+"standard character set (with pound instead of number sign), ESC ( B selects "
+"ASCII (with dollar instead of currency sign), ESC ( M selects a character "
+"set for African languages, ESC ( ! A selects the Cuban character set, and so "
+"on."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"A 96-character set is designated as GI<n> character set by an escape "
+"sequence ESC - xx (for G1), ESC . xx (for G2) or ESC / xx (for G3). For "
+"example, ESC - G selects the Hebrew alphabet as G1."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"A multibyte character set is designated as GI<n> character set by an escape "
+"sequence ESC $ xx or ESC $ ( xx (for G0), ESC $ ) xx (for G1), ESC $ * xx "
+"(for G2), ESC $ + xx (for G3). For example, ESC $ ( C selects the Korean "
+"character set for G0. The Japanese character set selected by ESC $ B has a "
+"more recent version selected by ESC & @ ESC $ B."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"ISO/IEC\\~4873 stipulates a narrower use of character sets, where G0 is "
+"fixed (always ASCII), so that G1, G2, and G3 can be invoked only for codes "
+"with the high order bit set. In particular, B<\\[ha]N> and B<\\[ha]O> are "
+"not used anymore, ESC ( xx can be used only with xx=B, and ESC ) xx, ESC * "
+"xx, ESC + xx are equivalent to ESC - xx, ESC . xx, ESC / xx, respectively."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "TIS-620"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"TIS-620 is a Thai national standard character set and a superset of ASCII. "
+"In the same fashion as the ISO/IEC\\~8859 series, Thai characters are mapped "
+"into 0xa1\\[en]0xfe."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Unicode"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron
+#: opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Unicode (ISO/IEC 10646) is a standard which aims to unambiguously represent "
+"every character in every human language. Unicode's structure permits 20.1 "
+"bits to encode every character. Since most computers don't include 20.1-bit "
+"integers, Unicode is usually encoded as 32-bit integers internally and "
+"either a series of 16-bit integers (UTF-16) (needing two 16-bit integers "
+"only when encoding certain rare characters) or a series of 8-bit bytes "
+"(UTF-8)."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Linux represents Unicode using the 8-bit Unicode Transformation Format "
+"(UTF-8). UTF-8 is a variable length encoding of Unicode. It uses 1 byte to "
+"code 7 bits, 2 bytes for 11 bits, 3 bytes for 16 bits, 4 bytes for 21 bits, "
+"5 bytes for 26 bits, 6 bytes for 31 bits."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Let 0,1,x stand for a zero, one, or arbitrary bit. A byte 0xxxxxxx stands "
+"for the Unicode 00000000 0xxxxxxx which codes the same symbol as the ASCII "
+"0xxxxxxx. Thus, ASCII goes unchanged into UTF-8, and people using only "
+"ASCII do not notice any change: not in code, and not in file size."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron
+#: opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"A byte 110xxxxx is the start of a 2-byte code, and 110xxxxx 10yyyyyy is "
+"assembled into 00000xxx xxyyyyyy. A byte 1110xxxx is the start of a 3-byte "
+"code, and 1110xxxx 10yyyyyy 10zzzzzz is assembled into xxxxyyyy yyzzzzzz. "
+"(When UTF-8 is used to code the 31-bit ISO/IEC 10646 then this progression "
+"continues up to 6-byte codes.)"
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"For most texts in ISO/IEC\\~8859 character sets, this means that the "
+"characters outside of ASCII are now coded with two bytes. This tends to "
+"expand ordinary text files by only one or two percent. For Russian or Greek "
+"texts, this expands ordinary text files by 100%, since text in those "
+"languages is mostly outside of ASCII. For Japanese users this means that "
+"the 16-bit codes now in common use will take three bytes. While there are "
+"algorithmic conversions from some character sets (especially ISO/"
+"IEC\\~8859-1) to Unicode, general conversion requires carrying around "
+"conversion tables, which can be quite large for 16-bit codes."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Note that UTF-8 is self-synchronizing: 10xxxxxx is a tail, any other byte is "
+"the head of a code. Note that the only way ASCII bytes occur in a UTF-8 "
+"stream, is as themselves. In particular, there are no embedded NULs "
+"(\\[aq]\\e0\\[aq]) or \\[aq]/\\[aq]s that form part of some larger code."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Since ASCII, and, in particular, NUL and \\[aq]/\\[aq], are unchanged, the "
+"kernel does not notice that UTF-8 is being used. It does not care at all "
+"what the bytes it is handling stand for."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Rendering of Unicode data streams is typically handled through \"subfont\" "
+"tables which map a subset of Unicode to glyphs. Internally the kernel uses "
+"Unicode to describe the subfont loaded in video RAM. This means that in the "
+"Linux console in UTF-8 mode, one can use a character set with 512 different "
+"symbols. This is not enough for Japanese, Chinese, and Korean, but it is "
+"enough for most other purposes."
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "SEE ALSO"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "B<iconv>(1), B<ascii>(7), B<iso_8859-1>(7), B<unicode>(7), B<utf-8>(7)"
+msgstr ""
+
+#. type: TH
+#: debian-bookworm
+#, no-wrap
+msgid "2023-02-05"
+msgstr ""
+
+#. type: TH
+#: debian-bookworm
+#, no-wrap
+msgid "Linux man-pages 6.03"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Standards discussed include such as ASCII, GB 2312, ISO 8859, JIS, KOI8-R, "
+"KS, and Unicode."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"ASCII (American Standard Code For Information Interchange) is the original 7-"
+"bit character set, originally designed for American English. Also known as "
+"US-ASCII. It is currently described by the ISO 646:1991 IRV (International "
+"Reference Version) standard."
+msgstr ""
+
+#. type: SS
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "ISO 8859"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"ISO 8859 is a series of 15 8-bit character sets, all of which have ASCII in "
+"their low (7-bit) half, invisible control characters in positions 128 to "
+"159, and 96 fixed-width graphics in positions 160\\[en]255."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Of these, the most important is ISO 8859-1 (\"Latin Alphabet No. 1\" / "
+"Latin-1). It was widely adopted and supported by different systems, and is "
+"gradually being replaced with Unicode. The ISO 8859-1 characters are also "
+"the first 256 characters of Unicode."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Console support for the other 8859 character sets is available under Linux "
+"through user-mode utilities (such as B<setfont>(8)) that modify keyboard "
+"bindings and the EGA graphics table and employ the \"user mapping\" font "
+"table in the console driver."
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-1 (Latin-1)"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm
+msgid ""
+"Latin-1 covers many European languages such as Albanian, Basque, Danish, "
+"English, Faroese, Galician, Icelandic, Irish, Italian, Norwegian, "
+"Portuguese, Spanish, and Swedish. The lack of the ligatures Dutch IJ/ij, "
+"French œ, and old-style „German“ quotation marks was considered tolerable."
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-2 (Latin-2)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-3 (Latin-3)"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-3 was designed to cover of Esperanto, Maltese, and Turkish, but 8859-9 "
+"later superseded it for Turkish."
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-4 (Latin-4)"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Latin-4 introduced letters for North European languages such as Estonian, "
+"Latvian, and Lithuanian, but was superseded by 8859-10 and 8859-13."
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-5"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-6"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Was created for Arabic. The 8859-6 glyph table is a fixed font of separate "
+"letter forms, but a proper display engine should combine these using the "
+"proper initial, medial, and final forms."
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-7"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-8"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-9 (Latin-5)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-10 (Latin-6)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-11"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-12"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-13 (Latin-7)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-14 (Latin-8)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-15 (Latin-9)"
+msgstr ""
+
+#. type: TP
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "8859-16 (Latin-10)"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"KOI8-R is a non-ISO character set popular in Russia before Unicode. The "
+"lower half is ASCII; the upper is a Cyrillic character set somewhat better "
+"designed than ISO 8859-5. KOI8-U, based on KOI8-R, has better support for "
+"Ukrainian. Neither of these sets are ISO-2022 compatible, unlike the ISO "
+"8859 series."
+msgstr ""
+
+#. Thanks to Tomohiro KUBOTA for the following sections about
+#. national standards.
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Big5 was a popular character set in Taiwan to express traditional Chinese. "
+"(Big5 is both a character set and an encoding.) It is a superset of ASCII. "
+"Non-ASCII characters are expressed in two bytes. Bytes 0xa1\\[en]0xfe are "
+"used as leading bytes for two-byte characters. Big5 and its extension were "
+"widely used in Taiwan and Hong Kong. It is not ISO 2022 compliant."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"JIS X 0208 is a Japanese national standard character set. Though there are "
+"some more Japanese national standard character sets (like JIS X 0201, JIS X "
+"0212, and JIS X 0213), this is the most important one. Characters are "
+"mapped into a 94x94 two-byte matrix, whose each byte is in the range "
+"0x21\\[en]0x7e. Note that JIS X 0208 is a character set, not an encoding. "
+"This means that JIS X 0208 itself is not used for expressing text data. JIS "
+"X 0208 is used as a component to construct encodings such as EUC-JP, "
+"Shift_JIS, and ISO-2022-JP. EUC-JP is the most important encoding for Linux "
+"and includes ASCII and JIS X 0208. In EUC-JP, JIS X 0208 characters are "
+"expressed in two bytes, each of which is the JIS X 0208 code plus 0x80."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"KS X 1001 is a Korean national standard character set. Just as JIS X 0208, "
+"characters are mapped into a 94x94 two-byte matrix. KS X 1001 is used like "
+"JIS X 0208, as a component to construct encodings such as EUC-KR, Johab, and "
+"ISO-2022-KR. EUC-KR is the most important encoding for Linux and includes "
+"ASCII and KS X 1001. KS C 5601 is an older name for KS X 1001."
+msgstr ""
+
+#. type: SS
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "ISO 2022 and ISO 4873"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The ISO 2022 and 4873 standards describe a font-control model based on VT100 "
+"practice. This model is (partially) supported by the Linux kernel and by "
+"B<xterm>(1). Several ISO 2022-based character encodings have been defined, "
+"especially for Japanese."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"A 94-character set is designated as GI<n> character set by an escape "
+"sequence ESC ( xx (for G0), ESC ) xx (for G1), ESC * xx (for G2), ESC + xx "
+"(for G3), where xx is a symbol or a pair of symbols found in the ISO 2375 "
+"International Register of Coded Character Sets. For example, ESC ( @ "
+"selects the ISO 646 character set as G0, ESC ( A selects the UK standard "
+"character set (with pound instead of number sign), ESC ( B selects ASCII "
+"(with dollar instead of currency sign), ESC ( M selects a character set for "
+"African languages, ESC ( ! A selects the Cuban character set, and so on."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"ISO 4873 stipulates a narrower use of character sets, where G0 is fixed "
+"(always ASCII), so that G1, G2, and G3 can be invoked only for codes with "
+"the high order bit set. In particular, B<\\[ha]N> and B<\\[ha]O> are not "
+"used anymore, ESC ( xx can be used only with xx=B, and ESC ) xx, ESC * xx, "
+"ESC + xx are equivalent to ESC - xx, ESC . xx, ESC / xx, respectively."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"TIS-620 is a Thai national standard character set and a superset of ASCII. "
+"In the same fashion as the ISO 8859 series, Thai characters are mapped into "
+"0xa1\\[en]0xfe."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm
+msgid ""
+"Unicode (ISO 10646) is a standard which aims to unambiguously represent "
+"every character in every human language. Unicode's structure permits 20.1 "
+"bits to encode every character. Since most computers don't include 20.1-bit "
+"integers, Unicode is usually encoded as 32-bit integers internally and "
+"either a series of 16-bit integers (UTF-16) (needing two 16-bit integers "
+"only when encoding certain rare characters) or a series of 8-bit bytes "
+"(UTF-8)."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm
+msgid ""
+"A byte 110xxxxx is the start of a 2-byte code, and 110xxxxx 10yyyyyy is "
+"assembled into 00000xxx xxyyyyyy. A byte 1110xxxx is the start of a 3-byte "
+"code, and 1110xxxx 10yyyyyy 10zzzzzz is assembled into xxxxyyyy yyzzzzzz. "
+"(When UTF-8 is used to code the 31-bit ISO 10646 then this progression "
+"continues up to 6-byte codes.)"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"For most texts in ISO 8859 character sets, this means that the characters "
+"outside of ASCII are now coded with two bytes. This tends to expand "
+"ordinary text files by only one or two percent. For Russian or Greek texts, "
+"this expands ordinary text files by 100%, since text in those languages is "
+"mostly outside of ASCII. For Japanese users this means that the 16-bit "
+"codes now in common use will take three bytes. While there are algorithmic "
+"conversions from some character sets (especially ISO 8859-1) to Unicode, "
+"general conversion requires carrying around conversion tables, which can be "
+"quite large for 16-bit codes."
+msgstr ""
+
+#. type: TH
+#: debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "2023-03-12"
+msgstr ""
+
+#. type: TH
+#: debian-unstable opensuse-tumbleweed
+#, no-wrap
+msgid "Linux man-pages 6.05.01"
+msgstr ""
+
+#. type: TH
+#: opensuse-leap-15-6
+#, no-wrap
+msgid "Linux man-pages 6.04"
+msgstr ""