summaryrefslogtreecommitdiffstats
path: root/templates/man7/utf-8.7.pot
diff options
context:
space:
mode:
Diffstat (limited to 'templates/man7/utf-8.7.pot')
-rw-r--r--templates/man7/utf-8.7.pot516
1 files changed, 516 insertions, 0 deletions
diff --git a/templates/man7/utf-8.7.pot b/templates/man7/utf-8.7.pot
new file mode 100644
index 00000000..f8123c53
--- /dev/null
+++ b/templates/man7/utf-8.7.pot
@@ -0,0 +1,516 @@
+# SOME DESCRIPTIVE TITLE
+# Copyright (C) YEAR Free Software Foundation, Inc.
+# This file is distributed under the same license as the PACKAGE package.
+# FIRST AUTHOR <EMAIL@ADDRESS>, YEAR.
+#
+#, fuzzy
+msgid ""
+msgstr ""
+"Project-Id-Version: PACKAGE VERSION\n"
+"POT-Creation-Date: 2024-03-01 17:13+0100\n"
+"PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n"
+"Last-Translator: FULL NAME <EMAIL@ADDRESS>\n"
+"Language-Team: LANGUAGE <LL@li.org>\n"
+"Language: \n"
+"MIME-Version: 1.0\n"
+"Content-Type: text/plain; charset=UTF-8\n"
+"Content-Transfer-Encoding: 8bit\n"
+
+#. type: TH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "UTF-8"
+msgstr ""
+
+#. type: TH
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "2024-01-28"
+msgstr ""
+
+#. type: TH
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+#, no-wrap
+msgid "Linux man-pages 6.06"
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "NAME"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "UTF-8 - an ASCII compatible multibyte Unicode encoding"
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "DESCRIPTION"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron
+#: opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The Unicode 3.0 character set occupies a 16-bit code space. The most "
+"obvious Unicode encoding (known as UCS-2) consists of a sequence of 16-bit "
+"words. Such strings can contain\\[em]as part of many 16-bit "
+"characters\\[em]bytes such as \\[aq]\\e0\\[aq] or \\[aq]/\\[aq], which have "
+"a special meaning in filenames and other C library function arguments. In "
+"addition, the majority of UNIX tools expect ASCII files and can't read 16-"
+"bit words as characters without major modifications. For these reasons, "
+"UCS-2 is not a suitable external encoding of Unicode in filenames, text "
+"files, environment variables, and so on. The ISO/IEC 10646 Universal "
+"Character Set (UCS), a superset of Unicode, occupies an even larger code "
+"space\\[em]31\\ bits\\[em]and the obvious UCS-4 encoding for it (a sequence "
+"of 32-bit words) has the same problems."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The UTF-8 encoding of Unicode and UCS does not have these problems and is "
+"the common way in which Unicode is used on UNIX-style operating systems."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Properties"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "The UTF-8 encoding has the following nice properties:"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "*"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"UCS characters 0x00000000 to 0x0000007f (the classic US-ASCII characters) "
+"are encoded simply as bytes 0x00 to 0x7f (ASCII compatibility). This means "
+"that files and strings which contain only 7-bit ASCII characters have the "
+"same encoding under both ASCII and UTF-8 ."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"All UCS characters greater than 0x7f are encoded as a multibyte sequence "
+"consisting only of bytes in the range 0x80 to 0xfd, so no ASCII byte can "
+"appear as part of another character and there are no problems with, for "
+"example, \\[aq]\\e0\\[aq] or \\[aq]/\\[aq]."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "The lexicographic sorting order of UCS-4 strings is preserved."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "All possible 2\\[ha]31 UCS codes can be encoded using UTF-8."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The bytes 0xc0, 0xc1, 0xfe, and 0xff are never used in the UTF-8 encoding."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The first byte of a multibyte sequence which represents a single non-ASCII "
+"UCS character is always in the range 0xc2 to 0xfd and indicates how long "
+"this multibyte sequence is. All further bytes in a multibyte sequence are "
+"in the range 0x80 to 0xbf. This allows easy resynchronization and makes the "
+"encoding stateless and robust against missing bytes."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"UTF-8 encoded UCS characters may be up to six bytes long, however the "
+"Unicode standard specifies no characters above 0x10ffff, so Unicode "
+"characters can be only up to four bytes long in UTF-8."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Encoding"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The following byte sequences are used to represent a character. The "
+"sequence to be used depends on the UCS code number of the character:"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x00000000 - 0x0000007F:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "0I<xxxxxxx>"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x00000080 - 0x000007FF:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "110I<xxxxx> 10I<xxxxxx>"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x00000800 - 0x0000FFFF:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "1110I<xxxx> 10I<xxxxxx> 10I<xxxxxx>"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x00010000 - 0x001FFFFF:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "11110I<xxx> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx>"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x00200000 - 0x03FFFFFF:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "111110I<xx> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx>"
+msgstr ""
+
+#. type: TP
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "0x04000000 - 0x7FFFFFFF:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "1111110I<x> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx> 10I<xxxxxx>"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The I<xxx> bit positions are filled with the bits of the character code "
+"number in binary representation, most significant bit first (big-endian). "
+"Only the shortest possible multibyte sequence which can represent the code "
+"number of the character can be used."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The UCS code values 0xd800\\[en]0xdfff (UTF-16 surrogates) as well as 0xfffe "
+"and 0xffff (UCS noncharacters) should not appear in conforming UTF-8 "
+"streams. According to RFC 3629 no point above U+10FFFF should be used, "
+"which limits characters to four bytes."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Example"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The Unicode character 0xa9 = 1010 1001 (the copyright sign) is encoded in "
+"UTF-8 as"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "11000010 10101001 = 0xc2 0xa9"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"and character 0x2260 = 0010 0010 0110 0000 (the \"not equal\" symbol) is "
+"encoded as:"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "11100010 10001001 10100000 = 0xe2 0x89 0xa0"
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Application notes"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "Users have to select a UTF-8 locale, for example with"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "export LANG=en_GB.UTF-8"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "in order to activate the UTF-8 support in applications."
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Application software that has to be aware of the used character encoding "
+"should always set the locale with for example"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "setlocale(LC_CTYPE, \"\")"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "and programmers can then test the expression"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "strcmp(nl_langinfo(CODESET), \"UTF-8\") == 0"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"to determine whether a UTF-8 locale has been selected and whether therefore "
+"all plaintext standard input and output, terminal communication, plaintext "
+"file content, filenames, and environment variables are encoded in UTF-8."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"Programmers accustomed to single-byte encodings such as US-ASCII or ISO/"
+"IEC\\~8859 have to be aware that two assumptions made so far are no longer "
+"valid in UTF-8 locales. Firstly, a single byte does not necessarily "
+"correspond any more to a single character. Secondly, since modern terminal "
+"emulators in UTF-8 mode also support Chinese, Japanese, and Korean double-"
+"width characters as well as nonspacing combining characters, outputting a "
+"single character does not necessarily advance the cursor by one position as "
+"it did in ASCII. Library functions such as B<mbsrtowcs>(3) and "
+"B<wcswidth>(3) should be used today to count characters and cursor "
+"positions."
+msgstr ""
+
+#. type: Plain text
+#: archlinux fedora-40 fedora-rawhide mageia-cauldron
+msgid ""
+"The official ESC sequence to switch from an ISO/IEC\\~2022 encoding scheme "
+"(as used for instance by VT100 terminals) to UTF-8 is ESC % G "
+"(\"\\ex1b%G\"). The corresponding return sequence from UTF-8 to ISO/"
+"IEC\\~2022 is ESC % @ (\"\\ex1b%@\"). Other ISO/IEC\\~2022 sequences (such "
+"as for switching the G0 and G1 sets) are not applicable in UTF-8 mode."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Security"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The Unicode and UCS standards require that producers of UTF-8 shall use the "
+"shortest form possible, for example, producing a two-byte sequence with "
+"first byte 0xc0 is nonconforming. Unicode 3.1 has added the requirement "
+"that conforming programs must not accept non-shortest forms in their input. "
+"This is for security reasons: if user input is checked for possible security "
+"violations, a program might check only for the ASCII version of \"/../\" or "
+"\";\" or NUL and overlook that there are many non-ASCII ways to represent "
+"these things in a non-shortest UTF-8 encoding."
+msgstr ""
+
+#. type: SS
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "Standards"
+msgstr ""
+
+#. .SH AUTHOR
+#. Markus Kuhn <mgk25@cl.cam.ac.uk>
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid "ISO/IEC 10646-1:2000, Unicode 3.1, RFC\\ 3629, Plan 9."
+msgstr ""
+
+#. type: SH
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "SEE ALSO"
+msgstr ""
+
+#. type: Plain text
+#: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide
+#: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"B<locale>(1), B<nl_langinfo>(3), B<setlocale>(3), B<charsets>(7), "
+"B<unicode>(7)"
+msgstr ""
+
+#. type: TH
+#: debian-bookworm
+#, no-wrap
+msgid "2023-02-10"
+msgstr ""
+
+#. type: TH
+#: debian-bookworm
+#, no-wrap
+msgid "Linux man-pages 6.03"
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm
+msgid ""
+"The Unicode 3.0 character set occupies a 16-bit code space. The most "
+"obvious Unicode encoding (known as UCS-2) consists of a sequence of 16-bit "
+"words. Such strings can contain\\[em]as part of many 16-bit "
+"characters\\[em]bytes such as \\[aq]\\e0\\[aq] or \\[aq]/\\[aq], which have "
+"a special meaning in filenames and other C library function arguments. In "
+"addition, the majority of UNIX tools expect ASCII files and can't read 16-"
+"bit words as characters without major modifications. For these reasons, "
+"UCS-2 is not a suitable external encoding of Unicode in filenames, text "
+"files, environment variables, and so on. The ISO 10646 Universal Character "
+"Set (UCS), a superset of Unicode, occupies an even larger code "
+"space\\[em]31\\ bits\\[em]and the obvious UCS-4 encoding for it (a sequence "
+"of 32-bit words) has the same problems."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"Programmers accustomed to single-byte encodings such as US-ASCII or ISO 8859 "
+"have to be aware that two assumptions made so far are no longer valid in "
+"UTF-8 locales. Firstly, a single byte does not necessarily correspond any "
+"more to a single character. Secondly, since modern terminal emulators in "
+"UTF-8 mode also support Chinese, Japanese, and Korean double-width "
+"characters as well as nonspacing combining characters, outputting a single "
+"character does not necessarily advance the cursor by one position as it did "
+"in ASCII. Library functions such as B<mbsrtowcs>(3) and B<wcswidth>(3) "
+"should be used today to count characters and cursor positions."
+msgstr ""
+
+#. type: Plain text
+#: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+msgid ""
+"The official ESC sequence to switch from an ISO 2022 encoding scheme (as "
+"used for instance by VT100 terminals) to UTF-8 is ESC % G (\"\\ex1b%G\"). "
+"The corresponding return sequence from UTF-8 to ISO 2022 is ESC % @ "
+"(\"\\ex1b%@\"). Other ISO 2022 sequences (such as for switching the G0 and "
+"G1 sets) are not applicable in UTF-8 mode."
+msgstr ""
+
+#. type: TH
+#: debian-unstable opensuse-leap-15-6 opensuse-tumbleweed
+#, no-wrap
+msgid "2023-03-12"
+msgstr ""
+
+#. type: TH
+#: debian-unstable opensuse-tumbleweed
+#, no-wrap
+msgid "Linux man-pages 6.05.01"
+msgstr ""
+
+#. type: TH
+#: opensuse-leap-15-6
+#, no-wrap
+msgid "Linux man-pages 6.04"
+msgstr ""