# SOME DESCRIPTIVE TITLE # Copyright (C) YEAR Free Software Foundation, Inc. # This file is distributed under the same license as the PACKAGE package. # FIRST AUTHOR , YEAR. # #, fuzzy msgid "" msgstr "" "Project-Id-Version: PACKAGE VERSION\n" "POT-Creation-Date: 2024-03-01 17:12+0100\n" "PO-Revision-Date: YEAR-MO-DA HO:MI+ZONE\n" "Last-Translator: FULL NAME \n" "Language-Team: LANGUAGE \n" "Language: \n" "MIME-Version: 1.0\n" "Content-Type: text/plain; charset=UTF-8\n" "Content-Transfer-Encoding: 8bit\n" #. type: TH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "unicode" msgstr "" #. type: TH #: archlinux fedora-40 fedora-rawhide mageia-cauldron #, no-wrap msgid "2024-01-28" msgstr "" #. type: TH #: archlinux fedora-40 fedora-rawhide mageia-cauldron #, no-wrap msgid "Linux man-pages 6.06" msgstr "" #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "NAME" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "unicode - universal character set" msgstr "" #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "DESCRIPTION" msgstr "" #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The international standard ISO/IEC 10646 defines the Universal Character Set " "(UCS). UCS contains all characters of all other character set standards. " "It also guarantees \"round-trip compatibility\"; in other words, conversion " "tables can be built such that no information is lost when a string is " "converted from any other encoding to UCS and back." msgstr "" #. 
type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "UCS contains the characters required to represent practically all known " "languages. This includes not only the Latin, Greek, Cyrillic, Hebrew, " "Arabic, Armenian, and Georgian scripts, but also Chinese, Japanese and " "Korean Han ideographs as well as scripts such as Hiragana, Katakana, Hangul, " "Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, Telugu, Kannada, " "Malayalam, Thai, Lao, Khmer, Bopomofo, Tibetan, Runic, Ethiopic, Canadian " "Syllabics, Cherokee, Mongolian, Ogham, Myanmar, Sinhala, Thaana, Yi, and " "others. For scripts not yet covered, research on how to best encode them " "for computer usage is still going on and they will be added eventually. " "This might eventually include not only Hieroglyphs and various historic Indo-" "European languages, but even some selected artistic scripts such as Tengwar, " "Cirth, and Klingon. UCS also covers a large number of graphical, " "typographical, mathematical, and scientific symbols, including those " "provided by TeX, Postscript, APL, MS-DOS, MS-Windows, Macintosh, OCR fonts, " "as well as many word processing and publishing systems, and more are being " "added." msgstr "" #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The UCS standard (ISO/IEC 10646) describes a 31-bit character set " "architecture consisting of 128 24-bit I<groups>, each divided into 256 16-" "bit I<planes> made up of 256 8-bit I<rows> with 256 I<column> positions, one " "for each character. Part 1 of the standard (ISO/IEC 10646-1) defines the " "first 65534 code positions (0x0000 to 0xfffd), which form the I<Basic Multilingual Plane> (BMP), that is plane 0 in group 0. Part 2 of the " "standard (ISO/IEC 10646-2) adds characters to group 0 outside the BMP in " "several I<supplementary planes> in the range 0x10000 to 0x10ffff. 
There are " "no plans to add characters beyond 0x10ffff to the standard, therefore of the " "entire code space, only a small fraction of group 0 will ever be actually " "used in the foreseeable future. The BMP contains all characters found in " "the commonly used other character sets. The supplemental planes added by " "ISO/IEC 10646-2 cover only more exotic characters for special scientific, " "dictionary printing, publishing industry, higher-level protocol and " "enthusiast needs." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The representation of each UCS character as a 2-byte word is referred to as " "the UCS-2 form (only for BMP characters), whereas UCS-4 is the " "representation of each character by a 4-byte word. In addition, there exist " "two encoding forms UTF-8 for backward compatibility with ASCII processing " "software and UTF-16 for the backward-compatible handling of non-BMP " "characters up to 0x10ffff by UCS-2 software." msgstr "" #. type: Plain text #: archlinux fedora-40 fedora-rawhide mageia-cauldron msgid "" "The UCS characters 0x0000 to 0x007f are identical to those of the classic US-" "ASCII character set and the characters in the range 0x0000 to 0x00ff are " "identical to those in ISO/IEC\\~8859-1 (Latin-1)." msgstr "" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Combining characters" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Some code points in UCS have been assigned to I<combining characters>. " "These are similar to the nonspacing accent keys on a typewriter. A " "combining character just adds an accent to the previous character. 
The most " "important accented characters have codes of their own in UCS, however, the " "combining character mechanism allows us to add accents and other diacritical " "marks to any character. The combining characters always follow the " "character which they modify. For example, the German character Umlaut-A " "(\"Latin capital letter A with diaeresis\") can either be represented by the " "precomposed UCS code 0x00c4, or alternatively as the combination of a normal " "\"Latin capital letter A\" followed by a \"combining diaeresis\": 0x0041 " "0x0308." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Combining characters are essential for instance for encoding the Thai script " "or for mathematical typesetting and users of the International Phonetic " "Alphabet." msgstr "" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Implementation levels" msgstr "" #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-leap-15-6 opensuse-tumbleweed msgid "" "As not all systems are expected to support advanced mechanisms like " "combining characters, ISO/IEC 10646-1 specifies the following three " "I<implementation levels> of UCS:" msgstr "" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Level 1" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Combining characters and Hangul Jamo (a variant encoding of the Korean " "script, where a Hangul syllable glyph is coded as a triplet or pair of vowel/" "consonant codes) are not supported." msgstr "" #. 
type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Level 2" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "In addition to level 1, combining characters are now allowed for some " "languages where they are essential (e.g., Thai, Lao, Hebrew, Arabic, " "Devanagari, Malayalam)." msgstr "" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Level 3" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "All UCS characters are supported." msgstr "" #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The Unicode 3.0 Standard published by the Unicode Consortium contains " "exactly the UCS Basic Multilingual Plane at implementation level 3, as " "described in ISO/IEC 10646-1:2000. Unicode 3.1 added the supplemental " "planes of ISO/IEC 10646-2. The Unicode standard and technical reports " "published by the Unicode Consortium provide much additional information on " "the semantics and recommended usages of various characters. They provide " "guidelines and algorithms for editing, sorting, comparing, normalizing, " "converting, and displaying Unicode strings." msgstr "" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Unicode under Linux" msgstr "" #. 
type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Under GNU/Linux, the C type I is a signed 32-bit integer type. Its " "values are always interpreted by the C library as UCS code values (in all " "locales), a convention that is signaled by the GNU C library to applications " "by defining the constant B<__STDC_ISO_10646__> as specified in the ISO C99 " "standard." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "UCS/Unicode can be used just like ASCII in input/output streams, terminal " "communication, plaintext files, filenames, and environment variables in the " "ASCII compatible UTF-8 multibyte encoding. To signal the use of UTF-8 as " "the character encoding to all applications, a suitable I has to be " "selected via environment variables (e.g., \"LANG=en_GB.UTF-8\")." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The B function returns the name of the selected " "encoding. Library functions such as B(3) and B(3) can " "be used to transform the internal I characters and strings into the " "system character encoding and back and B(3) tells how many " "positions (0\\[en]2) the cursor is advanced by the output of a character." msgstr "" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Private Use Areas (PUA)" msgstr "" #. commit 9d85025b0418163fae079c9ba8f8445212de8568 #. 
type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "In the Basic Multilingual Plane, the range 0xe000 to 0xf8ff will never be " "assigned to any characters by the standard and is reserved for private " "usage. For the Linux community, this private area has been subdivided " "further into the range 0xe000 to 0xefff which can be used individually by " "any end-user and the Linux zone in the range 0xf000 to 0xf8ff where " "extensions are coordinated among all Linux users. The registry of the " "characters assigned to the Linux zone is maintained by LANANA and the " "registry itself is I in the Linux " "kernel sources (or I before Linux 4.10)." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Two other planes are reserved for private usage, plane 15 (Supplementary " "Private Use Area-A, range 0xf0000 to 0xffffd) and plane 16 (Supplementary " "Private Use Area-B, range 0x100000 to 0x10fffd)." msgstr "" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Literature" msgstr "" #. type: IP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "\\[bu]" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Information technology \\[em] Universal Multiple-Octet Coded Character Set " "(UCS) \\[em] Part 1: Architecture and Basic Multilingual Plane. " "International Standard ISO/IEC 10646-1, International Organization for " "Standardization, Geneva, 2000." msgstr "" #. 
type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "This is the official specification of UCS. Available from E<.UR http://www." "iso.ch/> E<.UE .>" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The Unicode Standard, Version 3.0. The Unicode Consortium, Addison-Wesley, " "Reading, MA, 2000, ISBN 0-201-61633-5." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "S.\\& Harbison, G.\\& Steele. C: A Reference Manual. Fourth edition, " "Prentice Hall, Englewood Cliffs, 1995, ISBN 0-13-326224-3." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "A good reference book about the C programming language. The fourth edition " "covers the 1994 Amendment 1 to the ISO C90 standard, which adds a large " "number of new C library functions for handling wide and multibyte character " "encodings, but it does not yet cover ISO C99, which improved wide and " "multibyte character support even further." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "Unicode Technical Reports." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "E<.UR http://www.unicode.org\\:/reports/> E<.UE>" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "Markus Kuhn: UTF-8 and Unicode FAQ for UNIX/Linux." msgstr "" #. 
type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "E<.UR http://www.cl.cam.ac.uk\\:/\\[ti]mgk25\\:/unicode.html> E<.UE>" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "Bruno Haible: Unicode HOWTO." msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "E<.UR http://www.tldp.org\\:/HOWTO\\:/Unicode-HOWTO.html> E<.UE>" msgstr "" #. .SH AUTHOR #. Markus Kuhn #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "SEE ALSO" msgstr "" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "B(1), B(3), B(7), B(7)" msgstr "" #. type: TH #: debian-bookworm #, no-wrap msgid "2023-02-05" msgstr "" #. type: TH #: debian-bookworm #, no-wrap msgid "Linux man-pages 6.03" msgstr "" #. type: Plain text #: debian-bookworm msgid "" "The international standard ISO 10646 defines the Universal Character Set " "(UCS). UCS contains all characters of all other character set standards. " "It also guarantees \"round-trip compatibility\"; in other words, conversion " "tables can be built such that no information is lost when a string is " "converted from any other encoding to UCS and back." msgstr "" #. type: Plain text #: debian-bookworm msgid "" "The UCS standard (ISO 10646) describes a 31-bit character set architecture " "consisting of 128 24-bit I<groups>, each divided into 256 16-bit I<planes> " "made up of 256 8-bit I<rows> with 256 I<column> positions, one for each " "character. 
Part 1 of the standard (ISO 10646-1) defines the first 65534 " "code positions (0x0000 to 0xfffd), which form the I<Basic Multilingual Plane> (BMP), that is plane 0 in group 0. Part 2 of the standard (ISO " "10646-2) adds characters to group 0 outside the BMP in several " "I<supplementary planes> in the range 0x10000 to 0x10ffff. There are no " "plans to add characters beyond 0x10ffff to the standard, therefore of the " "entire code space, only a small fraction of group 0 will ever be actually " "used in the foreseeable future. The BMP contains all characters found in " "the commonly used other character sets. The supplemental planes added by " "ISO 10646-2 cover only more exotic characters for special scientific, " "dictionary printing, publishing industry, higher-level protocol and " "enthusiast needs." msgstr "" #. type: Plain text #: debian-bookworm debian-unstable opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The UCS characters 0x0000 to 0x007f are identical to those of the classic US-" "ASCII character set and the characters in the range 0x0000 to 0x00ff are " "identical to those in ISO 8859-1 (Latin-1)." msgstr "" #. type: Plain text #: debian-bookworm msgid "" "As not all systems are expected to support advanced mechanisms like " "combining characters, ISO 10646-1 specifies the following three " "I<implementation levels> of UCS:" msgstr "" #. type: Plain text #: debian-bookworm msgid "" "The Unicode 3.0 Standard published by the Unicode Consortium contains " "exactly the UCS Basic Multilingual Plane at implementation level 3, as " "described in ISO 10646-1:2000. Unicode 3.1 added the supplemental planes of " "ISO 10646-2. The Unicode standard and technical reports published by the " "Unicode Consortium provide much additional information on the semantics and " "recommended usages of various characters. They provide guidelines and " "algorithms for editing, sorting, comparing, normalizing, converting, and " "displaying Unicode strings." msgstr "" #. 
type: TH #: debian-unstable opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "2023-03-12" msgstr "" #. type: TH #: debian-unstable opensuse-tumbleweed #, no-wrap msgid "Linux man-pages 6.05.01" msgstr "" #. type: TH #: opensuse-leap-15-6 #, no-wrap msgid "Linux man-pages 6.04" msgstr ""