# German translation of manpages # This file is distributed under the same license as the manpages-l10n package. # Copyright © of this file: # Sebastian Rittau , 2000. # Martin Eberhard Schauer , 2010. msgid "" msgstr "" "Project-Id-Version: manpages-l10n 4.22.0\n" "POT-Creation-Date: 2024-06-01 06:33+0200\n" "PO-Revision-Date: 2024-06-01 19:41+0200\n" "Last-Translator: Dr. Tobias Quathamer \n" "Language-Team: German \n" "Language: de\n" "MIME-Version: 1.0\n" "Content-Type: text/plain; charset=UTF-8\n" "Content-Transfer-Encoding: 8bit\n" "Plural-Forms: nplurals=2; plural=(n != 1);\n" #. type: TH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "UTF-8" msgstr "UTF-8" #. type: TH #: archlinux debian-unstable opensuse-tumbleweed #, no-wrap msgid "2024-05-02" msgstr "2. Mai 2024" #. type: TH #: archlinux debian-unstable #, no-wrap msgid "Linux man-pages 6.8" msgstr "Linux man-pages 6.8" #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "NAME" msgstr "BEZEICHNUNG" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "UTF-8 - an ASCII compatible multibyte Unicode encoding" msgstr "UTF-8 - eine ASCII-kompatible Multibyte-Unicode-Kodierung" #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "DESCRIPTION" msgstr "BESCHREIBUNG" #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The Unicode 3.0 character set occupies a 16-bit code space. The most " "obvious Unicode encoding (known as UCS-2) consists of a sequence of 16-bit " "words. Such strings can contain\\[em]as part of many 16-bit " "characters\\[em]bytes such as \\[aq]\\e0\\[aq] or \\[aq]/\\[aq], which have " "a special meaning in filenames and other C library function arguments. In " "addition, the majority of UNIX tools expect ASCII files and can't read 16-" "bit words as characters without major modifications. For these reasons, " "UCS-2 is not a suitable external encoding of Unicode in filenames, text " "files, environment variables, and so on. The ISO/IEC 10646 Universal " "Character Set (UCS), a superset of Unicode, occupies an even larger code " "space\\[em]31\\ bits\\[em]and the obvious UCS-4 encoding for it (a sequence " "of 32-bit words) has the same problems." msgstr "" "Der Unicode-3.0-Zeichensatz ist durch 16-Bit-Wörter definiert. Die " "offensichtlichste Unicode-Kodierung (UCS-2) besteht aus einer Folge von 16-" "Bit-Zeichen. Solche Zeichenketten können – als Bestandteile vieler 16-Bit-" "Zeichen – Bytes wie »\\e0« oder »/« enthalten, die z. B. in Dateinamen und " "anderen Argumenten von C-Bibliotheksfunktionen eine besondere Bedeutung " "haben. Außerdem arbeiten die meisten UNIX-Programme mit ASCII-Dateien und " "können 16-Bit-Wörter nicht ohne größere Änderungen verarbeiten. Darum ist " "UCS-2 keine geeignete externe Kodierung von Unicode in Dateinamen, Text-" "Dateien, Umgebungsvariablen usw. Der ISO/IEC 10646 Universal Character Set " "(UCS), eine Erweiterung von Unicode, belegt sogar einen noch größeren Code-" "Raum – 31 Bit. Die offensichtliche UCS-4-Kodierung dafür (eine Folge von 32-" "Bit-Wörtern) leidet unter denselben Problemen wie die UCS-2-Kodierung." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The UTF-8 encoding of Unicode and UCS does not have these problems and is " "the common way in which Unicode is used on UNIX-style operating systems." msgstr "" "Die UTF-8-Kodierung von Unicode und UCS hat diese Probleme nicht. Sie ist " "der gebräuchliche Anwendungsfall des Unicode-Zeichensatzes auf UNIX-artigen " "Betriebssystemen." #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Properties" msgstr "Eigenschaften" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "The UTF-8 encoding has the following nice properties:" msgstr "Die UTF-8-Kodierung hat die folgenden netten Eigenschaften:" #. type: IP #: archlinux debian-unstable fedora-rawhide opensuse-tumbleweed #, no-wrap msgid "\\[bu]" msgstr "\\[bu]" #. type: Plain text #: archlinux debian-unstable fedora-rawhide opensuse-tumbleweed msgid "" "UCS characters 0x00000000 to 0x0000007f (the classic US-ASCII characters) " "are encoded simply as bytes 0x00 to 0x7f (ASCII compatibility). This means " "that files and strings which contain only 7-bit ASCII characters have the " "same encoding under both ASCII and UTF-8." msgstr "" "Die UCS-Zeichen 0x00000000 bis 0x0000007f (die klassischen US-ASCII-Zeichen) " "werden einfach als die Bytes 0x00 bis 0x7f kodiert und auf diese Weise die " "ASCII-Kompatibilität hergestellt. Dateinamen und Zeichenketten, die nur aus " "7-Bit-ASCII-Zeichen bestehen, haben darum unter ASCII und UTF-8 dieselbe " "Kodierung." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "All UCS characters greater than 0x7f are encoded as a multibyte sequence " "consisting only of bytes in the range 0x80 to 0xfd, so no ASCII byte can " "appear as part of another character and there are no problems with, for " "example, \\[aq]\\e0\\[aq] or \\[aq]/\\[aq]." msgstr "" "Alle UCS-Zeichen über 0x7f werden als Folge mehrerer Bytes im Bereich 0x80 " "bis 0xfd dargestellt, so dass kein B-Byte als Teil eines anderen " "Zeichens auftritt und es keine Probleme z.B. mit »\\e0« oder »/« gibt." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "The lexicographic sorting order of UCS-4 strings is preserved." msgstr "" "Die lexikographische Sortierreihenfolge von UCS-4-Zeichenketten bleibt " "erhalten." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "All possible 2\\[ha]31 UCS codes can be encoded using UTF-8." msgstr "Alle möglichen 2\\[ha]31 UCS-Zeichen können mit UTF-8 kodiert werden." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The bytes 0xc0, 0xc1, 0xfe, and 0xff are never used in the UTF-8 encoding." msgstr "" "Die Bytes 0xc0, 0xc1, 0xfe und 0xff werden in der UTF-8-Kodierung nicht " "verwendet." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The first byte of a multibyte sequence which represents a single non-ASCII " "UCS character is always in the range 0xc2 to 0xfd and indicates how long " "this multibyte sequence is. All further bytes in a multibyte sequence are " "in the range 0x80 to 0xbf. This allows easy resynchronization and makes the " "encoding stateless and robust against missing bytes." msgstr "" "Das erste Byte einer Multibyte-Folge, die ein einzelnes nicht in ASCII " "enthaltenes UCS-Zeichen darstellt, ist grundsätzlich im Bereich 0xc2 bis " "0xfd und zeigt die Länge der Folge an. Alle anderen Bytes der Folge sind im " "Bereich 0x80 bis 0xbf. Dadurch wird eine einfache Neusynchronisierung " "ermöglich, da die Kodierung zustandslos und daher robust gegenüber fehlenden " "oder verloren gegangenen Bytes ist." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "UTF-8 encoded UCS characters may be up to six bytes long, however the " "Unicode standard specifies no characters above 0x10ffff, so Unicode " "characters can be only up to four bytes long in UTF-8." msgstr "" "UTF-8-kodierte UCS-Zeichen können bis zu sechs Byte lang sein. Da aber die " "Unicode-Norm keine Zeichen über 0x10FFFF spezifiziert, können Unicode-" "Zeichen in UTF-8 nur bis zu vier Byte lang sein." #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Encoding" msgstr "Kodierung" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The following byte sequences are used to represent a character. The " "sequence to be used depends on the UCS code number of the character:" msgstr "" "Die folgenden Byte-Folgen werden für die Darstellung eines Zeichens " "verwendet. Die zu verwendende Folge hängt vom B-Code des Zeichens ab:" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x00000000 - 0x0000007F:" msgstr "0x00000000 - 0x0000007F:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "0I" msgstr "0I" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x00000080 - 0x000007FF:" msgstr "0x00000080 - 0x000007FF:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "110I 10I" msgstr "110I 10I" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x00000800 - 0x0000FFFF:" msgstr "0x00000800 - 0x0000FFFF:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "1110I 10I 10I" msgstr "1110I 10I 10I" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x00010000 - 0x001FFFFF:" msgstr "0x00010000 - 0x001FFFFF:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "11110I 10I 10I 10I" msgstr "11110I 10I 10I 10I" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x00200000 - 0x03FFFFFF:" msgstr "0x00200000 - 0x03FFFFFF:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "111110I 10I 10I 10I 10I" msgstr "111110I 10I 10I 10I 10I" #. type: TP #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "0x04000000 - 0x7FFFFFFF:" msgstr "0x04000000 - 0x7FFFFFFF:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "1111110I 10I 10I 10I 10I 10I" msgstr "" "1111110I 10I 10I 10I 10I 10I" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The I bit positions are filled with the bits of the character code " "number in binary representation, most significant bit first (big-endian). " "Only the shortest possible multibyte sequence which can represent the code " "number of the character can be used." msgstr "" "Die I-Bits werden durch den Code des Zeichens in Binärdarstellung " "ersetzt, mit dem höchstwertigsten Bit zuerst (Big Endian). Es wird die " "jeweils kürzeste Multibyte-Folge benutzt, die den Code des Zeichens " "darstellen kann." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The UCS code values 0xd800\\[en]0xdfff (UTF-16 surrogates) as well as 0xfffe " "and 0xffff (UCS noncharacters) should not appear in conforming UTF-8 " "streams. According to RFC 3629 no point above U+10FFFF should be used, " "which limits characters to four bytes." msgstr "" "Die UCS-Codewerte 0xd800…0xdfff (UTF-16-Ersatzzeichen) sowie 0xfffe und " "0xffff (in UCS keinem Zeichen zugeordnet; UCS noncharacters) sollten nicht " "in standardkonformen UTF-8-Datenströmen enthalten sein. Gemäß RFC 3629 " "sollte kein Punkt oberhalb von U+10FFFF verwendet werden, wodurch Zeichen " "auf vier Byte beschränkt werden." #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Example" msgstr "Beispiel" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The Unicode character 0xa9 = 1010 1001 (the copyright sign) is encoded in " "UTF-8 as" msgstr "" "Das Unicode-Zeichen 0xa9 = 1010 1001 (das Copyright-Zeichen) wird in UTF-8 " "als" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "11000010 10101001 = 0xc2 0xa9" msgstr "11000010 10101001 = 0xc2 0xa9" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "and character 0x2260 = 0010 0010 0110 0000 (the \"not equal\" symbol) is " "encoded as:" msgstr "" "dargestellt und das Zeichen 0x2260 = 0010 0010 0110 0000 (das Ungleich-" "Symbol) als:" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "11100010 10001001 10100000 = 0xe2 0x89 0xa0" msgstr "11100010 10001001 10100000 = 0xe2 0x89 0xa0" #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Application notes" msgstr "Bemerkungen zur Anwendung" # #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "Users have to select a UTF-8 locale, for example with" msgstr "Anwender müssen, z.B. mit" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "export LANG=en_GB.UTF-8" msgstr "export LANG=en_GB.UTF-8," #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "in order to activate the UTF-8 support in applications." msgstr "" "eine UTF-8-Locale wählen, um die UTF-8-Unterstützung in Programmen zu " "aktivieren." #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "Application software that has to be aware of the used character encoding " "should always set the locale with for example" msgstr "" "Anwendungs-Software, die auf den verwendeten Zeichensatz achten muss, sollte " "immer, z. B. mit" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "setlocale(LC_CTYPE, \"\")" msgstr "setlocale(LC_CTYPE, \"\")," #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "and programmers can then test the expression" msgstr "die Locale setzen und Programmierer anschließend den Ausdruck" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "strcmp(nl_langinfo(CODESET), \"UTF-8\") == 0" msgstr "strcmp(nl_langinfo(CODESET), \"UTF-8\") == 0" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "to determine whether a UTF-8 locale has been selected and whether therefore " "all plaintext standard input and output, terminal communication, plaintext " "file content, filenames, and environment variables are encoded in UTF-8." msgstr "" "auswerten, um festzustellen, ob eine UTF-8-Locale ausgewählt wurde und ob " "daher sämtliche Standard-Klartexteingaben und -ausgaben, " "Terminalkommunikation, Klartext-Dateiinhalte, Dateinamen und " "Umgebungsvariablen in UTF-8 kodiert sind." #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-tumbleweed msgid "" "Programmers accustomed to single-byte encodings such as US-ASCII or ISO/" "IEC\\ 8859 have to be aware that two assumptions made so far are no longer " "valid in UTF-8 locales. Firstly, a single byte does not necessarily " "correspond any more to a single character. Secondly, since modern terminal " "emulators in UTF-8 mode also support Chinese, Japanese, and Korean double-" "width characters as well as nonspacing combining characters, outputting a " "single character does not necessarily advance the cursor by one position as " "it did in ASCII. Library functions such as B(3) and " "B(3) should be used today to count characters and cursor " "positions." msgstr "" "An Einzel-Byte-Kodierungen wie US-ASCII oder ISO/IEC\\ 8859 gewöhnte " "Programmierer müssen daran denken, dass zwei bislang getroffene Annahmen in " "UTF-8-Locales nicht mehr gültig sind. Erstens bedeutet ein einziges Byte " "nicht mehr unbedingt ein einzelnes Zeichen. Zweitens, da moderne Terminal-" "Emulatoren im UTF-8-Modus auch chinesische, japanische und koreanische " "Zeichen doppelter Breite sowie Kombinationszeichen ohne horizontalen " "Vorschub unterstützen, setzt die Ausgabe eines einzelnen Zeichens nicht " "unbedingt den Cursor um eine Position weiter, wie es bei ASCII der Fall war. " "Heutzutage sollten Sie Bibliotheksfunktionen wie B(3) und " "B(3) nutzen, um Zeichen und Cursorpositionen zählen." #. type: Plain text #: archlinux debian-unstable fedora-40 fedora-rawhide mageia-cauldron #: opensuse-tumbleweed msgid "" "The official ESC sequence to switch from an ISO/IEC\\ 2022 encoding scheme " "(as used for instance by VT100 terminals) to UTF-8 is ESC % G " "(\"\\ex1b%G\"). The corresponding return sequence from UTF-8 to ISO/IEC\\ " "2022 is ESC % @ (\"\\ex1b%@\"). Other ISO/IEC\\ 2022 sequences (such as for " "switching the G0 and G1 sets) are not applicable in UTF-8 mode." msgstr "" "Die offizielle Escape-Sequenz aus einem ISO/IEC-2022-Kodierungsschema (wie " "zum Beispiel von VT100-Terminals verwendet) nach UTF-8 ist ESC % G " "(\"\\ex1b%G\"). Die entsprechende Sequenz für die Rückkehr von UTF-8 zu ISO/" "IEC\\ 2022 ist ESC % @ (\"\\ex1b%@\"). Andere ISO/IEC-2022-Sequenzen (wie " "zum Umschalten der G0- und G1-Sätze) sind im UTF-8-Modus nicht anwendbar." #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Security" msgstr "Sicherheit" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "The Unicode and UCS standards require that producers of UTF-8 shall use the " "shortest form possible, for example, producing a two-byte sequence with " "first byte 0xc0 is nonconforming. Unicode 3.1 has added the requirement " "that conforming programs must not accept non-shortest forms in their input. " "This is for security reasons: if user input is checked for possible security " "violations, a program might check only for the ASCII version of \"/../\" or " "\";\" or NUL and overlook that there are many non-ASCII ways to represent " "these things in a non-shortest UTF-8 encoding." msgstr "" "Die Standards Unicode und UCS fordern, dass Erzeuger von UTF-8 die kürzeste " "mögliche Form liefern. Z. B. ist der Erzeugung einer Zwei-Byte-Sequenz mit " "dem ersten Byte 0xc0 nicht konform. Unicode 3.1 fordert, dass konforme " "Programme in ihrer Eingabe Formen, die nicht die kürzesten sind, nicht " "akzeptieren dürfen. Dies geschieht aus Sicherheitsgründen: Wenn " "Benutzereingaben auf mögliche Sicherheitsverletzungen überprüft werden, " "könnte ein Programm nur nach den ASCII-Versionen von \"/../\" oder \";\" " "oder dem Nullbyte suchen und übersehen, dass es viele Möglichkeiten einer " "Nicht-ASCII-Darstellung neben der kürzesten UFT-8-Kodierung dieser Zeichen " "gibt." #. type: SS #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "Standards" msgstr "Standards" #. .SH AUTHOR #. Markus Kuhn #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "ISO/IEC 10646-1:2000, Unicode 3.1, RFC\\ 3629, Plan 9." msgstr "ISO/IEC 10646-1:2000, Unicode 3.1, RFC\\ 3629, Plan 9." #. type: SH #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed #, no-wrap msgid "SEE ALSO" msgstr "SIEHE AUCH" #. type: Plain text #: archlinux debian-bookworm debian-unstable fedora-40 fedora-rawhide #: mageia-cauldron opensuse-leap-15-6 opensuse-tumbleweed msgid "" "B(1), B(3), B(3), B(7), " "B(7)" msgstr "" "B(1), B(3), B(3), B(7), " "B(7)" #. type: TH #: debian-bookworm #, no-wrap msgid "2023-02-10" msgstr "10. Februar 2023" #. type: TH #: debian-bookworm #, no-wrap msgid "Linux man-pages 6.03" msgstr "Linux man-pages 6.03" #. type: Plain text #: debian-bookworm msgid "" "The Unicode 3.0 character set occupies a 16-bit code space. The most " "obvious Unicode encoding (known as UCS-2) consists of a sequence of 16-bit " "words. Such strings can contain\\[em]as part of many 16-bit " "characters\\[em]bytes such as \\[aq]\\e0\\[aq] or \\[aq]/\\[aq], which have " "a special meaning in filenames and other C library function arguments. In " "addition, the majority of UNIX tools expect ASCII files and can't read 16-" "bit words as characters without major modifications. For these reasons, " "UCS-2 is not a suitable external encoding of Unicode in filenames, text " "files, environment variables, and so on. The ISO 10646 Universal Character " "Set (UCS), a superset of Unicode, occupies an even larger code " "space\\[em]31\\ bits\\[em]and the obvious UCS-4 encoding for it (a sequence " "of 32-bit words) has the same problems." msgstr "" "Der Unicode-3.0-Zeichensatz ist durch 16-Bit-Wörter definiert. Die " "offensichtlichste Unicode-Kodierung (UCS-2) besteht aus einer Folge von 16-" "Bit-Zeichen. Solche Zeichenketten können – als Bestandteile vieler 16-Bit-" "Zeichen – Bytes wie »\\e0« oder »/« enthalten, die z. B. in Dateinamen und " "anderen Argumenten von C-Bibliotheksfunktionen eine besondere Bedeutung " "haben. Außerdem arbeiten die meisten UNIX-Programme mit ASCII-Dateien und " "können 16-Bit-Wörter nicht ohne größere Änderungen verarbeiten. Darum ist " "UCS-2 keine geeignete externe Kodierung von Unicode in Dateinamen, Text-" "Dateien, Umgebungsvariablen usw. Der ISO 10646 Universal Character Set " "(UCS), eine Erweiterung von Unicode, belegt sogar einen noch größeren Code-" "Raum – 31 Bit. Die offensichtliche UCS-4-Kodierung dafür (eine Folge von 32-" "Bit-Wörtern) leidet unter denselben Problemen wie die UCS-2-Kodierung." #. type: TP #: debian-bookworm fedora-40 mageia-cauldron opensuse-leap-15-6 #, no-wrap msgid "*" msgstr "*" #. type: Plain text #: debian-bookworm fedora-40 mageia-cauldron opensuse-leap-15-6 msgid "" "UCS characters 0x00000000 to 0x0000007f (the classic US-ASCII characters) " "are encoded simply as bytes 0x00 to 0x7f (ASCII compatibility). This means " "that files and strings which contain only 7-bit ASCII characters have the " "same encoding under both ASCII and UTF-8 ." msgstr "" "Die UCS-Zeichen 0x00000000 bis 0x0000007f (die klassischen US-ASCII-Zeichen) " "werden einfach als die Bytes 0x00 bis 0x7f kodiert und auf diese Weise die " "ASCII-Kompatibilität hergestellt. Dateinamen und Zeichenketten, die nur aus " "7-Bit-ASCII-Zeichen bestehen, haben darum unter ASCII und UTF-8 dieselbe " "Kodierung." #. type: Plain text #: debian-bookworm opensuse-leap-15-6 msgid "" "Programmers accustomed to single-byte encodings such as US-ASCII or ISO 8859 " "have to be aware that two assumptions made so far are no longer valid in " "UTF-8 locales. Firstly, a single byte does not necessarily correspond any " "more to a single character. Secondly, since modern terminal emulators in " "UTF-8 mode also support Chinese, Japanese, and Korean double-width " "characters as well as nonspacing combining characters, outputting a single " "character does not necessarily advance the cursor by one position as it did " "in ASCII. Library functions such as B(3) and B(3) " "should be used today to count characters and cursor positions." msgstr "" "An Einzel-Byte-Kodierungen wie US-ASCII oder ISO 8859 gewöhnte Programmierer " "müssen daran denken, dass zwei bislang getroffene Annahmen in UTF-8-Locales " "nicht mehr gültig sind. Erstens bedeutet ein einziges Byte nicht mehr " "unbedingt ein einzelnes Zeichen. Zweitens, da moderne Terminal-Emulatoren im " "UTF-8-Modus auch chinesische, japanische und koreanische Zeichen doppelter " "Breite sowie Kombinationszeichen ohne horizontalen Vorschub unterstützen, " "setzt die Ausgabe eines einzelnen Zeichens nicht unbedingt den Cursor um " "eine Position weiter, wie es bei ASCII der Fall war. Heutzutage sollten Sie " "Bibliotheksfunktionen wie B(3) und B(3) nutzen, um " "Zeichen und Cursorpositionen zählen." #. type: Plain text #: debian-bookworm opensuse-leap-15-6 msgid "" "The official ESC sequence to switch from an ISO 2022 encoding scheme (as " "used for instance by VT100 terminals) to UTF-8 is ESC % G (\"\\ex1b%G\"). " "The corresponding return sequence from UTF-8 to ISO 2022 is ESC % @ " "(\"\\ex1b%@\"). Other ISO 2022 sequences (such as for switching the G0 and " "G1 sets) are not applicable in UTF-8 mode." msgstr "" "Die offizielle Escape-Sequenz aus einem ISO-2022-Kodierungsschema (wie zum " "Beispiel von VT100-Terminals verwendet) nach UTF-8 ist ESC % G " "(\"\\ex1b%G\"). Die entsprechende Sequenz für die Rückkehr von UTF-8 zu ISO " "2022 ist ESC % @ (\"\\ex1b%@\"). Andere ISO-2022-Sequenzen (wie zum " "Umschalten der G0- und G1-Sätze) sind im UTF-8-Modus nicht anwendbar." #. type: TH #: fedora-40 mageia-cauldron #, no-wrap msgid "2024-01-28" msgstr "28. Januar 2024" #. type: TH #: fedora-40 mageia-cauldron #, no-wrap msgid "Linux man-pages 6.06" msgstr "Linux man-pages 6.06" #. type: TH #: fedora-rawhide #, no-wrap msgid "2024-03-14" msgstr "14. März 2024" #. type: TH #: fedora-rawhide #, no-wrap msgid "Linux man-pages 6.7" msgstr "Linux man-pages 6.7" #. type: TH #: opensuse-leap-15-6 #, no-wrap msgid "2023-03-12" msgstr "12. März 2023" #. type: TH #: opensuse-leap-15-6 #, no-wrap msgid "Linux man-pages 6.04" msgstr "Linux-Handbuchseiten 6.04" #. type: TH #: opensuse-tumbleweed #, no-wrap msgid "Linux man-pages (unreleased)" msgstr "Linux man-pages (unveröffentlicht)"