diff options
Diffstat (limited to 'upstream/mageia-cauldron/man3pm/Encode::JP.3pm')
-rw-r--r-- | upstream/mageia-cauldron/man3pm/Encode::JP.3pm | 141 |
1 files changed, 141 insertions, 0 deletions
diff --git a/upstream/mageia-cauldron/man3pm/Encode::JP.3pm b/upstream/mageia-cauldron/man3pm/Encode::JP.3pm new file mode 100644 index 00000000..f37a516d --- /dev/null +++ b/upstream/mageia-cauldron/man3pm/Encode::JP.3pm @@ -0,0 +1,141 @@ +.\" -*- mode: troff; coding: utf-8 -*- +.\" Automatically generated by Pod::Man 5.01 (Pod::Simple 3.43) +.\" +.\" Standard preamble: +.\" ======================================================================== +.de Sp \" Vertical space (when we can't use .PP) +.if t .sp .5v +.if n .sp +.. +.de Vb \" Begin verbatim text +.ft CW +.nf +.ne \\$1 +.. +.de Ve \" End verbatim text +.ft R +.fi +.. +.\" \*(C` and \*(C' are quotes in nroff, nothing in troff, for use with C<>. +.ie n \{\ +. ds C` "" +. ds C' "" +'br\} +.el\{\ +. ds C` +. ds C' +'br\} +.\" +.\" Escape single quotes in literal strings from groff's Unicode transform. +.ie \n(.g .ds Aq \(aq +.el .ds Aq ' +.\" +.\" If the F register is >0, we'll generate index entries on stderr for +.\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index +.\" entries marked with X<> in POD. Of course, you'll have to process the +.\" output yourself in some meaningful fashion. +.\" +.\" Avoid warning from groff about undefined register 'F'. +.de IX +.. +.nr rF 0 +.if \n(.g .if rF .nr rF 1 +.if (\n(rF:(\n(.g==0)) \{\ +. if \nF \{\ +. de IX +. tm Index:\\$1\t\\n%\t"\\$2" +.. +. if !\nF==2 \{\ +. nr % 0 +. nr F 2 +. \} +. \} +.\} +.rr rF +.\" ======================================================================== +.\" +.IX Title "Encode::JP 3pm" +.TH Encode::JP 3pm 2023-11-28 "perl v5.38.2" "Perl Programmers Reference Guide" +.\" For nroff, turn off justification. Always turn off hyphenation; it makes +.\" way too many mistakes in technical documents. +.if n .ad l +.nh +.SH NAME +Encode::JP \- Japanese Encodings +.SH SYNOPSIS +.IX Header "SYNOPSIS" +.Vb 3 +\& use Encode qw/encode decode/; +\& $euc_jp = encode("euc\-jp", $utf8); # loads Encode::JP implicitly +\& $utf8 = decode("euc\-jp", $euc_jp); # ditto +.Ve +.SH ABSTRACT +.IX Header "ABSTRACT" +This module implements Japanese charset encodings. Encodings +supported are as follows. +.PP +.Vb 10 +\& Canonical Alias Description +\& \-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\- +\& euc\-jp /\ebeuc.*jp$/i EUC (Extended Unix Character) +\& /\ebjp.*euc/i +\& /\ebujis$/i +\& shiftjis /\ebshift.*jis$/i Shift JIS (aka MS Kanji) +\& /\ebsjis$/i +\& 7bit\-jis /\ebjis$/i 7bit JIS +\& iso\-2022\-jp ISO\-2022\-JP [RFC1468] +\& = 7bit JIS with all Halfwidth Kana +\& converted to Fullwidth +\& iso\-2022\-jp\-1 ISO\-2022\-JP\-1 [RFC2237] +\& = ISO\-2022\-JP with JIS X 0212\-1990 +\& support. See below +\& MacJapanese Shift JIS + Apple vendor mappings +\& cp932 /\ebwindows\-31j$/i Code Page 932 +\& = Shift JIS + MS/IBM vendor mappings +\& jis0201\-raw JIS0201, raw format +\& jis0208\-raw JIS0208, raw format +\& jis0212\-raw JIS0212, raw format +\& \-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\-\- +.Ve +.SH DESCRIPTION +.IX Header "DESCRIPTION" +To find out how to use this module in detail, see Encode. +.SH "Note on ISO\-2022\-JP(\-1)?" +.IX Header "Note on ISO-2022-JP(-1)?" +ISO\-2022\-JP\-1 (RFC2237) is a superset of ISO\-2022\-JP (RFC1468) which +adds support for JIS X 0212\-1990. That means you can use the same +code to decode to utf8 but not vice versa. +.PP +.Vb 1 +\& $utf8 = decode(\*(Aqiso\-2022\-jp\-1\*(Aq, $stream); +.Ve +.PP +and +.PP +.Vb 1 +\& $utf8 = decode(\*(Aqiso\-2022\-jp\*(Aq, $stream); +.Ve +.PP +yield the same result but +.PP +.Vb 1 +\& $with_0212 = encode(\*(Aqiso\-2022\-jp\-1\*(Aq, $utf8); +.Ve +.PP +is now different from +.PP +.Vb 1 +\& $without_0212 = encode(\*(Aqiso\-2022\-jp\*(Aq, $utf8 ); +.Ve +.PP +In the latter case, characters that map to 0212 are first converted +to U+3013 (0xA2AE in EUC-JP; a white square also known as 'Tofu' or +\&'geta mark') then fed to the decoding engine. U+FFFD is not used, +in order to preserve text layout as much as possible. +.SH BUGS +.IX Header "BUGS" +The ASCII region (0x00\-0x7f) is preserved for all encodings, even +though this conflicts with mappings by the Unicode Consortium. +.SH "SEE ALSO" +.IX Header "SEE ALSO" +Encode |