summaryrefslogtreecommitdiffstats
path: root/upstream/debian-bookworm/man3/Encode::Encoder.3perl
blob: c68e4b62c662bcc62d48d157e66e4b41955556a8 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
.\" Automatically generated by Pod::Man 4.14 (Pod::Simple 3.43)
.\"
.\" Standard preamble:
.\" ========================================================================
.de Sp \" Vertical space (when we can't use .PP)
.if t .sp .5v
.if n .sp
..
.de Vb \" Begin verbatim text
.ft CW
.nf
.ne \\$1
..
.de Ve \" End verbatim text
.ft R
.fi
..
.\" Set up some character translations and predefined strings.  \*(-- will
.\" give an unbreakable dash, \*(PI will give pi, \*(L" will give a left
.\" double quote, and \*(R" will give a right double quote.  \*(C+ will
.\" give a nicer C++.  Capital omega is used to do unbreakable dashes and
.\" therefore won't be available.  \*(C` and \*(C' expand to `' in nroff,
.\" nothing in troff, for use with C<>.
.tr \(*W-
.ds C+ C\v'-.1v'\h'-1p'\s-2+\h'-1p'+\s0\v'.1v'\h'-1p'
.ie n \{\
.    ds -- \(*W-
.    ds PI pi
.    if (\n(.H=4u)&(1m=24u) .ds -- \(*W\h'-12u'\(*W\h'-12u'-\" diablo 10 pitch
.    if (\n(.H=4u)&(1m=20u) .ds -- \(*W\h'-12u'\(*W\h'-8u'-\"  diablo 12 pitch
.    ds L" ""
.    ds R" ""
.    ds C` ""
.    ds C' ""
'br\}
.el\{\
.    ds -- \|\(em\|
.    ds PI \(*p
.    ds L" ``
.    ds R" ''
.    ds C`
.    ds C'
'br\}
.\"
.\" Escape single quotes in literal strings from groff's Unicode transform.
.ie \n(.g .ds Aq \(aq
.el       .ds Aq '
.\"
.\" If the F register is >0, we'll generate index entries on stderr for
.\" titles (.TH), headers (.SH), subsections (.SS), items (.Ip), and index
.\" entries marked with X<> in POD.  Of course, you'll have to process the
.\" output yourself in some meaningful fashion.
.\"
.\" Avoid warning from groff about undefined register 'F'.
.de IX
..
.nr rF 0
.if \n(.g .if rF .nr rF 1
.if (\n(rF:(\n(.g==0)) \{\
.    if \nF \{\
.        de IX
.        tm Index:\\$1\t\\n%\t"\\$2"
..
.        if !\nF==2 \{\
.            nr % 0
.            nr F 2
.        \}
.    \}
.\}
.rr rF
.\" ========================================================================
.\"
.IX Title "Encode::Encoder 3perl"
.TH Encode::Encoder 3perl "2023-11-25" "perl v5.36.0" "Perl Programmers Reference Guide"
.\" For nroff, turn off justification.  Always turn off hyphenation; it makes
.\" way too many mistakes in technical documents.
.if n .ad l
.nh
.SH "NAME"
Encode::Encoder \-\- Object Oriented Encoder
.SH "SYNOPSIS"
.IX Header "SYNOPSIS"
.Vb 10
\&  use Encode::Encoder;
\&  # Encode::encode("ISO\-8859\-1", $data); 
\&  Encode::Encoder\->new($data)\->iso_8859_1; # OOP way
\&  # shortcut
\&  use Encode::Encoder qw(encoder);
\&  encoder($data)\->iso_8859_1;
\&  # you can stack them!
\&  encoder($data)\->iso_8859_1\->base64;  # provided base64() is defined
\&  # you can use it as a decoder as well
\&  encoder($base64)\->bytes(\*(Aqbase64\*(Aq)\->latin1;
\&  # stringified
\&  print encoder($data)\->utf8\->latin1;  # prints the string in latin1
\&  # numified
\&  encoder("\ex{abcd}\ex{ef}g")\->utf8 == 6; # true. bytes::length($data)
.Ve
.SH "ABSTRACT"
.IX Header "ABSTRACT"
\&\fBEncode::Encoder\fR allows you to use Encode in an object-oriented
style.  This is not only more intuitive than a functional approach,
but also handier when you want to stack encodings.  Suppose you want
your \s-1UTF\-8\s0 string converted to Latin1 then Base64: you can simply say
.PP
.Vb 1
\&  my $base64 = encoder($utf8)\->latin1\->base64;
.Ve
.PP
instead of
.PP
.Vb 2
\&  my $latin1 = encode("latin1", $utf8);
\&  my $base64 = encode_base64($utf8);
.Ve
.PP
or the lazier and more convoluted
.PP
.Vb 1
\&  my $base64 = encode_base64(encode("latin1", $utf8));
.Ve
.SH "Description"
.IX Header "Description"
Here is how to use this module.
.IP "\(bu" 4
There are at least two instance variables stored in a hash reference,
{data} and {encoding}.
.IP "\(bu" 4
When there is no method, it takes the method name as the name of the
encoding and encodes the instance \fIdata\fR with \fIencoding\fR.  If successful,
the instance \fIencoding\fR is set accordingly.
.IP "\(bu" 4
You can retrieve the result via \->data but usually you don't have to 
because the stringify operator ("") is overridden to do exactly that.
.SS "Predefined Methods"
.IX Subsection "Predefined Methods"
This module predefines the methods below:
.ie n .IP "$e = Encode::Encoder\->new([$data, $encoding]);" 4
.el .IP "\f(CW$e\fR = Encode::Encoder\->new([$data, \f(CW$encoding\fR]);" 4
.IX Item "$e = Encode::Encoder->new([$data, $encoding]);"
returns an encoder object.  Its data is initialized with \f(CW$data\fR if
present, and its encoding is set to \f(CW$encoding\fR if present.
.Sp
When \f(CW$encoding\fR is omitted, it defaults to utf8 if \f(CW$data\fR is already in
utf8 or "" (empty string) otherwise.
.IP "\fBencoder()\fR" 4
.IX Item "encoder()"
is an alias of Encode::Encoder\->\fBnew()\fR.  This one is exported on demand.
.ie n .IP "$e\->data([$data])" 4
.el .IP "\f(CW$e\fR\->data([$data])" 4
.IX Item "$e->data([$data])"
When \f(CW$data\fR is present, sets the instance data to \f(CW$data\fR and returns the
object itself.  Otherwise, the current instance data is returned.
.ie n .IP "$e\->encoding([$encoding])" 4
.el .IP "\f(CW$e\fR\->encoding([$encoding])" 4
.IX Item "$e->encoding([$encoding])"
When \f(CW$encoding\fR is present, sets the instance encoding to \f(CW$encoding\fR and
returns the object itself.  Otherwise, the current instance encoding is
returned.
.ie n .IP "$e\->bytes([$encoding])" 4
.el .IP "\f(CW$e\fR\->bytes([$encoding])" 4
.IX Item "$e->bytes([$encoding])"
decodes instance data from \f(CW$encoding\fR, or the instance encoding if
omitted.  If the conversion is successful, the instance encoding
will be set to "".
.Sp
The name \fIbytes\fR was deliberately picked to avoid namespace tainting
\&\*(-- this module may be used as a base class so method names that appear
in Encode::Encoding are avoided.
.SS "Example: base64 transcoder"
.IX Subsection "Example: base64 transcoder"
This module is designed to work with Encode::Encoding.
To make the Base64 transcoder example above really work, you could
write a module like this:
.PP
.Vb 10
\&  package Encode::Base64;
\&  use parent \*(AqEncode::Encoding\*(Aq;
\&  _\|_PACKAGE_\|_\->Define(\*(Aqbase64\*(Aq);
\&  use MIME::Base64;
\&  sub encode{ 
\&      my ($obj, $data) = @_; 
\&      return encode_base64($data);
\&  }
\&  sub decode{
\&      my ($obj, $data) = @_; 
\&      return decode_base64($data);
\&  }
\&  1;
\&  _\|_END_\|_
.Ve
.PP
And your caller module would be something like this:
.PP
.Vb 2
\&  use Encode::Encoder;
\&  use Encode::Base64;
\&
\&  # now you can really do the following
\&
\&  encoder($data)\->iso_8859_1\->base64;
\&  encoder($base64)\->bytes(\*(Aqbase64\*(Aq)\->latin1;
.Ve
.SS "Operator Overloading"
.IX Subsection "Operator Overloading"
This module overloads two operators, stringify ("") and numify (0+).
.PP
Stringify dumps the data inside the object.
.PP
Numify returns the number of bytes in the instance data.
.PP
They come in handy when you want to print or find the size of data.
.SH "SEE ALSO"
.IX Header "SEE ALSO"
Encode,
Encode::Encoding