third_party/heimdal/lib/asn1/README-template.md


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278

#Notes on Heimdal's ASN.1 compiler's "template" backend

```bash
size .libs/libasn1.dylib
size .libs/libasn1base.a | awk '{sum += $1} END {print sum}' | sed 's/^/TEXT baselib: /'
size .libs/asn1_*.o | awk '{sum += $1} END {print sum}' | sed 's/^/generated code stubs: /'
size *_asn1-template.o | awk '{sum += $1} END {print sum}' | sed 's/^/TEXT stubs: /'
```

Notes about the template parser:

 - assumption: code is large, tables smaller

 - size scales better as features as added:

    - adding encoding rules, textual value parsers, comparators, and so on, are
      just new template interpreter, and generally that means no change to
      templates.

    - so template sizing scales like `O(M + N)` where `M` is the size of the
      modules and `N` is the size of the interpreters

    - but codegen sizing scales like `O(M * N)`

    - as we add interpreters the size advantage of templates increases

 - smaller tables and code, more memory sharing, smaller cache footprint,
   should lead to better performance

    - templates are shared for encode/decode/free/copy/print interpreters,
      whereas none of those operations as generated by the codegen backend
      share any code

 - very compressible -- we waste a lot of space in `struct asn1_template` on
   64-bit systems, and still it's smaller than the code generated by the
   codegen backend

   Note that the template backend does currently dedup templates, though that
   is a quadratic operation that we may eventually have to make optional (right
   now it's not a problem).

   If we made the `ptr` field a `uint32_t` instead of a pointer, and wrote a
   linker for templates, and squeezed out some bits of `tt` and `offset` (we'll
   never need even 8 bits for tags, let alone 20!, and we'll never need 32 bits
   for struct sizes and field offsets either, maybe not even 16-bits), we could
   cut the size of `struct asn1_template` in half.

   Also, once we add OER/JER we could have an option to not support TLV ERs and
   then drop a lot of the tag-related parts of the minified AST that templates
   are, further shrinking the templates.

   The smaller the templates, the faster interpreting will be.

 - use explicit stack instead of recursion in template interpreter to reduce
   stack use and increase speed

   The code generated by the codegen backend is also recursive, though the
   compiler could inline some calls.  Using an explicit stack in an iterative
   interpreter would likely be a big win.

 - how to generate template based stubs

   (Note: it's now the default for Heimdal itself.)

   Use the `--template` option to `asn1_compile` to use the template backend,
   or leave it off to use the codegen backend.

 - the template backend now has more functionality than the codegen backend

 - much easier to extend!  adding new encoding rules is just adding a few
   functions to template.c, one set of length/encode/decode functions per ER,
   so we could add OER/PER/XDR/GSER/JER with very little work outside that one
   file and `gen_template.c` (to generate stub functions and possibly slight
   alterations to templates) and gen.c (to generate declarations of those stub
   functions).

 - template decoding has been fuzzed extensively with American Fuzzy Lop (AFL)

TODO:

 - Generate templates for enumerations, with their names and values, so that
   values of enumerated types can be printed.

 - Remove old fuzzer.  Rely on AFL only.

 - Fuzzing tests, always more fuzzing:

    - Instructions:

```
    $ git clone https://github.com/heimdal/heimdal
    $ cd heimdal
    $ srcdir=$PWD
    $ autoreconf -fi
    $ 
    $ mkdir build
    $ cd build
    $ 
    $ ../configure --srcdir=$srcdir ...
    $ make -j4
    $
    $ cd lib/asn1
    $ make clean
    $ AFL_HARDEN=1 make -j4 asn1_print check CC=afl-gcc # or CC=afl-clang
    $ 
    $ # $srcdir/lib/asn1/fuzz-inputs/ has at least one minimized DER value
    $ # produced by taking an EK certificate and truncating the signatureValue
    $ # and tbsCertificate.subjectPublicKeyInfo fields then re-encoding, thus
    $ # cutting down the size of the certificate by 45%.  AFL finds interesting
    $ # code paths much faster if the input corpus is minimized.
    $ 
    $ mkdir f
    $ ../../libtool --mode=execute afl-fuzz -i $srcdir/lib/asn1/fuzz-inputs -o $PWD/f ./asn1_print '@@' Certificate
    $ 
    $ # Or
    $ ../../libtool --mode=execute afl-fuzz -i $srcdir/lib/asn1/fuzz-inputs -o $PWD/f ./asn1_print -A '@@'
    $
    $ # Examine crash reports, if any.  Each crash report consists of an input
    $ # that caused a crash, so run valgrind on each such input:
    $ 
    $ for i in f/crashes/id*; do
    >   echo $i
    >   ../../libtool --mode=execute valgrind --num-callers=64 ./asn1_print $i \
    >        Certificate IOSCertificationRequest >/dev/null 2> f/crashes/vg-${i##*/}
    > done
    $ 
    $ # then review the valgrind output:
    $ $PAGER f/crashes/vg-*
```

    - Here's a screenshot of AFL running on the previous commit:

```
                     american fuzzy lop 2.52b (asn1_print)

┌─ process timing ─────────────────────────────────────┬─ overall results ─────┐
│        run time : 1 days, 22 hrs, 39 min, 51 sec     │  cycles done : 18     │
│   last new path : 0 days, 0 hrs, 38 min, 5 sec       │  total paths : 2310   │
│ last uniq crash : none seen yet                      │ uniq crashes : 0      │
│  last uniq hang : none seen yet                      │   uniq hangs : 0      │
├─ cycle progress ────────────────────┬─ map coverage ─┴───────────────────────┤
│  now processing : 997* (43.16%)     │    map density : 2.19% / 8.74%         │
│ paths timed out : 0 (0.00%)         │ count coverage : 3.25 bits/tuple       │
├─ stage progress ────────────────────┼─ findings in depth ────────────────────┤
│  now trying : interest 16/8         │ favored paths : 319 (13.81%)           │
│ stage execs : 13.1k/13.4k (98.18%)  │  new edges on : 506 (21.90%)           │
│ total execs : 91.9M                 │ total crashes : 0 (0 unique)           │
│  exec speed : 576.2/sec             │  total tmouts : 2158 (180 unique)      │
├─ fuzzing strategy yields ───────────┴───────────────┬─ path geometry ────────┤
│   bit flips : 565/5.60M, 124/5.60M, 74/5.59M        │    levels : 19         │
│  byte flips : 4/699k, 17/375k, 15/385k              │   pending : 552        │
│ arithmetics : 323/20.7M, 8/10.6M, 1/517k            │  pend fav : 0          │
│  known ints : 85/1.76M, 148/9.98M, 175/16.8M        │ own finds : 2308       │
│  dictionary : 0/0, 0/0, 12/6.62M                    │  imported : n/a        │
│       havoc : 757/6.35M, 0/0                        │ stability : 100.00%    │
│        trim : 14.30%/336k, 46.60%                   ├────────────────────────┘
└─────────────────────────────────────────────────────┘          [cpu000:196%]
```

    - TODO: Make building with AFL a ./cofigure option.

    - TODO: Make fuzzing with AFL a make target.

    - Fuzz decode round-tripping (don't just decode, but also encoded the
      decoded).

 - Performance testing

 - `ASN1_MALLOC_ENCODE()` as a function, replaces `encode_` and `length_`

 - Fix SIZE constraits

 - Proper implementation of `SET { ... }`

 - Compact types that only contain on entry to not having a header.


SIZE - Futher down is later generations of the template parser

```
	code:
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	462848	12288	0	323584	798720	c3000 (O2)

	trivial types:
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	446464	12288	0	323584	782336	bf000 (O2)

	OPTIONAL
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	425984	16384	0	323584	765952	bb000 (O2)

	SEQ OF
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	368640	32768	0	327680	729088	b2000 (O2)
	348160	32768	0	327680	708608	ad000 (Os)

	BOOLEAN
	==================
	339968	32768	0	327680	700416	ab000 (Os)

	TYPE_EXTERNAL:
	==================
	331776	32768	0	327680	692224	a9000 (Os)

	SET OF
	==================
	327680	32768	0	327680	688128	a8000 (Os)

	TYPE_EXTERNAL everywhere
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	167936	69632	0	327680	565248	8a000 (Os)

	TAG uses ->ptr (header and trailer)
	==================
	229376	102400	0	421888	753664	b8000 (O0)

	TAG uses ->ptr (header only)
	==================
	221184	77824	0	421888	720896	b0000 (O0)

	BER support for octet string (not working)
	==================
	180224	73728	0	417792	671744	a4000 (O2)

	CHOICE and BIT STRING missign
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	172032	73728	0	417792	663552	a2000 (Os)

	No accessor functions to global variable
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	159744	73728	0	393216	626688	99000 (Os)

	All types tables (except choice) (id still objects)
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	167936	77824	0	421888	667648	a3000
	base lib: 22820

	__TEXT	__DATA	__OBJC	others	dec	hex
	==================
	167936	77824	0	421888	667648	a3000 (Os)
	baselib: 22820
	generated code stubs: 41472
	TEXT stubs: 112560

	All types, id still objects
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	155648	81920	0	430080	667648	a3000 (Os)
	TEXT baselib: 23166
	generated code stubs: 20796
	TEXT stubs: 119891

	All types, id still objects, dup compression
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	143360	65536	0	376832	585728	8f000 (Os)
	TEXT baselib: 23166
	generated code stubs: 20796
	TEXT stubs: 107147

	All types, dup compression, id vars
	==================
	__TEXT	__DATA	__OBJC	others	dec	hex
	131072	65536	0	352256	548864	86000
	TEXT baselib: 23166
	generated code stubs: 7536
	TEXT stubs: 107147
```