summaryrefslogtreecommitdiffstats
path: root/ml/dlib/docs/docs/kernel_1da.txt
blob: fde5871e097f8959becf471664a425f2dc20537e (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
<text>


	The Canterbury Corpus

file      size      packed size      bpb         corruption

text:      152089      45580       2.39754        no             
play:      125179      42432       2.71176        no             
html:      24603       7745        2.51839        no             
Csrc:      11150       3165        2.27085        no             
list:      3721        1238        2.66165        no             
Excl:      1029744     194875      1.51397        no             
tech:      426754      111838      2.09653        no             
poem:      481861      148110      2.45897        no             
fax:       513216      56075       0.874096       no             
SPRC:      38240       14248       2.98075        no             
man:       4227        1736        3.28555        no             

average: 2.34273

time: 1.812sec



	The Calgary Corpus

file      size      packed size      bpb         corruption

bib:       111261      29161       2.09676        no             
book1:     768771      235667      2.4524         no             
book2:     610856      165032      2.16132        no             
geo:       102400      67663       5.28617        no             
news:      377109      128148      2.71853        no             
obj1:      21504       10750       3.99926        no             
obj2:      246814      82894       2.68685        no             
paper1:    53161       17398       2.61816        no             
paper2:    82199       26449       2.57414        no             
pic:       513216      56075       0.874096       no             
progc:     39611       13188       2.6635         no             
progl:     71646       17135       1.9133         no             
progp:     49379       11764       1.90591        no             
trans:     93695       19602       1.67369        no             

average: 2.54458

time: 2.36sec



	The Artificial Corpus

file      size      packed size      bpb         corruption

a:         1           6           48             no             
aaa:       100000      19          0.00152        no             
alphabet:  100000      66          0.00528        no             
random:    100000      89652       7.17216        no             

average: 13.7947

time: 375ms



	The Large Corpus 

file      size      packed size      bpb         corruption

E.coli:    4638690     1130363     1.94945        no             
bible:     4047392     871537      1.72266        no             
word:      2473400     589688      1.9073         no             

average: 1.8598

time: 4.484sec

</text>