summaryrefslogtreecommitdiffstats
path: root/src/plugins/fts-lucene/textcat.conf
blob: d75c4fe68f611256d384a2bf62fb0e323cef9dd0 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
#
# A sample config file for the language models 
# provided with Gertjan van Noords language guesser
# (http://odur.let.rug.nl/~vannoord/TextCat/)
#
# Notes: 
# - You may consider eliminating a couple of small languages from this
# list because they cause false positives with big languages and are
# bad for performance. (Do you really want to recognize Drents?)
# - Putting the most probable languages at the top of the list
# improves performance, because this will raise the threshold for
# likely candidates more quickly.
#
LM/english.lm			english
LM/italian.lm			italian
LM/danish.lm			danish
LM/dutch.lm			dutch
LM/finnish.lm			finnish
LM/french.lm			french
LM/german.lm			german
LM/norwegian.lm			norwegian
LM/portuguese.lm		portuguese
LM/russian.lm			russian
LM/spanish.lm			spanish
LM/swedish.lm			swedish