 | From svn.tartarus.org/snowball/trunk/website/algorithms/danish/stop.txt
 | This file is distributed under the BSD License.
 | See http://snowball.tartarus.org/license.php
 | Also see http://www.opensource.org/licenses/bsd-license.html
 |  - Encoding was converted to UTF-8.
 |  - This notice was added.
 |
 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"

 | A Danish stop word list. Comments begin with vertical bar. Each stop
 | word is at the start of a line.

 | This is a ranked list (commonest to rarest) of stopwords derived from
 | a large text sample.

og           | and
i            | in
jeg          | I
det          | that (dem. pronoun)/it (pers. pronoun)
at           | that (in front of a sentence)/to (with infinitive)
en           | a/an
den          | it (pers. pronoun)/that (dem. pronoun)
til          | to/at/for/until/against/by/of/into, more
er           | present tense of "to be"
som          | who, as
på           | on/upon/in/on/at/to/after/of/with/for, on
de           | they
med          | with/by/in, along
han          | he
af           | of/by/from/off/for/in/with/on, off
for          | at/for/to/from/by/of/ago, in front/before, because
ikke         | not
der          | who/which, there/those
var          | past tense of "to be"
mig          | me/myself
sig          | oneself/himself/herself/itself/themselves
men          | but
et           | a/an/one, one (number), someone/somebody/one
har          | present tense of "to have"
om           | round/about/for/in/a, about/around/down, if
vi           | we
min          | my
havde        | past tense of "to have"
ham          | him
hun          | she
nu           | now
over         | over/above/across/by/beyond/past/on/about, over/past
da           | then, when/as/since
fra          | from/off/since, off, since
du           | you
ud           | out
sin          | his/her/its/one's
dem          | them
os           | us/ourselves
op           | up
man          | you/one
hans         | his
hvor         | where
eller        | or
hvad         | what
skal         | must/shall etc.
selv         | myself/yourself/herself/ourselves etc., even
her          | here
alle         | all/everyone/everybody etc.
vil          | will (verb)
blev         | past tense of "to stay/to remain/to get/to become"
kunne        | could
ind          | in
når          | when
være         | infinitive of "to be"
dog          | however/yet/after all
noget        | something
ville        | would
jo           | you know/you see (adv), yes
deres        | their/theirs
efter        | after/behind/according to/for/by/from, later/afterwards
ned          | down
skulle       | should
denne        | this
end          | than
dette        | this
mit          | my/mine
også         | also
under        | under/beneath/below/during, below/underneath
have         | have
dig          | you
anden        | other
hende        | her
mine         | my
alt          | everything
meget        | much/very, plenty of
sit          | his, her, its, one's
sine         | his, her, its, one's
vor          | our
mod          | against
disse        | these
hvis         | if
din          | your/yours
nogle        | some
hos          | by/at
blive        | be/become
mange        | many
ad           | by/through
bliver       | present tense of "to be/to become"
hendes       | her/hers
været        | been
thi          | for (conj)
jer          | you
sådan        | such, like this/like that