| From svn.tartarus.org/snowball/trunk/website/algorithms/norwegian/stop.txt
| This file is distributed under the BSD License.
| See http://snowball.tartarus.org/license.php
| Also see http://www.opensource.org/licenses/bsd-license.html
| - Encoding was converted to UTF-8.
| - This notice was added.
|
| NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
|
| A Norwegian stop word list. Comments begin with a vertical bar. Each stop
| word is at the start of a line.
|
| This stop word list is for the dominant bokmål dialect. Words unique
| to nynorsk are marked *.
|
| Revised by Jan Bruusgaard <Jan.Bruusgaard@ssb.no>, Jan 2005
|
og | and
i | in
jeg | I
det | it/this/that
at | that (conjunction)
en | a/an
et | a/an
den | it/this/that
til | to
er | is/am/are
som | who/that
på | on
de | they / you (formal)
med | with
han | he
av | of
ikke | not
ikkje | not *
der | there
så | so
var | was/were
meg | me
seg | oneself (reflexive)
men | but
ett | one
har | have
om | about
vi | we
min | my
mitt | my
ha | have
hadde | had
hun | she
nå | now
over | over
da | when/as
ved | by/know
fra | from
du | you
ut | out
sin | hers/his
dem | them
oss | us
opp | up
man | you/one
kan | can
hans | his
hvor | where
eller | or
hva | what
skal | shall/must
selv | self (reflexive)
sjøl | self (reflexive)
her | here
alle | all
vil | will
bli | become
ble | became
blei | became *
blitt | have become
kunne | could
inn | in
når | when
være | be
kom | come
noen | some
noe | some
ville | would
dere | you
som | who/which/that
deres | their/theirs
kun | only/just
ja | yes
etter | after
ned | down
skulle | should
denne | this
for | for/because
deg | you
si | hers/his
sine | hers/his
sitt | hers/his
mot | against
å | to
meget | much
hvorfor | why
dette | this
disse | these/those
uten | without
hvordan | how
ingen | none
din | your
ditt | your
blir | become
samme | same
hvilken | which
hvilke | which (plural)
sånn | such a
inni | inside/within
mellom | between
vår | our
hver | each
hvem | who
vors | us/ours
hvis | whose
både | both
bare | only/just
enn | than
fordi | as/because
før | before
mange | many
også | also
slik | just
vært | been
være | to be
båe | both *
begge | both
siden | since
dykk | you (plural) *
dykkar | yours *
dei | they *
deira | their *
deires | theirs *
deim | them *
di | your (fem.) *
då | as/when *
eg | I *
ein | a/an *
eit | a/an *
eitt | a/an *
elles | or *
honom | him *
hjå | at *
ho | she *
hoe | she *
henne | her
hennar | her/hers
hennes | hers
hoss | how *
hossen | how *
ikkje | not *
ingi | no one *
inkje | no one *
korleis | how *
korso | how *
kva | what/which *
kvar | where *
kvarhelst | where *
kven | who/whom *
kvi | why *
kvifor | why *
me | we *
medan | while *
mi | my *
mine | my *
mykje | much *
no | now *
nokon | some (masc./neut.) *
noka | some (fem.) *
nokor | some *
noko | some *
nokre | some *
si | his/hers *
sia | since *
sidan | since *
so | so *
somt | some *
somme | some *
um | about *
upp | up *
vere | be *
vore | was *
verte | become *
vort | become *
varte | became *
vart | became *