1
|
| From svn.tartarus.org/snowball/trunk/website/algorithms/swedish/stop.txt
|
2
|
| This file is distributed under the BSD License.
|
3
|
| See http://snowball.tartarus.org/license.php
|
4
|
| Also see http://www.opensource.org/licenses/bsd-license.html
|
5
|
| - Encoding was converted to UTF-8.
|
6
|
| - This notice was added.
|
7
|
|
|
8
|
| NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
|
9
|
|
10
|
| A Swedish stop word list. Comments begin with vertical bar. Each stop
|
11
|
| word is at the start of a line.
|
12
|
|
13
|
| This is a ranked list (commonest to rarest) of stopwords derived from
|
14
|
| a large text sample.
|
15
|
|
16
|
| Swedish stop words occasionally exhibit homonym clashes. For example
|
17
|
| så = so, but also seed. These are indicated clearly below.
|
18
|
|
19
|
och | and
|
20
|
det | it, this/that
|
21
|
att | to (with infinitive)
|
22
|
i | in, at
|
23
|
en | a
|
24
|
jag | I
|
25
|
hon | she
|
26
|
som | who, that
|
27
|
han | he
|
28
|
på | on
|
29
|
den | it, this/that
|
30
|
med | with
|
31
|
var | where, each
|
32
|
sig | him(self) etc
|
33
|
för | for
|
34
|
så | so (also: seed)
|
35
|
till | to
|
36
|
är | is
|
37
|
men | but
|
38
|
ett | a
|
39
|
om | if; around, about
|
40
|
hade | had
|
41
|
de | they, these/those
|
42
|
av | of
|
43
|
icke | not, no
|
44
|
mig | me
|
45
|
du | you
|
46
|
henne | her
|
47
|
då | then, when
|
48
|
sin | his
|
49
|
nu | now
|
50
|
har | have
|
51
|
inte | inte någon = no one
|
52
|
hans | his
|
53
|
honom | him
|
54
|
skulle | 'sake'
|
55
|
hennes | her
|
56
|
där | there
|
57
|
min | my
|
58
|
man | one (pronoun)
|
59
|
ej | nor
|
60
|
vid | at, by, on (also: vast)
|
61
|
kunde | could
|
62
|
något | some etc
|
63
|
från | from, off
|
64
|
ut | out
|
65
|
när | when
|
66
|
efter | after, behind
|
67
|
upp | up
|
68
|
vi | we
|
69
|
dem | them
|
70
|
vara | be
|
71
|
vad | what
|
72
|
över | over
|
73
|
än | than
|
74
|
dig | you
|
75
|
kan | can
|
76
|
sina | his
|
77
|
här | here
|
78
|
ha | have
|
79
|
mot | towards
|
80
|
alla | all
|
81
|
under | under (also: wonder)
|
82
|
någon | some etc
|
83
|
eller | or (else)
|
84
|
allt | all
|
85
|
mycket | much
|
86
|
sedan | since
|
87
|
ju | why
|
88
|
denna | this/that
|
89
|
själv | myself, yourself etc
|
90
|
detta | this/that
|
91
|
åt | to
|
92
|
utan | without
|
93
|
varit | was
|
94
|
hur | how
|
95
|
ingen | no
|
96
|
mitt | my
|
97
|
ni | you
|
98
|
bli | to be, become
|
99
|
blev | from bli
|
100
|
oss | us
|
101
|
din | thy
|
102
|
dessa | these/those
|
103
|
några | some etc
|
104
|
deras | their
|
105
|
blir | from bli
|
106
|
mina | my
|
107
|
samma | (the) same
|
108
|
vilken | who, that
|
109
|
er | you, your
|
110
|
sådan | such a
|
111
|
vår | our
|
112
|
blivit | from bli
|
113
|
dess | its
|
114
|
inom | within
|
115
|
mellan | between
|
116
|
sådant | such a
|
117
|
varför | why
|
118
|
varje | each
|
119
|
vilka | who, that
|
120
|
ditt | thy
|
121
|
vem | who
|
122
|
vilket | who, that
|
123
|
sitta | his
|
124
|
sådana | such a
|
125
|
vart | each
|
126
|
dina | thy
|
127
|
vars | whose
|
128
|
vårt | our
|
129
|
våra | our
|
130
|
ert | your
|
131
|
era | your
|
132
|
vilkas | whose
|
133
|
|