Projekt

Obecné

Profil

Stáhnout (4.62 KB) Statistiky
| Větev: | Tag: | Revize:
1
 | From svn.tartarus.org/snowball/trunk/website/algorithms/dutch/stop.txt
2
 | This file is distributed under the BSD License.
3
 | See http://snowball.tartarus.org/license.php
4
 | Also see http://www.opensource.org/licenses/bsd-license.html
5
 |  - Encoding was converted to UTF-8.
6
 |  - This notice was added.
7
 |
8
 | NOTE: To use this file with StopFilterFactory, you must specify format="snowball"
9

    
10
 | A Dutch stop word list. Comments begin with vertical bar. Each stop
11
 | word is at the start of a line.
12

    
13
 | This is a ranked list (commonest to rarest) of stopwords derived from
14
 | a large sample of Dutch text.
15

    
16
 | Dutch stop words frequently exhibit homonym clashes. These are indicated
17
 | clearly below.
18

    
19
de             |  the
20
en             |  and
21
van            |  of, from
22
ik             |  I, the ego
23
te             |  (1) chez, at etc, (2) to, (3) too
24
dat            |  that, which
25
die            |  that, those, who, which
26
in             |  in, inside
27
een            |  a, an, one
28
hij            |  he
29
het            |  the, it
30
niet           |  not, nothing, naught
31
zijn           |  (1) to be, being, (2) his, one's, its
32
is             |  is
33
was            |  (1) was, past tense of all persons sing. of 'zijn' (to be) (2) wax, (3) the washing, (4) rise of river
34
op             |  on, upon, at, in, up, used up
35
aan            |  on, upon, to (as dative)
36
met            |  with, by
37
als            |  like, such as, when
38
voor           |  (1) before, in front of, (2) furrow
39
had            |  had, past tense all persons sing. of 'hebben' (have)
40
er             |  there
41
maar           |  but, only
42
om             |  round, about, for etc
43
hem            |  him
44
dan            |  then
45
zou            |  should/would, past tense all persons sing. of 'zullen'
46
of             |  or, whether, if
47
wat            |  what, something, anything
48
mijn           |  possessive and noun 'mine'
49
men            |  people, 'one'
50
dit            |  this
51
zo             |  so, thus, in this way
52
door           |  through by
53
over           |  over, across
54
ze             |  she, her, they, them
55
zich           |  oneself
56
bij            |  (1) a bee, (2) by, near, at
57
ook            |  also, too
58
tot            |  till, until
59
je             |  you
60
mij            |  me
61
uit            |  out of, from
62
der            |  Old Dutch form of 'van der' still found in surnames
63
daar           |  (1) there, (2) because
64
haar           |  (1) her, their, them, (2) hair
65
naar           |  (1) unpleasant, unwell etc, (2) towards, (3) as
66
heb            |  present first person sing. of 'to have'
67
hoe            |  how, why
68
heeft          |  present third person sing. of 'to have'
69
hebben         |  'to have' and various parts thereof
70
deze           |  this
71
u              |  you
72
want           |  (1) for, (2) mitten, (3) rigging
73
nog            |  yet, still
74
zal            |  'shall', first and third person sing. of verb 'zullen' (will)
75
me             |  me
76
zij            |  she, they
77
nu             |  now
78
ge             |  'thou', still used in Belgium and south Netherlands
79
geen           |  none
80
omdat          |  because
81
iets           |  something, somewhat
82
worden         |  to become, grow, get
83
toch           |  yet, still
84
al             |  all, every, each
85
waren          |  (1) 'were' (2) to wander, (3) wares, (3)
86
veel           |  much, many
87
meer           |  (1) more, (2) lake
88
doen           |  to do, to make
89
toen           |  then, when
90
moet           |  noun 'spot/mote' and present form of 'to must'
91
ben            |  (1) am, (2) 'are' in interrogative second person singular of 'to be'
92
zonder         |  without
93
kan            |  noun 'can' and present form of 'to be able'
94
hun            |  their, them
95
dus            |  so, consequently
96
alles          |  all, everything, anything
97
onder          |  under, beneath
98
ja             |  yes, of course
99
eens           |  once, one day
100
hier           |  here
101
wie            |  who
102
werd           |  imperfect third person sing. of 'become'
103
altijd         |  always
104
doch           |  yet, but etc
105
wordt          |  present third person sing. of 'become'
106
wezen          |  (1) to be, (2) 'been' as in 'been fishing', (3) orphans
107
kunnen         |  to be able
108
ons            |  us/our
109
zelf           |  self
110
tegen          |  against, towards, at
111
na             |  after, near
112
reeds          |  already
113
wil            |  (1) present tense of 'want', (2) 'will', noun, (3) fender
114
kon            |  could; past tense of 'to be able'
115
niets          |  nothing
116
uw             |  your
117
iemand         |  somebody
118
geweest        |  been; past participle of 'be'
119
andere         |  other
(31-31/39)