1 | From svn.tartarus.org/snowball/trunk/website/algorithms/english/stop.txt
2 | This file is distributed under the BSD License.
3 | See http://snowball.tartarus.org/license.php
4 | Also see http://www.opensource.org/licenses/bsd-license.html
5 | - Encoding was converted to UTF-8.
6 | - This notice was added.
8 | An English stop word list. Comments begin with vertical bar. Each stop
9 | word is at the start of a line.
11 | Many of the forms below are quite rare (e.g. "yourselves") but included for
17 i | subject, always in upper case of course
20 my | possessive adjective
21 | the possessive pronoun `mine' is best suppressed, because of the
22 | sense of coal-mine etc.
28 | care is required here because US = United States. It is usually
29 | safe to remove it if it is in lower case.
30 our | possessive adjective
31 ours | possessive pronoun
33 | second person (archaic `thou' forms not included)
34 you | subject and object
35 your | possessive adjective
36 yours | possessive pronoun
37 yourself | reflexive (singular)
38 yourselves | reflexive (plural)
39 | third person singular
42 his | possessive adjective and pronoun
46 her | object and possessive adjective
47 hers | possessive pronoun
50 it | subject and object
51 its | possessive adjective
56 their | possessive adjective
57 theirs | possessive pronoun
58 themselves | reflexive
59 | other forms (demonstratives, interrogatives)
69 | VERB FORMS (using F.R. Palmer's nomenclature)
71 am | 1st person, present
72 is | -s form (3rd person, present)
74 was | 1st person, past
77 been | past participle
90 | The forms below are, I believe, best omitted, because of the significant
97 | fight the good fight with all thy MIGHT
99 | would, could, should, ought might however be included
126 | COMPOUND FORMS, increasingly encountered nowadays in 'formal' writing
166 | auxiliary + negation
177 | miscellaneous forms
203 | THE REST (Overlap among prepositions, conjunctions, adverbs etc is so
204 | high, that classification is pointless.)
274 | Just for the record, the following words are among the commonest in English