Problem with spell
Root Boy Jim
rbj at icst-cmr.arpa
Wed Mar 11 00:54:46 AEST 1987
millions was stopped by the stop list. Why? ons is a non-word which
might be construed by spell as the plural of the valid word on. Hence
ons is in the stop list. The stop list is used like the main list, with
prefix and suffix strripping. Hence millions is seen as a derivative
(milli-ons, like milli-meters) of a word on the stop list and is stopped.
Another one of my favorite stopped word is dishes (dis-hes, hes on stop
list as spurious plural of he). Also microbes, micro-bes. There are thre
or four others that I cannot remember at this late hour.
There is really no solution, short of a total (and perhaps needed) rewrite
of spell, a program that originated in the dark ages on a PDP without
separate I & D space. For an excellnt review of the theory & implementation
of spell, see McIlroy, M. D. "Development of a Spelling List", IEEE Trans.
Communications, Jan 1982, 91-99. Also an article in the Programming Pearls
column in comm ACM about a year ago.
Which is another reason why I don't believe in automated spelling programs.
Spell will also accept any legal prefix and suffix tacked onto any base
word, as in: day unday ununday dayest redisunmispresuffixernessize. Just
store *all* the words in the dictionary, or distribute an empty dictionary
and let people add their own words. Kind of like an adventure.
(Root Boy) Jim "Just Say Yes" Cottrell <rbj at icst-cmr.arpa>
Why did Paul Simon name his album after Elvis Presley's house?
More information about the Comp.unix.questions
mailing list