Problem with spell
750025@Mohamed_el_Lozy
mohamed at hscfvax.UUCP
Fri Mar 6 13:37:11 AEST 1987
millions was stopped by the stop list. Why? ons is a non-word which
might be construed by spell as the plural of the valid word on. Hence
ons is in the stop list. The stop list is used like the main list, with
prefix and suffix stripping. Hence millions is seen as a derivative
(milli-ons, like milli-meters) of a word on the stop list and is stopped.
Another of my favorite stopped words is dishes (dis-hes, hes on the stop
list as a spurious plural of he). Also microbes, micro-bes. There are three
or four others that I cannot remember at this late hour.
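
To make the mechanism concrete, here is a toy sketch in Python of a stop list
that is consulted with the same affix stripping as the main list. It is not
the real spell code: the word lists, the affix tables, and the function names
are all made up for illustration, and the real program's stripping rules are
far more elaborate.

```python
# Hypothetical, tiny tables; the real spell lists and affix rules differ.
PREFIXES = ("milli", "micro", "dis")
SUFFIXES = ("s",)
MAIN_LIST = {"million", "dish", "microbe", "meter", "on", "he", "be"}
STOP_LIST = {"ons", "hes", "bes"}  # spurious plurals of on, he, be


def derivations(word):
    """Yield the word itself plus every form reachable by stripping
    at most one prefix and at most one suffix."""
    stems = [word]
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) > len(prefix):
            stems.append(word[len(prefix):])
    for stem in stems:
        yield stem
        for suffix in SUFFIXES:
            if stem.endswith(suffix) and len(stem) > len(suffix):
                yield stem[:-len(suffix)]


def is_stopped(word):
    """True if any stripped form of the word is on the stop list."""
    return any(form in STOP_LIST for form in derivations(word))


def is_accepted(word):
    """A word passes only if it is not stopped and some stripped form
    is on the main list."""
    if is_stopped(word):
        return False
    return any(form in MAIN_LIST for form in derivations(word))


if __name__ == "__main__":
    for w in ("millions", "dishes", "microbes", "millimeters", "ons"):
        print(w, "accepted" if is_accepted(w) else "rejected")
```

In this sketch millions, dishes and microbes are rejected because stripping
the prefix leaves ons, hes and bes, while millimeters still passes because
meters is not on the stop list.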
There is really no solution, short of a total (and perhaps needed) rewrite
of spell, a program that originated in the dark ages on a PDP without
separate I & D space. For an excellent review of the theory and implementation
of spell, see McIlroy, M. D., "Development of a Spelling List", IEEE Trans.
Communications, Jan 1982, 91-99. There was also an article in the Programming
Pearls column in Communications of the ACM about a year ago.
More information about the Comp.unix.questions mailing list