home bbs files messages ]

Forums before death by AOL, social media and spammers... "We can't have nice things"

   alt.os.linux.mandriva      Somewhat decent but also getting bloated      29,919 messages   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]

   Message 29,021 of 29,919   
   Moe Trin to Bobbie Sellers   
   Re: Generating one's one dictionary modi   
   10 Feb 13 23:36:41   
   
   From: ibuprofin@painkiller.example.tld.invalid   
      
   On Sun, 10 Feb 2013, in the Usenet newsgroup alt.os.linux.mandriva, in article   
   , Bobbie Sellers wrote:   
      
   >Jim Beard wrote:   
      
   >> Adam wrote:   
      
   >>> Sometimes I have no idea where spell checkers get their basic   
   >>> list from. I think some of them must be decades old.   
      
   That's a big old "that depends" - Mentioned in my other reply, I'm   
   using two lists, one from the MOBY project (which was DOS shareware   
   as I understand it) from 1988, and the Webster's Second International   
   originally copyrighted in 1934.  Much of what I've seen has been   
   created by inhaling huge quantities of text much as a classic   
   dictionary creator would do.   
      
   >> When spell-checkers first came into general use, I thought they   
   >> were great. I just kept putting words the spell-checker did not   
   >> know into the list it used, and all was great. Until a new OS or a   
   >> new wordprocessing program came along, and all that effort was in   
   >> the used bit bin.   
      
   Let's qualify some of that - in UNIX, I'm used to the spell checker   
   looking at a "word list" - currently in /usr/share/dict/ and (here in   
   the USA) consisting of words in ASCII, one per line.  There's often a   
   DSW going on about the lists, and problems with copyrights and such.   
   From the LSM:   
   =========================   
   Begin2   
   Title         /usr/dict/words for Linux (linux.words)   
   Version       2   
   Desc1         This is word list containing 45402 words.  Great care has   
   Desc2         been taken to be sure that this word list is free of   
   Desc3         copyright.  This list is suitable for English language   
   Desc4         spelling checkers and as a target for look(1).   
   Author        Rik Faith   
   AuthorEmail   faith@cs.unc.edu   
   Maintainer    Rik Faith   
   MaintEmail    faith@cs.unc.edu   
   Site1         ftp.cs.unc.edu   
   Path1         /pub/faith/linux/utils   
   File1         linux.words.2.tar.gz   
   FileSize1     140k   
   Site2         tsx-11.mit.edu   
   Path2         /pub/linux/docs   
   Site3         sunsite.unc.edu   
   Path3         /pub/Linux/libs   
   CopyPolicy1   Free of non-commercial restrictions.   
   CopyPolicy2   Free for personal, educational, and research purposes.   
   Keywords      dict, dictionary, words, wordlist   
   Entered       Sun Oct 10 19:02:47 1993   
   EnteredBy     Rik Faith   
   CheckedEmail  faith@cs.unc.edu   
   End   
   =========================   
      
   but most distributions have far larger lists, and multiple languages.   
      
   Those words that _you_ are adding go into a "personal" dictionary or   
   word-list, and this is a dot-file in your home directory - examples   
   such as "~/.ispell_english", "~/.hunspell_en_US" or "~/.aspell.en.pws"   
   or similar.  Easiest way to find the personal list is to use the spell   
   checker to check a document with an intentionally mis-spelled word,   
   tell the spell checker to add that word to the dictionary, and then   
   use 'ls -ldart .[a-z]* | tail'   
      
   I'm not using windoze, and my word-processor is "/bin/vi", so I can't   
   answer for what a real word processor might use/do.   
      
   >        Does anyone know how to do this, back up the work   
   >and add the modified dictionary to a new installation?   
      
   This _MAY_ depend on the distribution - I've seen them using several   
   different spell-checker programs, but as mentioned they tend to be   
   using wordlists in /usr/share/dict/.   Some spell checkers allow you   
   to add word-lists, and the hoop-jumping is described in the man-page.   
      
   >        I have been thru the same hassle myself as my   
   >posts to several list use specialised jargon and for   
   >example medical terminology.   
      
   Yup - that's true with many technical fields.   
      
   >Adding to the dictionary seems to be broken.  There seems to be no   
   >way to permanently add words in proper spellings to the things.   
      
   'ls -ldart ~/.[a-z]*'  and see if you can identify the "personal"   
   word list.  These are (in the US) typically ASCII text, one word per   
   line, and as such, generic across distributions and spell-checkers.   
   the FILENAME may be something weird, but the contents are normal.   
   Because I'm using a networked file-system, my home directory is on   
   the file-server, and that is unchanged as I may replace computers,   
   never mind distributions.  My current file has been around for a   
   while, as it still has "Sunnyvale" and the street I used to live on   
   there, neither of which were in the "Webster's Second International"   
   (and I've been here for almost 17 years).   
      
           Old guy   
      
   --- SoupGate-Win32 v1.05   
    * Origin: you cannot sedate... all the things you hate (1:229/2)   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]


(c) 1994,  bbs@darkrealms.ca