   comp.ai      Awaiting the gospel from Sarah Connor      1,954 messages   

   Message 1,003 of 1,954   
   Ted Dunning to All   
   Re: A Newbie Question - Matching Two Tex   
   16 Apr 06 02:09:21   
   
   From: ted.dunning@gmail.com   
      
   Common words weighted by IDF as suggested by Yang should be your first   
   step, but if you decide to go the LSA route, you should definitely look   
   into a more current version of the same fundamental idea such as   
   probabilistic LSI (pLSI), latent Dirichlet allocation (LDA) or, most   
   recently, discrete component analysis (DCA or MDCA).   
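      
   As a rough illustration of the first step, here is a minimal sketch of   
   scoring two texts by their common words weighted by IDF. The function   
   names, the tokenizer, and the smoothed IDF formula are assumptions made   
   for this example, not something taken from Yang's suggestion.   

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def idf_table(corpus):
    """Build a smoothed IDF table from a list of reference documents.

    Returns (idf, default_idf). default_idf is the weight given to words
    that never occur in the reference corpus (an assumption of this sketch).
    """
    n = len(corpus)
    df = Counter()
    for doc in corpus:
        # Count each word once per document (document frequency).
        df.update(set(tokenize(doc)))
    idf = {w: math.log((1 + n) / (1 + c)) + 1.0 for w, c in df.items()}
    default_idf = math.log(1 + n) + 1.0
    return idf, default_idf


def match_score(text_a, text_b, idf, default_idf):
    """Sum the IDF weights of the words the two texts have in common."""
    common = set(tokenize(text_a)) & set(tokenize(text_b))
    return sum(idf.get(w, default_idf) for w in common)


if __name__ == "__main__":
    corpus = [
        "the cat sat on the mat",
        "the dog sat on the log",
        "the latent dirichlet model",
        "the weather is nice",
    ]
    idf, default_idf = idf_table(corpus)
    # Sharing a rare word counts for more than sharing a common one.
    print(match_score("the dirichlet prior", "a dirichlet model of the text",
                      idf, default_idf))
    print(match_score("the cat", "the dog", idf, default_idf))
```

   A word that occurs in every reference document gets the minimum weight,   
   so a match on "the" contributes far less than a match on a rare term.   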
      
   See Buntine and Jakulin's article for a unified view of this developing   
   field.  (http://cosco.hiit.fi/Articles/buntineBohinj.pdf)   
      
   See Blei, Ng and Jordan's article for the Berkeley orthodoxy.   
   (http://citeseer.ist.psu.edu/455140.html or   
   http://www.cs.berkeley.edu/~jordan/papers/blei03a.ps.gz)   
      
   [ comp.ai is moderated.  To submit, just post and be patient, or if ]   
   [ that fails mail your article to , and ]   
   [ ask your news administrator to fix the problems with your system. ]   
      
   --- SoupGate-Win32 v1.05   
    * Origin: you cannot sedate... all the things you hate (1:229/2)   

