|    comp.ai    |    Awaiting the gospel from Sarah Connor    |    1,954 messages    |
|    Message 1,003 of 1,954    |
|    Ted Dunning to All    |
|    Re: A Newbie Question - Matching Two Tex    |
|    16 Apr 06 02:09:21    |
      From: ted.dunning@gmail.com

      Common words weighted by IDF, as suggested by Yang, should be your first
      step. If you decide to go the LSA route, though, you should definitely
      look into a more current version of the same fundamental idea, such as
      probabilistic LSI (pLSI), latent Dirichlet allocation (LDA) or, most
      recently, discrete component analysis (DCA or MDCA).

      See Buntine and Jakulin's article for a unified view of this developing
      field. (http://cosco.hiit.fi/Articles/buntineBohinj.pdf)

      See Blei, Ng and Jordan's article for the Berkeley orthodoxy.
      (http://citeseer.ist.psu.edu/455140.html or
      http://www.cs.berkeley.edu/~jordan/papers/blei03a.ps.gz)

      [ comp.ai is moderated. To submit, just post and be patient, or if ]
      [ that fails mail your article to |
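
      For anyone wanting to try the "common words weighted by IDF" step, here
      is a rough sketch of one way it could look. It is not necessarily what
      Yang had in mind: the tokenizer, the smoothed IDF formula, the
      Jaccard-style normalization and every function name below are my own
      assumptions, chosen only to keep the example short and dependency-free.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (an assumed, very crude tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_idf(corpus):
    """Return (idf, default_weight) computed from a list of reference documents.

    Uses a smoothed IDF, log((1 + N) / (1 + df)) + 1, so every weight is positive.
    default_weight is the weight given to words never seen in the corpus.
    """
    n_docs = len(corpus)
    doc_freq = Counter()
    for doc in corpus:
        # Count each word once per document, not once per occurrence.
        doc_freq.update(set(tokenize(doc)))
    idf = {w: math.log((1 + n_docs) / (1 + df)) + 1.0 for w, df in doc_freq.items()}
    default_weight = math.log(1 + n_docs) + 1.0
    return idf, default_weight


def idf_overlap(text_a, text_b, idf, default_weight):
    """Score two texts by the IDF mass of their shared words over the IDF mass of all their words."""
    words_a, words_b = set(tokenize(text_a)), set(tokenize(text_b))
    union = words_a | words_b
    if not union:
        return 0.0

    def weight(word):
        return idf.get(word, default_weight)

    shared_mass = sum(weight(w) for w in words_a & words_b)
    total_mass = sum(weight(w) for w in union)
    return shared_mass / total_mass


if __name__ == "__main__":
    # Hypothetical reference corpus, used only to estimate document frequencies.
    corpus = [
        "the cat sat on the mat",
        "the dog chased the cat",
        "the stock market fell sharply",
        "the market rallied on earnings",
    ]
    idf, default_weight = build_idf(corpus)
    print(idf_overlap("the stock market fell", "the market rallied", idf, default_weight))
    print(idf_overlap("the stock market fell", "the dog chased", idf, default_weight))
```

      The point of the weighting is that a shared word like "the", which
      appears in every reference document, adds little to the score, while a
      shared rarer word like "market" adds more. The score is 1.0 for texts
      with identical vocabularies and 0.0 when no words are shared.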