|    comp.ai    |    Awaiting the gospel from Sarah Connor    |    1,954 messages    |
|    Message 1,901 of 1,954    |
|    blabla12345 to All    |
|    Semantic/Conceptual Similarity of Two Te    |
|    14 Jul 10 09:18:18    |
From: tatata9999@gmail.com

First, I'm an idiot; that is, I'm completely new to AI. Here's the
problem I'm trying to solve. I have:

  a base text file of about 5000 words (5 or 6k)
and
  a small text file of about 500 to 1000 words (0.5 to 1k,
  about 10 to 20% of the bigger file)

I need to find how close the second/smaller file is to the bigger/base
file conceptually or semantically.

I understand there is a lot of praise for LSA or LSI for machine
learning of texts. In the meantime, I'm thinking totally off the top of
my head (you may call it crazy) and want to pose the following question:
how difficult would it be to extract 10 to 20 concepts or key meanings
from the big file, then figure out which 3 to 5 of those are the Most
Valuable Concepts (MVC), and then what the author's view towards these
MVCs is, so that the concepts become contextual? Presumably this would
be somewhat easier, and more accurate, with the smaller file, so the
same process would then be applied to the smaller file...

Am I out of my mind?

Thanks for your time.

[ comp.ai is moderated ... your article may take a while to appear. ]

--- SoupGate-Win32 v1.05
 * Origin: you cannot sedate... all the things you hate (1:229/2)
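For a rough first look at the two-file comparison described above, here is a minimal sketch. It is not LSA/LSI: it swaps in a much simpler bag-of-words cosine similarity, and it treats the most frequent non-stop-words as a crude stand-in for "concepts". The stop-word list, the function names and the sample strings are all assumptions made up for illustration, and the approach does nothing about the author's view towards each concept.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real lists are far longer).
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "it",
              "that", "for", "on", "with", "as", "are", "this"}

def term_counts(text):
    """Lower-case the text, split it into words, and count the non-stop-words."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w not in STOP_WORDS)

def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def top_terms(counts, k=5):
    """The k most frequent terms, used here as a crude proxy for key concepts."""
    return [term for term, _ in counts.most_common(k)]

if __name__ == "__main__":
    # Hypothetical stand-ins for the contents of the base file and the small file.
    base_text = "Neural networks learn representations. Networks need training data."
    small_text = "Training data shapes what neural networks learn."
    base, small = term_counts(base_text), term_counts(small_text)
    print("similarity:", round(cosine_similarity(base, small), 3))
    print("top base terms:", top_terms(base, 3))
    print("top small terms:", top_terms(small, 3))
```

A score near 1 means the two files use much the same vocabulary in similar proportions, and a score near 0 means they share almost no terms. Because this only matches literal words, two texts that express the same idea with different wording will score low, which is the gap LSA/LSI is meant to address.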
(c) 1994, bbs@darkrealms.ca