home bbs files messages ]

Forums before death by AOL, social media and spammers... "We can't have nice things"

   comp.ai      Awaiting the gospel from Sarah Connor      1,954 messages   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]

   Message 1,843 of 1,954   
   krishnapostings@gmail.com to All   
   Text classification, clustering librarie   
   20 Jan 09 09:57:34   
   
   I am looking for libraries that implement well know techniques (Naive-   
   Bayes, SVM, Fisher etc) for text classification, clustering. A few I   
   came across are Orange(http://www.ailab.si/orange), Rainbow(http://   
   www.cs.cmu.edu/~mccallum/bow/rainbow/), LIBSVM(http://   
   www.csie.ntu.edu.tw/~cjlin/libsvm/). I have some problems with these.   
   Regarding scale, I am trying to classify documents which have over   
   100000 features, 100000 items and 1000 categories.   
   Orange: Scope of techniques covered is good, scalability is a blocking   
   issue   
   Rainbow: Buggy, tried to patch a couple but still there are   
   impediments, for e.g., can't even run SVM   
   LIBSVM: Haven't tried yet on my data.   
      
   I am wondering if there are other libraries (free, commercial).   
      
   Thanks,   
   Krishna   
      
   [ comp.ai is moderated ... your article may take a while to appear. ]   
      
   --- SoupGate-Win32 v1.05   
    * Origin: you cannot sedate... all the things you hate (1:229/2)   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]


(c) 1994,  bbs@darkrealms.ca