home bbs files messages ]

Forums before death by AOL, social media and spammers... "We can't have nice things"

   comp.ai      Awaiting the gospel from Sarah Connor      1,954 messages   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]

   Message 270 of 1,954   
   rif to ajit   
   Re: A query about ISOMAP   
   20 Feb 04 22:57:22   
   
   From: rif@mit.edu   
      
   ajit_v_rROMEO@yahoo.co.uk (ajit) writes:   
      
   > Hello,   
   >   
   > I have a query about some of the extensions of the ISOMAP algorithm. I   
   > basically have around N = 17000 points, each in D-dimensional space (D   
   > being 400) which supposedly lie on a manifold. I wish to use ISOMAP to   
   > project these points onto a d-dimensional space (d << D :  'd' could   
   > be around 40 or less). My problem is that the basic ISOMAP algorithm   
   > requires the computation (and storage) of a pairwise geodesic-distance   
   > matrix, which in this case would be very large, i.e. 17000 * 17000. I   
   > came to know about landmark ISOMAP, an extension of ISOMAP in which   
   > one needs to compute the matrix of geodesic distances to only some   
   > selected 'n' "landmark points" (where n << N).   
   >   
   > My problem is that even this modification would still require me to   
   > first compute a large N by N matrix of pairwise Euclidian distances   
   > (and I will then apply Dijkstra's shortest path algorithm to obtain   
   > the geodesic distances). Of course, I could keep computing the   
   > Euclidian distances from one point to all others, every single time,   
   > while running the Dijkstra's algorithm, but that is going to be very   
   > time consuming.   
   >   
   > Is there a way out?   
   >   
   >   
      
   You may be misinterpreting the Landmark Isomap algorithm.  Looking at   
   "Global versus local methods in nonlinear dimensionality reduction,"   
   by de Silva and Tenenbaum, they suggest starting with a neighborhood   
   graph G which is already itself sparse.  Instead of storing all the   
   distances in this neighborhood graph, only the k closest neighbors of   
   each point are stored.  Therefore, this step is always tractable,   
   assuming you believe that O(N^2 D) work is tractable for very large   
   datasets.   
      
   The original ISOMAP then requires the computation of a dense N by N   
   matrix of geodesic distances.  Landmark ISOMAP's modification is in   
   this step --- the geodesic matrix is only a L by N dense matrix, where   
   L is the number of landmarks.   
      
   Hope this helps.   
      
   Cheers,   
      
   rif   
      
   [ comp.ai is moderated.  To submit, just post and be patient, or if ]   
   [ that fails mail your article to , and ]   
   [ ask your news administrator to fix the problems with your system. ]   
      
   --- SoupGate-Win32 v1.05   
    * Origin: you cannot sedate... all the things you hate (1:229/2)   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]


(c) 1994,  bbs@darkrealms.ca