

   comp.compilers      Compiler construction, theory, etc. (Mod      2,753 messages   


   Message 908 of 2,753   
   Tom Copeland to Tommy Nordgren   
   Re: Ada95 to Ada2005 parser - currently    
   16 Mar 07 03:16:42   
   
   From: tom@infoether.com   
      
   On Thu, 2007-03-08 at 19:54 -0500, Tommy Nordgren wrote:   
   > ANTLR supports unicode, but one point to consider with ANY tool, is   
   > that you will need a module that supports converting the input text
   > files to canonical utf-16.   
      
   JavaCC also handles Unicode characters; for example, this would tokenize
   an optional minus sign followed by the Unicode code point for "degrees
   in Fahrenheit" or "degrees in Celsius" followed by a couple of digits:
      
   TOKEN : {   
      " \u2109">   
     |  " \u2103">   
     | <#DIGITS : ["0"-"9"](["0"-"9"])*>   
   }   
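
   For comparison, here is a minimal plain-Java sketch of the same idea
   using java.util.regex instead of JavaCC. The class and method names
   (TemperatureToken, isTemperature) are hypothetical, and the pattern
   simply follows the description above: an optional minus sign, then
   U+2109 or U+2103, then one or more digits.

```java
import java.util.regex.Pattern;

public class TemperatureToken {
    // Optional minus sign, then U+2109 (DEGREE FAHRENHEIT) or
    // U+2103 (DEGREE CELSIUS), then one or more ASCII digits.
    // This mirrors the prose description, not JavaCC's internals.
    private static final Pattern TEMPERATURE =
        Pattern.compile("-?[\\u2109\\u2103][0-9]+");

    // Returns true when the whole string is a single temperature token.
    public static boolean isTemperature(String s) {
        return TEMPERATURE.matcher(s).matches();
    }

    public static void main(String[] args) {
        System.out.println(isTemperature("-\u210972")); // minus, Fahrenheit sign, digits
        System.out.println(isTemperature("\u210320"));  // Celsius sign, digits
        System.out.println(isTemperature("72"));        // no degree sign
    }
}
```

   Both code points are in the Basic Multilingual Plane, so each one
   fits in a single Java char and a char-based matcher handles them
   without special treatment.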
      
   JavaCC doesn't yet handle supplementary characters (those outside the   
   Basic Multilingual Plane).  But that's on our radar, so we shall see...   
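
   To illustrate why supplementary characters need extra work in a
   char-based tokenizer, here is a small Java sketch. It is not JavaCC
   code; the class and method names (SupplementaryDemo, codePoints,
   hasSupplementary) are hypothetical. It shows that a code point
   outside the BMP occupies two UTF-16 code units (a surrogate pair)
   in a Java String.

```java
public class SupplementaryDemo {
    // Counts Unicode code points rather than UTF-16 code units.
    public static int codePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    // True if any code point in s lies outside the Basic Multilingual Plane.
    public static boolean hasSupplementary(String s) {
        return s.codePoints().anyMatch(Character::isSupplementaryCodePoint);
    }

    public static void main(String[] args) {
        String bmp = "\u2109"; // DEGREE FAHRENHEIT, inside the BMP
        // U+1D11E MUSICAL SYMBOL G CLEF, outside the BMP
        String clef = new String(Character.toChars(0x1D11E));

        // length() counts chars (UTF-16 code units), not code points
        System.out.println(bmp.length() + " char(s), " + codePoints(bmp) + " code point(s)");
        System.out.println(clef.length() + " char(s), " + codePoints(clef) + " code point(s)");
    }
}
```

   A lexer that inspects one char at a time sees the two halves of the
   surrogate pair as separate units, which is why character classes
   covering supplementary ranges need explicit support.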
      
   Yours,   
      
   Tom   
      


