home bbs files messages ]

Forums before death by AOL, social media and spammers... "We can't have nice things"

   comp.sys.atari.st      Discussion about 16 bit Atari micros      15,439 messages   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]

   Message 15,059 of 15,439   
   Arachide to All   
   Re: ASCII files   
   27 Mar 16 17:23:26   
   
   From: houten.van@orange.fr   
      
   Le 23/03/2016 14:35, Henk Robbers a écrit :   
   >   
   > LS   
   >   
   > I want to determine whether a file should not be   
   > considered ASCII text based on the percentage of non printable   
   > characters in the file.   
   >   
   > Which percentage is generally accepted?   
      
   It depends on the language I think.   
   In standard english, you don't use characters above 128. But in french   
   (and others), there a a lot of letters with accents that appear above 128.   
      
   So, if you consider that the file is in english, you should have a   
   maximum of:   
      
   - CR/LF (13 and 10)   
   - TAB (9)   
   - Space (32)   
   - characters from 33 to 127.   
      
   If the file is in another language, some characters above 128.   
      
   In my opinion, the marker for a text file would be the total lack of   
   bytes under 32 except 9/10/13.   
      
   Guillaume.   
      
   --- SoupGate-Win32 v1.05   
    * Origin: you cannot sedate... all the things you hate (1:229/2)   

[   << oldest   |   < older   |   list   |   newer >   |   newest >>   ]


(c) 1994,  bbs@darkrealms.ca