Just a sample of the Echomail archive
[ << oldest | < older | list | newer > | newest >> ]
|  Message 2236  |
|  Vitaliy Aksyonov to Michiel van der Vlist  |
|  Re: Need volonteers to test another patc  |
|  03 Mar 24 16:31:50  |
 REPLY: 2:280/5555 65e4ee8f MSGID: 1:104/117 65e50984 CHRS: US-ASCII 2 TZUTC: -0700 TID: hpt/lnx 1.9 2022-07-03 Hello Michiel. 03 Mar 24 22:31, you wrote to Nicholas Boel: NB>> So, at this point it's basically working and displaying properly, NB>> but then comes in the 'characters' vs 'bytes' thing that Golded NB>> isn't supporting, so it is wrapping what it thinks is double, NB>> triple, or even quadruple the amount of 'characters' that are NB>> there. MvdV> When it comes to mapping the number of characters to the number of MvdV> bytes, when you look at the UTF-8 encoding table, about two screens MvdV> down here: MvdV> https://en.wikipedia.org/wiki/UTF-8 MvdV> You can see that the length of the byte sequence can be determined MvdV> just by looking at the first byte. Look from bit 7 to the right. The MvdV> number of '1' bits equals the number of bytes in the character. All MvdV> the follow up bytes start with '10'. So to get the number of MvdV> characters ignore the bytes starting with '10' when counting the MvdV> bytes. MvdV> Breaking a line should only occur /before/ a byte starting with '0' or MvdV> '11'. MvdV> Knowing all that it should be doable to let Golded display properly. MvdV> Perhaps the best strategy is to have Golded alway use UTF-8 MvdV> internally. Almost everyone else does these days... That would be perfect. It only takes huge amount of effort. Especially with keeping code backward compatible with systems, which may not have Unicode support. I keep thinking about it and looking for possible ways to implement. For now I'd be happy to make iconv work properly. In this case GoldEd user may get rid of most (if not all) of translation tables. The problem is that source code has huge amounts of duplicated code and sometimes functions and variables names say nothing about what do they do. I spend huge amount of time just to understand what's going on. For example function, which splits message to lines is almost 1000 lines long! It has variables, used in multiple places, it not only splits the message, but guess charset, do recoding and other fun stuff. Vitaliy --- GoldED+/LNX 1.1.5-b20240223 * Origin: Aurora, Colorado (1:104/117) SEEN-BY: 15/0 18/200 50/109 90/1 104/117 105/81 106/201 128/260 129/305 SEEN-BY: 135/225 153/7715 218/700 226/30 227/114 229/110 112 113 206 SEEN-BY: 229/307 317 400 426 428 470 664 700 266/512 280/464 5555 SEEN-BY: 282/1038 291/111 292/854 301/1 320/219 322/757 342/200 396/45 SEEN-BY: 460/16 58 256 1124 5858 463/68 467/888 633/280 712/848 3634/12 SEEN-BY: 5000/111 5001/100 5005/49 5015/46 5020/828 846 1042 4441 SEEN-BY: 5030/49 5054/8 30 5061/133 5075/128 5083/444 5090/958 PATH: 104/117 5020/1042 460/58 229/426 |
[ << oldest | < older | list | newer > | newest >> ]