"Frank D. Engel, Jr." wrote:
--- Frank Heckenbach frank@g-n-u.de wrote:
Emil Jerabek wrote:
Indeed. Which leaves the question if 6.4.2.2 d) 2)/3) apply if only some letters exist. I wouldn't think so. So I propose the following character encoding for the "Really Stupid Pascal Compiler":
   0  -
   1  f
   2  A
   3  L
   4  s
   5  E
   6  u
   7  r
   8  T
   9  0
  10  1
  11  2
  ...
  18  9
  19  (space)
  20  .
  21  +
Great! A 5-bit encoding, still having 10 free slots for i18n extensions :)
Indeed. Perhaps we should add one Cyrillic, one Japanese and one Klingon letter for a start.
OTOH, if we don't add any more characters we could encode it in 4.5 bits which is certainly worth considering.
Particularly for PACKED data types ;-)
We can store 7 of them in a 32-bit record by using the technique of multiplying 7 values in the range 1..23, since 23^7 = 3404825447 is less than 2^32 = 4294967296. We can't allow a 0 char representation for this, but this allows expanding the character set by 1 item. This is 32/7 = 4.57 bits per char.
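For anyone who wants to try the packing, here is a minimal sketch in Python. It reads the "multiplying" technique as a base-23 positional encoding, which is my interpretation and not something spelled out above. The names ALPHABET, BASE, GROUP, pack and unpack are made up for illustration. Digits run 0..22 here, which is the 1..23 range shifted down by one, and digit 22 is the spare slot.

```python
import math

# The 22 characters from the table, in code order 0..21.
ALPHABET = "-fALsEurT0123456789 .+"
BASE = 23   # 22 characters plus one spare slot (assumption: spare is digit 22)
GROUP = 7   # characters per 32-bit word

# The whole scheme rests on 23**7 fitting below 2**32.
assert len(ALPHABET) == 22
assert BASE ** GROUP < 2 ** 32


def pack(chars: str) -> int:
    """Pack exactly GROUP characters into one integer below 2**32."""
    if len(chars) != GROUP:
        raise ValueError(f"expected {GROUP} characters, got {len(chars)}")
    word = 0
    for ch in chars:
        # str.index raises ValueError for characters outside the table.
        word = word * BASE + ALPHABET.index(ch)
    return word


def unpack(word: int) -> str:
    """Invert pack(); only valid for words that pack() produced."""
    out = []
    for _ in range(GROUP):
        word, digit = divmod(word, BASE)
        out.append(ALPHABET[digit])
    return "".join(reversed(out))


if __name__ == "__main__":
    w = pack("TruE 42")
    print(w, unpack(w))
    # Compare the per-character cost of each scheme in the thread.
    print(round(math.log2(len(ALPHABET)), 2), "bits: information-theoretic minimum")
    print(round(32 / GROUP, 2), "bits: 7 characters per 32-bit word")
    print(5, "bits: plain fixed-width encoding")
```

The round trip works because each character is one base-23 digit, so divmod peels them off again in reverse order. A word that did not come from pack() may contain the spare digit 22, which this sketch does not handle.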