RPM submission script
Eric S. Raymond
esr at thyrsus.com
Fri Nov 7 21:27:14 UTC 2003
Justin Mason <jm at jmason.org>:
> er, UTF-8 *is* multibyte ;)
Well, technically, yes...but not the way people usually mean it (16-bit
chars like Java).
> However, using UTF-8 should be OK, alright, since UTF-8 multibyte
> sequences must contain bit 7 set in all chars, so "\n" and ":" will not
> show up as bytes. viz: http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
> notes 'no ASCII byte (0x00-0x7F) can appear as part of any other
> character.'
Exactly.
--
<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>
More information about the devel
mailing list