RPM submission script

Eric S. Raymond esr at thyrsus.com
Fri Nov 7 21:27:14 UTC 2003


Justin Mason <jm at jmason.org>:
> er, UTF-8 *is* multibyte ;)

Well, technically, yes...but not the way people usually mean it (16-bit
chars like Java).
 
> However, using UTF-8 should be OK, alright, since UTF-8 multibyte
> sequences must contain bit 7 set in all chars, so "\n" and ":" will not
> show up as bytes. viz: http://www.cl.cam.ac.uk/~mgk25/unicode.html#utf-8
> notes 'no ASCII byte (0x00-0x7F) can appear as part of any other
> character.'

Exactly.
-- 
		<a href="http://www.catb.org/~esr/">Eric S. Raymond</a>





More information about the devel mailing list