On Mon, 2009-03-30 at 16:26 -0400, John J. McDonough wrote:
> For Western European coding I don't think there are any multi-byte sequences.
The UTF-8 byte sequence for every Latin-1 character outside of ASCII (U+0080 through U+00FF) is two bytes long and starts with either 0xc2 or 0xc3.
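
If you want to check that for yourself, here is a minimal sketch, assuming Python 3 is available. The function name latin1_utf8_summary is only illustrative. It encodes each code point from U+0080 through U+00FF as UTF-8 and collects the sequence lengths and lead bytes it sees.

```python
def latin1_utf8_summary():
    """Return (lengths, lead_bytes) seen when encoding U+0080..U+00FF as UTF-8."""
    lengths = set()
    lead_bytes = set()
    for code_point in range(0x80, 0x100):
        # chr() gives the one-character string; encode() gives its UTF-8 bytes.
        encoded = chr(code_point).encode("utf-8")
        lengths.add(len(encoded))
        lead_bytes.add(encoded[0])
    return lengths, lead_bytes


lengths, lead_bytes = latin1_utf8_summary()
print(sorted(lengths))                        # [2]
print([hex(b) for b in sorted(lead_bytes)])   # ['0xc2', '0xc3']
```

U+0080 through U+00BF get the lead byte 0xc2 and U+00C0 through U+00FF get 0xc3, so accented Western European letters such as é (U+00E9, bytes 0xc3 0xa9) are always multi-byte in UTF-8.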