----- Original Message ----- From: "Ignacio Vazquez-Abrams" ivazqueznet@gmail.com To: "For participants of the Documentation Project" fedora-docs-list@redhat.com Sent: Monday, March 30, 2009 3:51 PM Subject: Re: More Publican Pain
0xc2 is the first byte of a 2-byte UTF-8 sequence for a character between U+0080 and U+00BF.
Ahh, that makes sense then. It is always followed by a non-breaking space, 0xa0. For Western European coding I don't think there are any multi-byte sequences. So the problem is entirely on the codepage.
--McD