[Bug 638786] New: Review Request: guessencoding - Guess encoding of files and return configured reader

bugzilla at redhat.com bugzilla at redhat.com
Wed Sep 29 22:37:13 UTC 2010


Please do not reply directly to this email. All additional
comments should be made in the comments box of this bug.

Summary: Review Request: guessencoding - Guess encoding of files and return configured reader

https://bugzilla.redhat.com/show_bug.cgi?id=638786

           Summary: Review Request: guessencoding - Guess encoding of
                    files and return configured reader
           Product: Fedora
           Version: rawhide
          Platform: All
        OS/Version: Linux
            Status: NEW
          Severity: medium
          Priority: medium
         Component: Package Review
        AssignedTo: nobody at fedoraproject.org
        ReportedBy: orion at cora.nwra.com
         QAContact: extras-qa at fedoraproject.org
                CC: notting at redhat.com, fedora-package-review at redhat.com
   Estimated Hours: 0.0
    Classification: Fedora


Spec URL: http://www.cora.nwra.com/~orion/fedora/guessencoding.spec
SRPM URL:
http://www.cora.nwra.com/~orion/fedora/guessencoding-1.2-1.fc13.src.rpm
Description:
The purpose of this library is to "guess" the encoding of files, and retrieve
a reader that is properly configured to use the right encoding as guessed.
The library is able to recognize the various Unicode encoding variants:

    * UTF-8
    * UTF-16LE - Low Endian
    * UTF-16BE - Big Endian
    * UTF-32

If a Unicode encoding isn't recognized, it's an 8-bit encoding. If the 8-bit
encoding is not US-ASCII, the default platform 8-bit encoding is assumed
whatever it is. However, the library cannot guess between different 8-bit
encodings. Only statistical analysis, n-grams and similar techniques specific
to each language used in those files can help guessing the encoding, but this
is not supported by the library.

-- 
Configure bugmail: https://bugzilla.redhat.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.



More information about the package-review mailing list