Please do not reply directly to this email. All additional comments should be made in the comments box of this bug report.
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=175513
Summary: UTF-8 error from sa-learn
Product: Fedora Core
Version: fc4
Platform: i386
OS/Version: Linux
Status: NEW
Severity: normal
Priority: normal
Component: spamassassin
AssignedTo: wtogami@redhat.com
ReportedBy: iny@iki.fi
CC: fedora-perl-devel-list@redhat.com,felicity@kluge.net,jm@jmason.org,parkerm@pobox.com,reg+redhat@sidney.com,wtogami@redhat.com
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7.12) Gecko/20050923 Fedora/1.7.12-1.5.1
Description of problem:
$ sa-learn --spam Maildir/.training.spam/cur/
Parsing of undecoded UTF-8 will give garbage when decoding entities at /usr/lib/perl5/vendor_perl/5.8.6/Mail/SpamAssassin/HTML.pm line 182.
Version-Release number of selected component (if applicable): spamassassin-3.0.4-2.fc4
How reproducible: Always
Steps to Reproduce: 1. Invoke sa-learn
Actual Results: Got this message.
Expected Results: Shouldn't have got it.
Additional info:
wtogami@redhat.com changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO_REPORTER
------- Additional Comments From wtogami@redhat.com 2005-12-12 01:35 EST -------
It would be helpful if you could isolate the message that causes that error, save it into its own mbox file, and attach that mbox file. We need the complete message including headers and everything intact.
------- Additional Comments From reg+redhat@sidney.com 2005-12-12 03:48 EST -------
This looks like upstream bug http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4046 which is fixed in the svn trunk, but not the 3.0 branch.
------- Additional Comments From wtogami@redhat.com 2005-12-12 03:57 EST -------
Hmm, would that patch apply in 3.0.5? I personally see this error often in my 3.0.5 testing.
------- Additional Comments From reg+redhat@sidney.com 2005-12-12 05:24 EST -------
The comments in that bug are confusing. None of the patches attached to that bug were applied. The bug was closed because other changes in trunk made the warning go away, and Justin determined that the problem was only cosmetic: the warnings did not affect rule hits.
The relevant code in the trunk version of HTML.pm looks like this:
  # Ignore stupid warning that can't be suppressed: 'Parsing of
  # undecoded UTF-8 will give garbage when decoding entities at ..' (bug 4046)
  {
    local $SIG{__WARN__} = sub {
      warn @_ unless (defined $_[0] && $_[0] =~ /^Parsing of undecoded UTF-/);
    };

    $self->SUPER::parse($text);
  }
In 3.0 there is a call to
$hp->parse(pack ('C0A*', $text));
at or near line 182, instead of a call to $self->SUPER::parse($text); and that's what you would wrap the block around.
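As a rough illustration of the wrapping technique only (not the actual 3.0 patch), the warning filter can be tried out in isolation. In this sketch `noisy_parse` and `quiet_utf8_parse` are hypothetical names invented for the example, and `noisy_parse` is just a stand-in that emits the warning text from this report instead of calling the real HTML parser:

```perl
use strict;
use warnings;

# Hypothetical stand-in for the parser call: it emits the warning from
# this report plus one unrelated warning, then returns the input length.
sub noisy_parse {
    my ($text) = @_;
    warn "Parsing of undecoded UTF-8 will give garbage when decoding entities\n";
    warn "some other warning\n";
    return length $text;
}

# Hypothetical helper: run a code block with the 'Parsing of undecoded
# UTF-' warning dropped. Any other warning is handed to the handler that
# was active before, or re-emitted with warn if there was none.
sub quiet_utf8_parse {
    my ($code) = @_;
    my $outer = $SIG{__WARN__};
    local $SIG{__WARN__} = sub {
        return if defined $_[0] && $_[0] =~ /^Parsing of undecoded UTF-/;
        if (ref($outer) eq 'CODE') { $outer->(@_) } else { warn @_ }
    };
    return $code->();
}

# Only the unrelated warning should reach STDERR here.
my $len = quiet_utf8_parse(sub { noisy_parse('<p>caf&eacute;</p>') });
```

In the 3.0 code, the body run under the filter would presumably be the `$hp->parse(pack ('C0A*', $text));` call quoted above. Because the handler is installed with `local`, it is removed again as soon as the enclosing block exits.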
------- Additional Comments From iny@iki.fi 2005-12-12 11:01 EST ------- Created an attachment (id=122138) --> (https://bugzilla.redhat.com/bugzilla/attachment.cgi?id=122138&action=vie...) mail that causes the bug
$ sa-learn --spam sa-learn-bug-mail
Parsing of undecoded UTF-8 will give garbage when decoding entities at /usr/lib/perl5/vendor_perl/5.8.6/Mail/SpamAssassin/HTML.pm line 182.
Learned from 0 message(s) (1 message(s) examined).
orion@cora.nwra.com changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO_REPORTER           |ASSIGNED
                 CC|                            |orion@cora.nwra.com
------- Additional Comments From orion@cora.nwra.com 2006-05-17 11:11 EST -------
Seems like all relevant information has been provided.
Christian.Iseli@licr.org changed:
           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|ASSIGNED                    |NEEDINFO
------- Additional Comments From Christian.Iseli@licr.org 2007-01-19 19:23 EST -------
This report targets the FC3 or FC4 products, which have now been EOL'd.
Could you please check that it still applies to a current Fedora release, and either update the target product or close it?
Thanks.