[Bug 135975] Perl's 'study' function breaks regexp matching
by Red Hat Bugzilla
Please do not reply directly to this email. All additional
comments should be made in the comments box of this bug report.
Summary: Perl's 'study' function breaks regexp matching
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=135975
jvdias(a)redhat.com changed:
              What |Removed                     |Added
----------------------------------------------------------------------------
            Status |MODIFIED                    |ASSIGNED
 BugsThisDependsOn |                            |172792
------- Additional Comments From jvdias(a)redhat.com 2005-11-09 15:45 EST -------
The earlier fix only partially resolved the problem: the bug still occurs when
UTF-8 support is enabled, which it is by default because the default locale
is en_US.UTF-8.
Workaround: run your perl scripts with 'LC_ALL=C'.
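A minimal sketch of the workaround, assuming a POSIX shell. The helper name run_in_c_locale and the script name in the final comment are hypothetical. Prefixing a command with LC_ALL=C changes the locale for that one command only, so the rest of the session keeps its UTF-8 locale.

```shell
#!/bin/sh
# Hypothetical helper: run one external command in the C locale
# without changing the locale of the calling shell.
run_in_c_locale() {
    LC_ALL=C "$@"
}

# The child process sees LC_ALL=C.
run_in_c_locale sh -c 'echo "LC_ALL=$LC_ALL"'

# Intended use (script name is an assumption):
#   run_in_c_locale perl myscript.pl
```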
--
Configure bugmail: https://bugzilla.redhat.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
use of study() with utf8 enabled breaks regexps
by Jason Vas Dias
To: perlbug(a)perl.org
Subject: use of study() with utf8 enabled breaks regexps
Cc: fedora-perl-devel-list(a)redhat.com
Reply-To: jvdias(a)redhat.com
Message-Id: <5.8.7_481_1131565834@jvdias>
This is a bug report for perl from jvdias(a)redhat.com,
generated with the help of perlbug 1.35 running under perl v5.8.7.
-----------------------------------------------------------------
[Please enter your report here]
Use of study() with utf8 support enabled breaks perl-5.8.7's
regular expressions :
OK without UTF:
$ echo 'ABDCEFGHIJK' |
perl -pe 'study; s/HIJK/1234/;'
ABDCEFG1234
$ echo 'ABCDEFGHIJK' |
perl -e '$_=<>; study; print /HIJK/,"\n";'
1
FAILS with UTF:
$ echo 'ABDCEFGHIJK' |
PERL_UNICODE=31 perl -pe 'study; s/HIJK/1234/;'
ABDCEFGHIJK
$ echo 'ABCDEFGHIJK' |
PERL_UNICODE=31 perl -e '$_=<>; study; print /HIJK/,"\n";'
(re did not match)
Seems to be study() that is the culprit:
$ echo 'ABDCEFGHIJK' |
PERL_UNICODE=31 perl -pe 's/HIJK/1234/;'
ABDCEFG1234
And it is because $_ gets utf8-ness from STDIN:
$ echo 'ABDCEFGHIJK' |
PERL_UNICODE=63 perl -e '$_=<>; study; print /HIJK/ ? "OK" : "FAIL","\n";'
FAIL
$ echo 'ABDCEFGHIJK' |
perl -e '$_=<>; study; print /HIJK/ ? "OK" : "FAIL","\n";'
OK
$ echo 'ABDCEFGHIJK' |
PERL_UNICODE=63 perl -e '$_=<>; print /HIJK/ ? "OK" : "FAIL","\n";'
OK
This was in the 'en_US.UTF-8' locale. If I make utf-8 support
conditional on locale, the problem goes away for the C locale:
$ echo 'ABDCEFGHIJK' |
PERL_UNICODE=127 LC_ALL=C perl -e '$_=<>; study; print /HIJK/ ? "OK" : "FAIL","\n";'
OK
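The PERL_UNICODE values used above are bitmasks of the flags documented for perl's -C switch in perlrun. As a rough aid, the sketch below (plain POSIX shell; the function name decode_perl_unicode is made up for illustration) spells out which flags each value sets. It suggests why 31 and 63 both affect STDIN (the I flag), and why 127 behaves differently under LC_ALL=C: it adds the L flag, which makes the other settings conditional on the locale.

```shell
#!/bin/sh
# Hypothetical helper: list the perlrun -C flag letters set in a
# PERL_UNICODE numeric value.
decode_perl_unicode() {
    mask=$1
    out=""
    # bit:letter pairs as documented in perlrun for the -C switch
    for pair in 1:I 2:O 4:E 8:i 16:o 32:A 64:L 256:a; do
        bit=${pair%%:*}
        letter=${pair##*:}
        if [ $(( mask & bit )) -ne 0 ]; then
            out="$out$letter"
        fi
    done
    echo "$mask = $out"
}

decode_perl_unicode 31    # prints: 31 = IOEio
decode_perl_unicode 63    # prints: 63 = IOEioA
decode_perl_unicode 127   # prints: 127 = IOEioAL
```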
[Please do not change anything below this line]
-----------------------------------------------------------------
---
Flags:
category=core
severity=medium
---
This perlbug was built using Perl v5.8.7 in the Red Hat build system.
It is being executed now by Perl v5.8.7 - Tue Nov 8 16:24:03 EST 2005.
Site configuration information for perl v5.8.7:
Configured by Red Hat, Inc. at Tue Nov 8 16:24:03 EST 2005.
Summary of my perl5 (revision 5 version 8 subversion 7) configuration:
Platform:
osname=linux, osvers=2.6.14-1.1655_fc5, archname=i386-linux-thread-multi
uname='linux jvdias 2.6.14-1.1655_fc5 #1 tue nov 8 06:55:58 est 2005 i686 i686 i386 gnulinux '
config_args='-des -Doptimize=-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=4 -m32 -march=i386 -mtune=pentium4 -fasynchronous-unwind-tables -Dversion=5.8.7 -Dmyhostname=localhost -Dperladmin=root@localhost -Dcc=gcc -Dcf_by=Red Hat, Inc. -Dinstallprefix=/usr -Dprefix=/usr -Darchname=i386-linux -Dvendorprefix=/usr -Dsiteprefix=/usr -Duseshrplib -Dusethreads -Duseithreads -Duselargefiles -Dd_dosuid -Dd_semctl_semun -Di_db -Ui_ndbm -Di_gdbm -Di_shadow -Di_syslog -Dman3ext=3pm -Duseperlio -Dinstallusrbinperl=n -Ubincompat5005 -Uversiononly -Dpager=/usr/bin/less -isr -Dd_gethostent_r_proto -Ud_endhostent_r_proto -Ud_sethostent_r_proto -Ud_endprotoent_r_proto -Ud_setprotoent_r_proto -Ud_endservent_r_proto -Ud_setservent_r_proto -Dinc_version_list=5.8.6 5.8.5 5.8.4 5.8.3 -Dscriptdir=/usr/bin'
hint=recommended, useposix=true, d_sigaction=define
usethreads=define use5005threads=undef useithreads=define usemultiplicity=define
useperlio=define d_sfio=undef uselargefiles=define usesocks=undef
use64bitint=undef use64bitall=undef uselongdouble=undef
usemymalloc=n, bincompat5005=undef
Compiler:
cc='gcc', ccflags ='-D_REENTRANT -D_GNU_SOURCE -DDEBUGGING -fno-strict-aliasing -pipe -I/usr/local/include -D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -I/usr/include/gdbm',
optimize='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions -fstack-protector --param=ssp-buffer-size=4 -m32 -march=i386 -mtune=pentium4 -fasynchronous-unwind-tables',
cppflags='-D_REENTRANT -D_GNU_SOURCE -DDEBUGGING -fno-strict-aliasing -pipe -I/usr/local/include -I/usr/include/gdbm'
ccversion='', gccversion='4.0.2 20051007 (Red Hat 4.0.2-3)', gccosandvers=''
intsize=4, longsize=4, ptrsize=4, doublesize=8, byteorder=1234
d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=12
ivtype='long', ivsize=4, nvtype='double', nvsize=8, Off_t='off_t', lseeksize=8
alignbytes=4, prototype=define
Linker and Libraries:
ld='gcc', ldflags =' -L/usr/local/lib'
libpth=/usr/local/lib /lib /usr/lib
libs=-lresolv -lnsl -lgdbm -ldb -ldl -lm -lcrypt -lutil -lpthread -lc
perllibs=-lresolv -lnsl -ldl -lm -lcrypt -lutil -lpthread -lc
libc=/lib/libc-2.3.90.so, so=so, useshrplib=true, libperl=libperl.so
gnulibc_version='2.3.90'
Dynamic Linking:
dlsrc=dl_dlopen.xs, dlext=so, d_dlsymun=undef, ccdlflags='-Wl,-E -Wl,-rpath,/usr/lib/perl5/5.8.7/i386-linux-thread-multi/CORE'
cccdlflags='-fPIC', lddlflags='-shared -L/usr/local/lib'
Locally applied patches:
---
@INC for perl v5.8.7:
/usr/lib/perl5/site_perl/5.8.7/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.6/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.7
/usr/lib/perl5/site_perl/5.8.6
/usr/lib/perl5/site_perl/5.8.5
/usr/lib/perl5/site_perl/5.8.4
/usr/lib/perl5/site_perl/5.8.3
/usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.7/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.6/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.5/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.4/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.3/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.7
/usr/lib/perl5/vendor_perl/5.8.6
/usr/lib/perl5/vendor_perl/5.8.5
/usr/lib/perl5/vendor_perl/5.8.4
/usr/lib/perl5/vendor_perl/5.8.3
/usr/lib/perl5/vendor_perl
/usr/lib/perl5/5.8.7/i386-linux-thread-multi
/usr/lib/perl5/5.8.7
.
---
Environment for perl v5.8.7:
HOME=/home/boston/jvdias
LANG=en_US.UTF-8
LANGUAGE (unset)
LD_LIBRARY_PATH (unset)
LOGDIR (unset)
PATH=/usr/kerberos/bin:/usr/local/bin:/usr/bin:/bin:/usr/X11R6/bin:/usr/games:/home/boston/jvdias/bin
PERL_BADLANG (unset)
SHELL=/bin/bash
[Bug 136009] MakeMaker::MM_Unix doesn't honor LD_RUN_PATH requirements
by Red Hat Bugzilla
Summary: MakeMaker::MM_Unix doesn't honor LD_RUN_PATH requirements
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=136009
jvdias(a)redhat.com changed:
              What |Removed                     |Added
----------------------------------------------------------------------------
            Status |ASSIGNED                    |MODIFIED
------- Additional Comments From jvdias(a)redhat.com 2005-11-09 13:10 EST -------
This is now fixed - MM_Unix uses upstream method of preventing use of empty
LD_RUN_PATH while allowing non-empty LD_RUN_PATH to be used.
Patch applied in:
perl-5.8.7-7 (FC5), perl-5.8.6-16 (FC4), perl-5.8.5-17 (RHEL4),
perl-5.8.0-90 (RHEL-3).
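The rule described above can be pictured with a small shell sketch. This is not the MM_Unix code itself: the function name link_prefix and the example path are hypothetical, and the sketch only illustrates "emit an LD_RUN_PATH setting when the value is non-empty, otherwise emit nothing".

```shell
#!/bin/sh
# Hypothetical illustration of the rule, not the actual MakeMaker code:
# produce an LD_RUN_PATH assignment only when the value is non-empty.
link_prefix() {
    if [ -n "$1" ]; then
        printf 'LD_RUN_PATH="%s" ' "$1"
    fi
}

# Non-empty value: the link command line is prefixed with the setting.
echo "$(link_prefix "/opt/example/lib")gcc -shared ..."
# Empty value: no LD_RUN_PATH appears at all.
echo "$(link_prefix "")gcc -shared ..."
```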
[Bug 172153] New: /etc/mail is not an ideal place for spamassasin config files
by Red Hat Bugzilla
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=172153
Summary: /etc/mail is not an ideal place for spamassasin config files
Product: Fedora Core
Version: devel
Platform: All
OS/Version: Linux
Status: NEW
Severity: normal
Priority: normal
Component: spamassassin
AssignedTo: wtogami(a)redhat.com
ReportedBy: rcoker(a)redhat.com
CC: fedora-perl-devel-list@redhat.com,felicity@kluge.net,jm(a)jmason.org,parkerm@pobox.com,reg+redhat@sidney.com,wtogami(a)redhat.com
From Bugzilla Helper:
User-Agent: Mozilla/5.0 (compatible; Konqueror/3.4; Linux) KHTML/3.4.92 (like Gecko)
Description of problem:
The directory /etc/mail is for the Sendmail MTA. When an MTA other than
Sendmail is in use, nothing but SpamAssassin uses it. It doesn't make
sense to have this implicit connection between SpamAssassin and Sendmail.
Could you please change SpamAssassin to use /etc/spamassasin for its config
files, with a symlink for compatibility if necessary?
Version-Release number of selected component (if applicable):
How reproducible:
Always
Steps to Reproduce:
1.
2.
3.
Additional info:
[Bug 145215] garbage after split()
by Red Hat Bugzilla
Summary: garbage after split()
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=145215
jvdias(a)redhat.com changed:
              What |Removed                     |Added
----------------------------------------------------------------------------
            Status |NEW                         |MODIFIED
------- Additional Comments From jvdias(a)redhat.com 2005-11-08 21:02 EST -------
We apologize for the long delay in processing this bug report.
This bug is fixed in perl-5.8.0-90.2, which should be in the next RHEL-3
update release.
[Bug 157071] Perl doesn't lovercase accented caracters in UTF-8
by Red Hat Bugzilla
Summary: Perl doesn't lovercase accented caracters in UTF-8
https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=157071
jvdias(a)redhat.com changed:
              What |Removed                     |Added
----------------------------------------------------------------------------
            Status |NEW                         |CLOSED
        Resolution |                            |NOTABUG
------- Additional Comments From jvdias(a)redhat.com 2005-11-08 17:32 EST -------
Yes, I know the perl unicode implementation is far from user-friendly
or intuitive - this is an upstream issue that is being addressed - but
it does work (just) if used correctly.
perl's lc / uc DO work for UTF-8, IF the UTF-8 is properly encoded, AND perl is
running in wide-character mode, AND the characters have defined upper/lower
case counterparts in your current locale.
These examples should expose the issues - I suggest you also read the
perlunicode and perllocale man pages.
$ perl -C -e 'use locale; use utf8; use Encode qw(decode);
$s=decode(utf8,"\xc5\x99\xc4\x9b"); print uc $s,"\n";'
ŘĚ
$ perl -C -e 'use locale; use utf8; use Encode qw(decode);
$s=decode(utf8,"\xc5\x99\xc4\x9b"); print $s,"\n";'
řě
$ perl -e 'use Encode qw(decode); $s=decode(utf8,"\xc5\x99\xc4\x9b"); print
$s,"\n";'
Wide character in print at -e line 1.
řě
$ perl -C -e 'use Encode qw(decode); $s=decode(utf8,"\xc5\x99\xc4\x9b"); print
$s,"\n";'
řě
$ PERL_UNICODE=31 perl -e 'use Encode qw(decode);
$s=decode(utf8,"\xc5\x99\xc4\x9b"); print uc $s,"\n";'
ŘĚ
$ PERL_UNICODE=31 perl -e 'use Encode qw(decode); $s=decode(utf8,"ŘĚ"); print
lc $s,"\n";'
řě