On Fri, 30 Aug 2019 11:19:37 -0700
Kevin Fenzi <kevin(a)scrye.com> wrote:
On 8/28/19 2:13 PM, Peter Lemenkov wrote:
> Hello All,
>
> I've just got hit by this again - lockups on s390. Looks like I have
> 100% reproducer (just try to build Erlang and it will stuck
> eventually).
>
> *
https://koji.fedoraproject.org/koji/taskinfo?taskID=37327589
>
> Where should I open a ticket?
Bugzilla.redhat.com or somewhere else?
I'm really not fully sure. ;(
Perhaps a kernel bug and get the s390x folks involved?
I've already replied in
https://pagure.io/releng/issue/8711 - if F-30
builds are OK, the builders are the same, then it should be a change in
F-31+, perhaps glibc ... I'm going to give it a try locally.
Dan
> kevin
> --
>
> >
> > чт, 25 июл. 2019 г. в 17:44, Kevin Fenzi <kevin(a)scrye.com>:
> >>
> >> On 7/25/19 3:04 AM, Peter Lemenkov wrote:
> >>> Hello All!
> >>> It started to get stuck again. Right now I'm experiencing this
> >>> issue with RabbitMQ for F-30 and F-31:
> >>>
> >>> *
https://koji.fedoraproject.org/koji/taskinfo?taskID=36457376
> >>> *
https://koji.fedoraproject.org/koji/taskinfo?taskID=36457345
> >>
> >> So, yeah.
> >>
> >>> | |-kojid,31279 /usr/sbin/kojid --fg --force-lock --verbose
> >>> | | `-mock,31584 -tt /usr/libexec/mock/mock -r
> >>> | | koji/f30-build-16961487-1222718 --old-chroot --no-clean
> >>> | | --target s390x ... `-rpmbuild,32205 -bb --target s390x
> >>> | | --nodeps /builddir/build/SPECS/rabbitmq-server.spec `-sh,
> >>> | | 32237 -e /var/tmp/rpm-tmp.GwzEQt `-make,32238 -j4
> >>> | | VERSION=3.7.16 V=1 `-sh,32318 -c... `-make,2112
> >>> | | -C
/builddir/build/BUILD/rabbitmq-server-3.7.16/deps/amqp10_client
> >>> | | IS_DEP=1 `-make,2237 --no-print-directory app-build
> >>> | | `-beam.smp,2302 -sbtu -A0 -- -root /usr/lib64/erlang
> >>> | | -progname erl -- -home /builddir -- ...
> >>> | | |-{beam.smp},2303
> >>> | | |-{beam.smp},2304
> >>> | | |-erl_child_setup,2305
> >>> | | |1024 -{beam.smp},2306
> >>> | | |-{beam.smp},2307
> >>> | | |-{beam.smp},2308
> >>> | | |-{beam.smp},2309
> >>> | | |-{beam.smp},2310
> >>> | | |-{beam.smp},2311
> >>> | | |-{beam.smp},2312
> >>> | | |-{beam.smp},2313
> >>> | | |-{beam.smp},2314
> >>> | | |-{beam.smp},2315
> >>> | | |-{beam.smp},2316
> >>> | | |-{beam.smp},2317
> >>> | | |-{beam.smp},2318
> >>> | | |-{beam.smp},2319
> >>> | | |-{beam.smp},2320
> >>> | | |-{beam.smp},2321
> >>> | | |-{beam.smp},2322
> >>> | | |-{beam.smp},2323
> >>> | | |-{beam.smp},2324
> >>> | | `-{beam.smp},2325
> >>
> >> When I strace the 2302 process:
> >>
> >> strace: Process 2302 attached with 23 threads
> >> [pid 2324] ppoll([{fd=12, events=POLLIN|POLLRDNORM}], 1, NULL,
> >> NULL, 8 <unfinished ...>
> >> [pid 2320] futex(0x3ff58800550, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2321] futex(0x3ff58800590, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2319] futex(0x3ff58800510, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2318] futex(0x3ff588004d0, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2317] futex(0x3ff58800490, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2316] futex(0x3ff58800450, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2315] futex(0x3ff58800410, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2313] futex(0x3ff58800390, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2312] futex(0x3ff58800350, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2308] restart_syscall(<... resuming interrupted
> >> syscall_0xfffffffffffffdfc ...> <unfinish
> >> ed ...>
> >> [pid 2303] read(14, <unfinished ...>
> >> [pid 2302] select(0, NULL, NULL, NULL, NULL <unfinished ...>
> >> [pid 2309] restart_syscall(<... resuming interrupted select ...>
> >> <unfinished ...>
> >> [pid 2323] futex(0x3ff58800610, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2322] futex(0x3ff588005d0, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2314] futex(0x3ff588003d0, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2311] futex(0x3ff58800310, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2310] futex(0x3ff588002d0, FUTEX_WAIT_PRIVATE, 4294967295,
> >> NULL <unfinished ...>
> >> [pid 2306] restart_syscall(<... resuming interrupted
> >> syscall_0xfffffffffffffdfc ...> <unfinish
> >> ed ...>
> >> [pid 2304] futex(0x2aa3d9af520, FUTEX_WAIT_PRIVATE, 0, NULL
> >> <unfinished ...>
> >> [pid 2307] timerfd_settime(11, 0, {it_interval={tv_sec=0,
> >> tv_nsec=0}, it_value={tv_sec=0, tv_n
> >> sec=0}}, NULL) = 0
> >> [pid 2325] epoll_wait(4, <unfinished ...>
> >> [pid 2307] futex(0x3ff588001d0, FUTEX_WAKE_PRIVATE, 1) = 1
> >> [pid 2306] <... restart_syscall resumed>) = 0
> >> [pid 2307] fcntl(2, F_GETFL <unfinished ...>
> >> [pid 2306] timerfd_settime(11, 0, {it_interval={tv_sec=0,
> >> tv_nsec=0}, it_value={tv_sec=23, tv_
> >> nsec=941692107}}, <unfinished ...>
> >>
> >> and then... it starts going again. So something was stuck and
> >> strace unstuck it?
> >>
> >> So, it looks like some odd signal thing with s390x?
> >>
> >> Not sure, but perhaps we should file a bug and try and track it
> >> more?
> >>
> >> kevin
> >>
> >>
> >> _______________________________________________
> >> devel mailing list -- devel(a)lists.fedoraproject.org
> >> To unsubscribe send an email to devel-leave(a)lists.fedoraproject.org
> >> Fedora Code of Conduct:
> >>
https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List
> >> Guidelines:
https://fedoraproject.org/wiki/Mailing_list_guidelines
> >> List Archives:
> >>
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org
> >
> >
> >
>
>