'port_options' or 'port_select'
by Mariusz Mazur
man teamd.conf:
runner.agg_select_policy (string)
(...)
*port_options* — Aggregator with highest priority according to per-port
Actual code:
static const char *lacp_agg_select_policy_names_list[] = {
"lacp_prio", "lacp_prio_stable", "bandwidth", "count", "*port_config*",
};
I'd send a patch, but I have no idea which one to fix. Jiri?
7 years, 8 months
Stacked ifaces teardown results in kernel locking
by Mariusz Mazur
I have three team interfaces:
- sw1 and sw2, both are three-NIC lacps
- and an activebackup called team0 (I'm open to better naming suggestions
:) on top of sw1+sw2.
Here is how vlan interface teardown looks like using the unstacked sw1
interface:
[root@vc1n3 ~]# systemctl start teamd@sw1
[root@vc1n3 ~]# ip l s sw1 up
[root@vc1n3 ~]# ip link add link sw1 name lan.246 type vlan id 246
[root@vc1n3 ~]# ip l s lan.246 up
[root@vc1n3 ~]# systemctl stop teamd@sw1
[root@vc1n3 ~]#
Everything went fine, both sw1 and lan.246@sw1 are now gone.
Now let's try this with stacked team interfaces:
[root@vc1n3 ~]# systemctl start teamd@sw1 teamd@sw2
[root@vc1n3 ~]# systemctl start teamd@team0
[root@vc1n3 ~]# ip l s team0 up
[root@vc1n3 ~]# ip link add link team0 name lan.246 type vlan id 246
[root@vc1n3 ~]# ip l s lan.246 up
[root@vc1n3 ~]# systemctl stop teamd@team0
… and it freezes. I can't tell you what the state of the network interfaces
is, because running 'ip link show' also freezes. The teamd process becomes
unkillable. Only thing left to do now is to hard reset the system, cause I
can't do a proper shutdown in any shape or form.
Some info:
- libteam git master from today
- kernel 4.7.0
- full teamd configs are at the very end of this email
Logs:
Aug 09 08:54:58 vc1n3 polkitd[10340]: Registered Authentication Agent for
unix-process:11455:5149256 (system bus name :1.369 [/usr/bin/pkttyagent
--notify-fd 5 --fallback], object path
/org/freedesktop/PolicyKit1/AuthenticationAgent, locale C.utf-8)
Aug 09 08:54:58 vc1n3 kernel: team0: Port device sw2 removed
Aug 09 08:54:58 vc1n3 systemd[1]: Stopping Team Daemon for device team0...
Aug 09 08:54:58 vc1n3 teamd[11418]: Got SIGINT, SIGQUIT or SIGTERM.
Aug 09 08:55:35 vc1n3 polkitd[10340]: Unregistered Authentication Agent for
unix-process:11455:5149256 (system bus name :1.369, object path
/org/freedesktop/PolicyKit1/AuthenticationAgent, locale C.utf-8)
(disconnected from bus)
Aug 09 08:56:01 vc1n3 systemd[1]: Started Session c163 of user root.
Aug 09 08:56:01 vc1n3 systemd[1]: Starting Session c163 of user root.
Aug 09 08:56:01 vc1n3 CROND[11489]: (root) CMD
(/usr/bin/update_axeos_certificates --quiet)
Aug 09 08:56:28 vc1n3 systemd[1]: teamd(a)team0.service: State 'stop-sigterm'
timed out. Killing.
Aug 09 08:57:58 vc1n3 systemd[1]: teamd(a)team0.service: Processes still
around after SIGKILL. Ignoring.
Aug 09 08:58:46 vc1n3 kernel: INFO: task snmpd:2217 blocked for more than
120 seconds.
Aug 09 08:58:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 08:58:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 08:58:46 vc1n3 kernel: snmpd D ffff88107fc96800 0
2217 1 0x00000000
Aug 09 08:58:46 vc1n3 kernel: ffff88085b0c2580 0000000000000020
ffff881052e62480 ffff88085ac5c000
Aug 09 08:58:46 vc1n3 kernel: ffff88085ac5bdc0 ffffffff81889624
ffff88085b0c2580 0000000000000246
Aug 09 08:58:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 08:58:46 vc1n3 kernel: Call Trace:
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813d6697>] ? dev_ioctl+0x2e7/0x570
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811971f4>] ? clear_inode+0x14/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff810e848a>] ?
cgroup_sk_alloc+0x2a/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a20e6>] ?
sock_do_ioctl+0x36/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a253e>] ? sock_ioctl+0x19e/0x240
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8118fe94>] ?
do_vfs_ioctl+0x84/0x5a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81181076>] ? alloc_file+0x16/0xa0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a1fc5>] ?
sock_alloc_file+0x95/0x110
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81190421>] ? SyS_ioctl+0x71/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a4653>] ? SyS_socket+0x63/0xa0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 08:58:46 vc1n3 kernel: INFO: task kworker/3:0:11165 blocked for more
than 120 seconds.
Aug 09 08:58:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 08:58:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 08:58:46 vc1n3 kernel: kworker/3:0 D ffff88085fd96800 0
11165 2 0x00000000
Aug 09 08:58:46 vc1n3 kernel: Workqueue: ipv6_addrconf addrconf_verify_work
Aug 09 08:58:46 vc1n3 kernel: ffff88085ade8000 0000000000000000
0000000000000000 ffff8808557d4000
Aug 09 08:58:46 vc1n3 kernel: ffff8808557d3e18 ffffffff81889624
ffff88085ade8000 0000000000000246
Aug 09 08:58:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 08:58:46 vc1n3 kernel: Call Trace:
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff810286f4>] ?
__switch_to+0x244/0x570
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8145e9f5>] ?
addrconf_verify_work+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81086de5>] ?
process_one_work+0x145/0x3d0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff810870d0>] ?
worker_thread+0x60/0x4a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81497e67>] ? __schedule+0x1e7/0x670
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81087070>] ?
process_one_work+0x3d0/0x3d0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8108beea>] ? kthread+0xca/0xe0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149c79f>] ?
ret_from_fork+0x1f/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8108be20>] ?
kthread_create_on_node+0x180/0x180
Aug 09 08:58:46 vc1n3 kernel: INFO: task teamd:11385 blocked for more than
120 seconds.
Aug 09 08:58:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 08:58:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 08:58:46 vc1n3 kernel: teamd D ffff88107fd16800 0
11385 1 0x00000000
Aug 09 08:58:46 vc1n3 kernel: ffff88105aed3e80 ffff88085b8a1600
00000000000000db ffff8810545f4000
Aug 09 08:58:46 vc1n3 kernel: ffff8810545f3c78 ffffffff81889624
ffff88105aed3e80 0000000000000246
Aug 09 08:58:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 08:58:46 vc1n3 kernel: Call Trace:
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81190dc0>] ?
poll_select_copy_remaining+0x120/0x120
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813cfa00>] ?
rtnetlink_rcv+0x10/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a4538>] ?
___sys_recvmsg+0x148/0x1c0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811bdc34>] ?
ep_send_events_proc+0x74/0x170
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811be517>] ? ep_poll+0x127/0x350
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 08:58:46 vc1n3 kernel: INFO: task teamd:11386 blocked for more than
120 seconds.
Aug 09 08:58:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 08:58:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 08:58:46 vc1n3 kernel: teamd D ffff88107fc96800 0
11386 1 0x00000000
Aug 09 08:58:46 vc1n3 kernel: ffff88105aed7080 ffff88085abf1f00
00000000000000db ffff881057ff4000
Aug 09 08:58:46 vc1n3 kernel: ffff881057ff3c78 ffffffff81889624
ffff88105aed7080 0000000000000246
Aug 09 08:58:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 08:58:46 vc1n3 kernel: Call Trace:
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81190dc0>] ?
poll_select_copy_remaining+0x120/0x120
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813cfa00>] ?
rtnetlink_rcv+0x10/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a4538>] ?
___sys_recvmsg+0x148/0x1c0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811bdc34>] ?
ep_send_events_proc+0x74/0x170
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8109b627>] ? update_curr+0xa7/0x130
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff810a2e72>] ?
pick_next_task_fair+0x2c2/0x490
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 08:58:46 vc1n3 kernel: INFO: task teamd:11418 blocked for more than
120 seconds.
Aug 09 08:58:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 08:58:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 08:58:46 vc1n3 kernel: teamd D ffff88107fc16800 0
11418 1 0x00000004
Aug 09 08:58:46 vc1n3 kernel: ffff88105af10000 0000000000000001
0000000000000000 ffff881057fe4000
Aug 09 08:58:46 vc1n3 kernel: ffff881057fe3750 ffff88085560f814
ffff88105af10000 0000000000000246
Aug 09 08:58:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffff88085560f810 ffffffff814985b5
Aug 09 08:58:46 vc1n3 kernel: Call Trace:
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81429f63>] ?
inetdev_event+0x23/0x4b0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 08:58:46 vc1n3 kernel: [<ffffffffa0828e34>] ?
team_device_event+0xe4/0x110 [team]
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8108cc00>] ?
notifier_call_chain+0x40/0x60
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813c1ef1>] ?
netdev_update_features+0x21/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813c1c72>] ?
__netdev_update_features+0x2e2/0x540
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813bcda7>] ? netdev_info+0x57/0x60
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813c204d>] ?
netdev_change_features+0xd/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffffa0828cde>] ?
team_port_del+0x1ae/0x1e0 [team]
Aug 09 08:58:46 vc1n3 kernel: [<ffffffffa0828d38>] ?
team_del_slave+0x28/0x40 [team]
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813d0650>] ? do_setlink+0x320/0xb60
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811be3bd>] ?
ep_scan_ready_list+0x1ad/0x1c0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811be3d0>] ?
ep_scan_ready_list+0x1c0/0x1c0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811bdfc2>] ?
ep_call_nested.constprop.10+0xe2/0x100
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813d15a1>] ?
rtnl_newlink+0x621/0x870
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813d1084>] ?
rtnl_newlink+0x104/0x870
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813d1c01>] ?
rtnl_getlink+0x121/0x1a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813cfaa8>] ?
rtnetlink_rcv_msg+0x88/0x210
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81177b7b>] ?
__kmalloc_node_track_caller+0x2b/0x210
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813cfa20>] ?
rtnetlink_rcv+0x30/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed82c>] ?
netlink_rcv_skb+0x9c/0xc0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813cfa0f>] ?
rtnetlink_rcv+0x1f/0x30
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811978d4>] ?
generic_update_time+0x74/0xc0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff81198f69>] ? touch_atime+0x99/0xa0
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff811871fa>] ? pipe_read+0x2ca/0x300
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8117e4fa>] ? __vfs_read+0xba/0x110
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 08:58:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 08:59:28 vc1n3 systemd[1]: teamd(a)team0.service: State
'stop-final-sigterm' timed out. Killing.
Aug 09 09:00:01 vc1n3 systemd[1]: Started Session c164 of user root.
Aug 09 09:00:01 vc1n3 CROND[11542]: (root) CMD (exec /usr/lib/sa/sa1 600 6)
Aug 09 09:00:01 vc1n3 systemd[1]: Starting Session c164 of user root.
Aug 09 09:00:01 vc1n3 CROND[11543]: (root) CMD
(/usr/bin/update_axeos_certificates --quiet)
Aug 09 09:00:01 vc1n3 systemd[1]: Started Session c165 of user root.
Aug 09 09:00:01 vc1n3 systemd[1]: Starting Session c165 of user root.
Aug 09 09:00:46 vc1n3 kernel: INFO: task snmpd:2217 blocked for more than
120 seconds.
Aug 09 09:00:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 09:00:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 09:00:46 vc1n3 kernel: snmpd D ffff88107fc96800 0
2217 1 0x00000000
Aug 09 09:00:46 vc1n3 kernel: ffff88085b0c2580 0000000000000020
ffff881052e62480 ffff88085ac5c000
Aug 09 09:00:46 vc1n3 kernel: ffff88085ac5bdc0 ffffffff81889624
ffff88085b0c2580 0000000000000246
Aug 09 09:00:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 09:00:46 vc1n3 kernel: Call Trace:
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813d6697>] ? dev_ioctl+0x2e7/0x570
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811971f4>] ? clear_inode+0x14/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff810e848a>] ?
cgroup_sk_alloc+0x2a/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a20e6>] ?
sock_do_ioctl+0x36/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a253e>] ? sock_ioctl+0x19e/0x240
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8118fe94>] ?
do_vfs_ioctl+0x84/0x5a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81181076>] ? alloc_file+0x16/0xa0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a1fc5>] ?
sock_alloc_file+0x95/0x110
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81190421>] ? SyS_ioctl+0x71/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a4653>] ? SyS_socket+0x63/0xa0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 09:00:46 vc1n3 kernel: INFO: task kworker/3:0:11165 blocked for more
than 120 seconds.
Aug 09 09:00:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 09:00:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 09:00:46 vc1n3 kernel: kworker/3:0 D ffff88085fd96800 0
11165 2 0x00000000
Aug 09 09:00:46 vc1n3 kernel: Workqueue: ipv6_addrconf addrconf_verify_work
Aug 09 09:00:46 vc1n3 kernel: ffff88085ade8000 0000000000000000
0000000000000000 ffff8808557d4000
Aug 09 09:00:46 vc1n3 kernel: ffff8808557d3e18 ffffffff81889624
ffff88085ade8000 0000000000000246
Aug 09 09:00:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 09:00:46 vc1n3 kernel: Call Trace:
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff810286f4>] ?
__switch_to+0x244/0x570
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8145e9f5>] ?
addrconf_verify_work+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81086de5>] ?
process_one_work+0x145/0x3d0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff810870d0>] ?
worker_thread+0x60/0x4a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81497e67>] ? __schedule+0x1e7/0x670
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81087070>] ?
process_one_work+0x3d0/0x3d0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8108beea>] ? kthread+0xca/0xe0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149c79f>] ?
ret_from_fork+0x1f/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8108be20>] ?
kthread_create_on_node+0x180/0x180
Aug 09 09:00:46 vc1n3 kernel: INFO: task teamd:11385 blocked for more than
120 seconds.
Aug 09 09:00:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 09:00:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 09:00:46 vc1n3 kernel: teamd D ffff88107fd16800 0
11385 1 0x00000000
Aug 09 09:00:46 vc1n3 kernel: ffff88105aed3e80 ffff88085b8a1600
00000000000000db ffff8810545f4000
Aug 09 09:00:46 vc1n3 kernel: ffff8810545f3c78 ffffffff81889624
ffff88105aed3e80 0000000000000246
Aug 09 09:00:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 09:00:46 vc1n3 kernel: Call Trace:
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81190dc0>] ?
poll_select_copy_remaining+0x120/0x120
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813cfa00>] ?
rtnetlink_rcv+0x10/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a4538>] ?
___sys_recvmsg+0x148/0x1c0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811bdc34>] ?
ep_send_events_proc+0x74/0x170
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811be517>] ? ep_poll+0x127/0x350
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 09:00:46 vc1n3 kernel: INFO: task teamd:11386 blocked for more than
120 seconds.
Aug 09 09:00:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 09:00:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 09:00:46 vc1n3 kernel: teamd D ffff88107fc96800 0
11386 1 0x00000000
Aug 09 09:00:46 vc1n3 kernel: ffff88105aed7080 ffff88085abf1f00
00000000000000db ffff881057ff4000
Aug 09 09:00:46 vc1n3 kernel: ffff881057ff3c78 ffffffff81889624
ffff88105aed7080 0000000000000246
Aug 09 09:00:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffffffff81889620 ffffffff814985b5
Aug 09 09:00:46 vc1n3 kernel: Call Trace:
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81190dc0>] ?
poll_select_copy_remaining+0x120/0x120
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813cfa00>] ?
rtnetlink_rcv+0x10/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a4538>] ?
___sys_recvmsg+0x148/0x1c0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811bdc34>] ?
ep_send_events_proc+0x74/0x170
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8109b627>] ? update_curr+0xa7/0x130
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff810a2e72>] ?
pick_next_task_fair+0x2c2/0x490
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 09:00:46 vc1n3 kernel: INFO: task teamd:11418 blocked for more than
120 seconds.
Aug 09 09:00:46 vc1n3 kernel: Tainted: G O 4.7.0-aos0.1
#1
Aug 09 09:00:46 vc1n3 kernel: "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 09 09:00:46 vc1n3 kernel: teamd D ffff88107fc16800 0
11418 1 0x00000004
Aug 09 09:00:46 vc1n3 kernel: ffff88105af10000 0000000000000001
0000000000000000 ffff881057fe4000
Aug 09 09:00:46 vc1n3 kernel: ffff881057fe3750 ffff88085560f814
ffff88105af10000 0000000000000246
Aug 09 09:00:46 vc1n3 kernel: 00000000ffffffff ffffffff8149831c
ffff88085560f810 ffffffff814985b5
Aug 09 09:00:46 vc1n3 kernel: Call Trace:
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149831c>] ? schedule+0x2c/0x80
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff814985b5>] ?
schedule_preempt_disabled+0x5/0x10
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a60c>] ?
__mutex_lock_slowpath+0x10c/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81429f63>] ?
inetdev_event+0x23/0x4b0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a809>] ? mutex_lock+0x9/0x20
Aug 09 09:00:46 vc1n3 kernel: [<ffffffffa0828e34>] ?
team_device_event+0xe4/0x110 [team]
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8108cc00>] ?
notifier_call_chain+0x40/0x60
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813c1ef1>] ?
netdev_update_features+0x21/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813c1c72>] ?
__netdev_update_features+0x2e2/0x540
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813bcda7>] ? netdev_info+0x57/0x60
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813c204d>] ?
netdev_change_features+0xd/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffffa0828cde>] ?
team_port_del+0x1ae/0x1e0 [team]
Aug 09 09:00:46 vc1n3 kernel: [<ffffffffa0828d38>] ?
team_del_slave+0x28/0x40 [team]
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813d0650>] ? do_setlink+0x320/0xb60
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149a3bb>] ?
__mutex_unlock_slowpath+0xbb/0x160
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811be3bd>] ?
ep_scan_ready_list+0x1ad/0x1c0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811be3d0>] ?
ep_scan_ready_list+0x1c0/0x1c0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811bdfc2>] ?
ep_call_nested.constprop.10+0xe2/0x100
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813d15a1>] ?
rtnl_newlink+0x621/0x870
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813d1084>] ?
rtnl_newlink+0x104/0x870
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813d1c01>] ?
rtnl_getlink+0x121/0x1a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813cfaa8>] ?
rtnetlink_rcv_msg+0x88/0x210
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81177b7b>] ?
__kmalloc_node_track_caller+0x2b/0x210
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813cfa20>] ?
rtnetlink_rcv+0x30/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed82c>] ?
netlink_rcv_skb+0x9c/0xc0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813cfa0f>] ?
rtnetlink_rcv+0x1f/0x30
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed17a>] ?
netlink_unicast+0x14a/0x1e0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813ed609>] ?
netlink_sendmsg+0x349/0x3a0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a3b5b>] ? sock_sendmsg+0x2b/0x40
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a43dd>] ?
___sys_sendmsg+0x24d/0x260
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811978d4>] ?
generic_update_time+0x74/0xc0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff81198f69>] ? touch_atime+0x99/0xa0
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff811871fa>] ? pipe_read+0x2ca/0x300
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8117e4fa>] ? __vfs_read+0xba/0x110
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff813a519c>] ?
__sys_sendmsg+0x3c/0x70
Aug 09 09:00:46 vc1n3 kernel: [<ffffffff8149c576>] ?
entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 09 09:00:58 vc1n3 systemd[1]: teamd(a)team0.service: Processes still
around after final SIGKILL. Entering failed mode.
Aug 09 09:00:58 vc1n3 systemd[1]: Stopped Team Daemon for device team0.
Aug 09 09:00:58 vc1n3 systemd[1]: teamd(a)team0.service: Unit entered failed
state.
Aug 09 09:00:58 vc1n3 systemd[1]: teamd(a)team0.service: Failed with result
'timeout'.
Configs:
[root@vc1n3 interfaces]# cat team-sw1
{
"device": "sw1",
"hwaddr": "00:0a:f7:06:1e:68",
"runner": {
"name": "lacp",
"active": true,
"fast_rate": true,
"tx_hash": ["eth", "ipv4", "ipv6"]
},
"link_watch": {"name": "ethtool"},
"ports": {"enp17s0f0": {}, "eno2": {}, "eno3": {}}
}
[root@vc1n3 interfaces]# cat team-sw2
{
"device": "sw2",
"hwaddr": "00:0a:f7:06:1e:6a",
"runner": {
"name": "lacp",
"active": true,
"fast_rate": true,
"tx_hash": ["eth", "ipv4", "ipv6"]
},
"link_watch": {"name": "ethtool"},
"ports": {"enp17s0f1": {}, "eno4": {}, "eno5": {}}
}
[root@vc1n3 interfaces]# cat team-team0
{
"device": "team0",
"runner": {"name": "activebackup"},
"link_watch": {
"name": "ethtool"
},
"ports": {
"sw1": {
"prio": 100
},
"sw2": {
"prio": 50
}
}
}
7 years, 8 months
lacp port priorities are reversed
by Mariusz Mazur
Man for teamd.conf states:
lacp_prio — Aggregator with highest priority according to
LACP
standard will be selected. Aggregator priority is affected
by
per-port option lacp_prio.
ports.PORTIFNAME.lacp_prio (int)
Port priority according to LACP standard. The lower number
means
higher priority.
This is actually the opposite of what's happening. The ports with the
highest lacp_prio number, not the lowest, are what gets to become master.
Here's my current config:
{
"device": "team0",
"hwaddr": "00:0a:f7:06:1e:68",
"runner": {
"name": "lacp",
"active": true,
"fast_rate": true,
"tx_hash": ["eth", "ipv4", "ipv6"]
},
"link_watch": {"name": "ethtool"},
"agg_select_policy": "lacp_prio",
"ports": {
"enp17s0f0": {"lacp_prio": 1, "lacp_key": 1},
"eno2": {"lacp_prio": 1, "lacp_key": 1},
"eno3": {"lacp_prio": 1, "lacp_key": 1},
"enp17s0f1": {"lacp_prio": 200, "lacp_key": 2},
"eno4": {"lacp_prio": 200, "lacp_key": 2},
"eno5": {"lacp_prio": 200, "lacp_key": 2}
}
}
And the ports with lacp_prio: 200 are the master:
[root@vc1n3 interfaces]# teamdctl team0 state|egrep '^ en|aggre'
eno2
aggregator ID: 2
eno3
aggregator ID: 2
eno4
aggregator ID: 3, Selected
eno5
aggregator ID: 3, Selected
enp17s0f0
aggregator ID: 2
enp17s0f1
aggregator ID: 3, Selected
And yes, if I shut down the lacp_key: 2 ports and then reenable them later,
it switches back to them as the Selected ones.
7 years, 8 months
[jpirko/libteam] 4a9e1f: teamd: fix the issue that network blocks
when syst...
by GitHub
Branch: refs/heads/master
Home: https://github.com/jpirko/libteam
Commit: 4a9e1fac5d69e6abae0451c579b02f16d960e694
https://github.com/jpirko/libteam/commit/4a9e1fac5d69e6abae0451c579b02f16...
Author: Xin Long <lucien.xin(a)gmail.com>
Date: 2016-08-14 (Sun, 14 Aug 2016)
Changed paths:
M teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team
Log Message:
-----------
teamd: fix the issue that network blocks when systemctl stop teamd
Commit 0641375d10d6 ("teamd: change to Before=network-pre.target in
systemd service file") tried to make teamd be stopped after network
service is stopped when systemd shutdown.
But network service also kills teamd service with "systemctl stop" in
ifdown-Team. It means network service try to kill one service that is
dependent on network itself, which leads to a block.
This patch is to use "systemctl stop --ignore-dependencies" to avoid
this block without any side effects. Already verified it on the env
where we can reproduce this issue.
Fixes: 0641375d10d6 ("teamd: change to Before=network-pre.target in systemd service file")
Signed-off-by: Xin Long <lucien.xin(a)gmail.com>
Signed-off-by: Jiri Pirko <jiri(a)mellanox.com>
7 years, 8 months
[PATCH] teamd: fix the issue that network blocks when systemctl stop teamd
by Xin Long
Commit 0641375d10d6 ("teamd: change to Before=network-pre.target in
systemd service file") tried to make teamd be stopped after network
service is stopped when systemd shutdown.
But network service also kills teamd service with "systemctl stop" in
ifdown-Team. It means network service try to kill one service that is
dependent on network itself, which leads to a block.
This patch is to use "systemctl stop --ignore-dependencies" to avoid
this block without any side effects. Already verified it on the env
where we can reproduce this issue.
Fixes: 0641375d10d6 ("teamd: change to Before=network-pre.target in systemd service file")
Signed-off-by: Xin Long <lucien.xin(a)gmail.com>
---
teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team b/teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team
index 9404e05..73eccee 100755
--- a/teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team
+++ b/teamd/redhat/initscripts_systemd/network-scripts/ifdown-Team
@@ -45,5 +45,5 @@ if [ -n "${TEAM_CONFIG}" ]; then
is_ignored_file "$device" && continue
/sbin/ifdown ${device##*/}
done
- /usr/bin/systemctl stop teamd(a)${DEVICE}.service || exit 1
+ /usr/bin/systemctl stop teamd(a)${DEVICE}.service --ignore-dependencies || exit 1
fi
--
2.1.0
7 years, 8 months
hwaddr_policy: same_all does not set MACs properly
by Mariusz Mazur
I'm running kernel 4.7.0 and teamd 1.25. The following config should set
all of the configured links to the same MAC address. It doesn't:
[root@vc1n3 ~]# teamdctl sw2 config dump
{
"device": "sw2",
"link_watch": {
"delay_up": 5000,
"name": "ethtool"
},
"ports": {
"eno4": {},
"eno5": {},
"enp17s0f1": {}
},
"runner": {
"active": true,
"fast_rate": true,
"name": "lacp",
"tx_hash": [
"eth",
"ipv4",
"ipv6"
]
}
}
And the links are:
3: enp17s0f1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master
sw2 state UP mode DEFAULT group default qlen 1000
link/ether 00:0a:f7:06:1e:6a brd ff:ff:ff:ff:ff:ff
6: eno4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master sw2
state UP mode DEFAULT group default qlen 1000
link/ether 00:0a:f7:06:1e:6a brd ff:ff:ff:ff:ff:ff
7: eno5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master sw2
state UP mode DEFAULT group default qlen 1000
link/ether f2:8a:c5:1a:67:47 brd ff:ff:ff:ff:ff:ff
As you can see, eno5 was changed (from the NIC's default
'98:be:94:45:31:f5'), but to something seemingly random. Interestingly
enough if I explicitly add "hwaddr": "00:0a:f7:06:1e:6a" to the config,
all three NICs are set up properly and I have no issues.
But as it stands, the default behavior is buggy for me.
7 years, 8 months