[Bug 743213] New: systemd causes startup to hang if OFED initialization scripts are enabled
https://bugzilla.novell.com/show_bug.cgi?id=743213 https://bugzilla.novell.com/show_bug.cgi?id=743213#c0 Summary: systemd causes startup to hang if OFED initialization scripts are enabled Classification: openSUSE Product: openSUSE 12.1 Version: Final Platform: x86-64 OS/Version: SuSE Other Status: NEW Severity: Normal Priority: P5 - None Component: Basesystem AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: bart.vanassche@gmail.com QAContact: qa@suse.de Found By: --- Blocker: --- Created an attachment (id=472566) --> (http://bugzilla.novell.com/attachment.cgi?id=472566) Contents of /var/log/messages with systemd logging enabled User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:9.0.1) Gecko/20100101 Firefox/9.0.1 I'm using the OFED startup scripts since a long time on openSUSE. These worked fine with openSUSE 11.4 and before but not with openSUSE 12.1 (installed from scratch). If I configure the openibd and opensmd script to run during startup, then startup hangs before switching from console mode to X11 desktop. Reproducible: Always Steps to Reproduce: 1. Install openSUSE 12.1. 2. Download, configure, build and install OFED 1.5.4 (http://www.openfabrics.org/downloads/OFED/ofed-1.5.4/OFED-1.5.4.tgz). 3. Enable the OFED scripts to run during boot: chkconfig -s openibd on chkconfig -s opensmd 235 4. Reboot. Actual Results: Startup never completes. Expected Results: System boots normally and the graphical (KDE) desktop appears after a reasonable time. Can probably only be reproduced on a system with at least one InfiniBand HCA. # head /etc/sysconfig/network/ifcfg-ib* ==> /etc/sysconfig/network/ifcfg-ib0 <== BOOTPROTO='static' BROADCAST='' ETHTOOL_OPTIONS='' IPADDR='192.168.5.1/24' MTU='' NAME='MT26428 [ConnectX VPI PCIe 2.0 5GT/s - IB QDR / 10GigE]' NETWORK='' REMOTE_IPADDR='' STARTMODE='auto' USERCONTROL='no' ==> /etc/sysconfig/network/ifcfg-ib1 <== BOOTPROTO='static' BROADCAST='' ETHTOOL_OPTIONS='' IPADDR='192.168.6.1/24' MTU='' NAME='MT26428 [ConnectX VPI PCIe 2.0 5GT/s - IB QDR / 10GigE]' NETWORK='' REMOTE_IPADDR='' STARTMODE='auto' USERCONTROL='no' -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c1
--- Comment #1 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c2
--- Comment #2 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c3
--- Comment #3 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c
zj jia
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c4
--- Comment #4 from Frederic Crozat
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c5
--- Comment #5 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c6
--- Comment #6 from Frederic Crozat
From a quick look at the "initscript" shipped by OFED, it seems completely broken and doesn't integrate properly with openSUSE initscript (no source of /etc/rc.status, which will break systemd integration when script is manually called).
It is not clear if this script is supposed to start a daemon or not. Try adding in the LSB header (it requires systemd from 12.1 maintenance update to work properly): # X-Systemd-RemainAfterExit: true -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c7
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c8
--- Comment #8 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c9
--- Comment #9 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c10
--- Comment #10 from Bart Van Assche
It is not clear if this script is supposed to start a daemon or not.
As far as I know the openibd startup script does not start any daemons. What it does is to load several kernel modules, to adjust some system settings and to bring up the IPoIB network interfaces. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c11
--- Comment #11 from Frederic Crozat
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c12
--- Comment #12 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c13
--- Comment #13 from Frederic Crozat
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c14
--- Comment #14 from Bart Van Assche
from the log, it looks like modprobe mlx4_core is blocking in openibd core, so it doesn't seem systemd related, it is just that systemd highlight the issue.
Are you sure ? From /var/log/messages: Feb 9 16:42:58 asus openibd[875]: + /sbin/modprobe mlx4_ib Feb 9 16:42:58 asus openibd[875]: + /sbin/modprobe mlx4_en Feb 9 16:42:59 asus kernel[875]: + /sbin/modprobe mlx4_core So before the openibd startup script issues the shell command "/sbin/modprobe mlx4_core", the modules mlx4_ib and mlx4_en have already been loaded successfully. Since these last two modules depend on mlx4_core, loading either module also loads the mlx4_core module. So how is it possible that the "/sbin/modprobe mlx4_core" command hangs at a time that module has already been loaded ? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c15
--- Comment #15 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c16
--- Comment #16 from Frederic Crozat
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c17
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c18
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c19
--- Comment #19 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c20
--- Comment #20 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c21
--- Comment #21 from Frederic Crozat
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c22
--- Comment #22 from Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c23
Bart Van Assche
https://bugzilla.novell.com/show_bug.cgi?id=743213
https://bugzilla.novell.com/show_bug.cgi?id=743213#c24
Frederic Crozat
participants (1)
-
bugzilla_noreply@novell.com