[Bug 217563] New: HAL/DBUS doesn't always start properly
https://bugzilla.novell.com/show_bug.cgi?id=217563 Summary: HAL/DBUS doesn't always start properly Product: openSUSE 10.2 Version: Beta 1 Platform: Other OS/Version: Other Status: NEW Severity: Normal Priority: P5 - None Component: Basesystem AssignedTo: bnc-team-screening@forge.provo.novell.com ReportedBy: mboman@novell.com QAContact: qa@suse.de Sometimes when booting up the machine, I've noticed that NetworkManager can't find any network cards. A restart normally solves this. I found out that HAL/DBUS doesn't always start properly. I'm attaching boot.msg and message incase that helps. mblxws01:/home/mboman/tmp # ps auxw|grep -i hal 101 2983 0.0 0.0 2028 880 ? S 15:37 0:00 hald-addon-keyboard: listening on /dev/input/event1 root 4038 0.0 0.0 2860 752 pts/0 R+ 15:41 0:00 grep -i hal mblxws01:/home/mboman/tmp # ps auxw|grep -i dbus 100 2505 0.0 0.0 3552 1008 ? Ss 15:37 0:00 /usr/bin/dbus-daemon --system mboman 3721 0.0 0.0 3772 836 ? Ss 15:40 0:00 /usr/bin/dbus-daemon --fork --print-pid 4 --print-address 6 --session root 4064 0.0 0.0 2856 744 pts/0 R+ 15:42 0:00 grep -i dbus -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #1 from mboman@novell.com 2006-11-02 14:36 MST ------- Created an attachment (id=103606) --> (https://bugzilla.novell.com/attachment.cgi?id=103606&action=view) boot.msg -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #2 from mboman@novell.com 2006-11-02 14:36 MST ------- Created an attachment (id=103607) --> (https://bugzilla.novell.com/attachment.cgi?id=103607&action=view) messages -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 dkukawka@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |NEEDINFO Info Provider| |mboman@novell.com ------- Comment #4 from dkukawka@novell.com 2006-11-04 07:25 MST ------- please change in /etc/init.d/haldaemon this line: HALDAEMON_PARA="--daemon=yes --retain-privileges"; to: HALDAEMON_PARA="--daemon=yes --retain-privileges --verbose=yes --use-syslog"; and attach the part of /var/log/messages since boot if this happen again. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 thoenig@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Summary|HAL/DBUS doesn't always |HAL doesn't always start properly |start properly | Version|Beta 1 |Beta 1 plus ------- Comment #7 from thoenig@novell.com 2006-11-05 11:38 MST ------- I ran into this as well, seems to be a race as it can not be reproduced reliably. However, D-Bus was always working fine, just HAL did not run. Adjusting summary. -> Beta1 Plus -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #8 from thoenig@novell.com 2006-11-05 11:46 MST ------- Just for the log: The change as proposed by Danny (comment #4) makes it impossible to reproduce the problem at any time for me (HAL always gets started properly). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 andreas.hanke@gmx-topmail.de changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |andreas.hanke@gmx-topmail.de ------- Comment #9 from andreas.hanke@gmx-topmail.de 2006-11-05 20:57 MST ------- (In reply to comment #8)
The change as proposed by Danny (comment #4) makes it impossible to reproduce the problem at any time for me (HAL always gets started properly).
This sounds very familiar, it's the same in bug 218184: hald doesn't start properly, but as soon as the debug parameters are added, it does. Adding myself to CC (for a reason, please don't remove me again, thanks). -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 thoenig@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |balta@o2online.de ------- Comment #10 from thoenig@novell.com 2006-11-06 01:07 MST ------- *** Bug 218184 has been marked as a duplicate of this bug. *** -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 peter@suntel.com.tr changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |peter@suntel.com.tr ------- Comment #11 from peter@suntel.com.tr 2006-11-08 09:49 MST ------- I have been seeing this bug also for approximately the last month. I run the latest Factory updated on a daily basis with smart. I see the problem on about 20% of boots, however it is MUCH more likely to happen if I have just done a "smart upgrade" -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #12 from andreas.hanke@gmx-topmail.de 2006-11-10 12:22 MST ------- Created an attachment (id=104740) --> (https://bugzilla.novell.com/attachment.cgi?id=104740&action=view) bootchart graph of a boot process where hald disappeared -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #13 from andreas.hanke@gmx-topmail.de 2006-11-10 12:25 MST ------- Created an attachment (id=104741) --> (https://bugzilla.novell.com/attachment.cgi?id=104741&action=view) bootchart graph of a boot process where hald survived -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #14 from thoenig@novell.com 2006-11-11 11:35 MST ------- Andreas, thanks a lot for the graphs -- that's a great idea to narrow down the cause of this bug. Did anyone run into this with Beta2? So far, I did not run into this issue on my systems running Beta2. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #15 from peter@suntel.com.tr 2006-11-11 12:13 MST ------- I am still seeing this problem with latest Factory (Is it in sync with Beta2)? # date Sat Nov 11 21:05:42 EET 2006 # smart update;smart upgrade -y Loading cache... Updating cache... ################################################################### [100%] Fetching information for 'SUSE Factory'... -> ftp://mirrors.kernel.org/opensuse/distribution/SL-OSS-factory/inst-source/media.1/media media ################################################################### [ 100%] Updating cache... ################################################################### [100%] Channels have no new packages. Saving cache... Loading cache... Updating cache... ################################################################### [100%] Computing transaction... No interesting upgrades available. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #16 from thoenig@novell.com 2006-11-11 12:20 MST ------- Peter, can you please try whether HAL survives if you delay the start? You can test that by replacing startproc -p $HALDAEMON_PID $HALDAEMON_BIN $HALDAEMON_PARA with sleep 5 && tartproc -p $HALDAEMON_PID $HALDAEMON_BIN $HALDAEMON_PARA in '/etc/init.d/haldaemon'. Thanks! -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #17 from thoenig@novell.com 2006-11-11 12:21 MST ------- (In reply to comment #16)
sleep 5 && tartproc -p $HALDAEMON_PID $HALDAEMON_BIN $HALDAEMON_PARA
Of course, this should read sleep 5 && startproc -p $HALDAEMON_PID $HALDAEMON_BIN $HALDAEMON_PARA -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #18 from andreas.hanke@gmx-topmail.de 2006-11-11 13:30 MST ------- Knowing that the desired way to debug hald is --daemon=yes --verbose=yes --use-syslog, I have ignored this because it makes the problem irreproducible. Instead I have changed the startproc invocation to be as follows: HALDAEMON_PARA="--daemon=no" startproc -l /tmp/hal_output.txt -p $HALDAEMON_PID $HALDAEMON_BIN $HALDAEMON_PARA You can find my /tmp/hal_output.txt attached. Maybe it's at least a bit useful. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #19 from andreas.hanke@gmx-topmail.de 2006-11-11 13:31 MST ------- Created an attachment (id=104803) --> (https://bugzilla.novell.com/attachment.cgi?id=104803&action=view) hal_output.txt -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #20 from andreas.hanke@gmx-topmail.de 2006-11-11 13:40 MST ------- ** ERROR **: file blockdev.c: line 835 (hotplug_event_begin_add_blockdev): assertion failed: (d_it != NULL) aborting... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #21 from peter@suntel.com.tr 2006-11-11 14:02 MST ------- I also see this error in /var/log/messages when it doesnt work. I have made the change requested in Comment #16 As the problem is difficult to reproduce reliably, I can't tell if it made any difference. I will report it it reoccurs.. (Note. The problem most reliably occurs on the first and second reboot after a "smart upgrade".. Maybe something starts up a bit slower the first few times after it has been upgraded???) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #22 from peter@suntel.com.tr 2006-11-11 14:07 MST ------- Just a quick note: In my opinion the Severity of this bug should be upgraded. It causes me major annoyance, but would send a non-expert linux user running for another platform if it affects them... As it is I can't figure out a reliable way to stop it happening or to reproduce it... When it happens, its possible to reboot 3 or 4 times without fixing it at which point I usually revert to a manual "ifconfig up" on an ethernet cable. One or 2 reboots later it usually fixes itself and I return to using wifi as normal... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #23 from andreas.hanke@gmx-topmail.de 2006-11-11 14:15 MST ------- Forget about smart, it has absolutely and definitely nothing to do with this and just causes confusion here. Only the engineers should touch the "Severity" field. Be patient, I'm very confident that this report will be handled properly nevertheless. I think now it's time to wait and see whether the information about the failed assertion in file blockdev.c: line 835 goes into the right direction. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #24 from dkukawka@novell.com 2006-11-11 14:28 MST ------- hm ... the g_assert() call is IMO really strange and the only case in the complete code where the complete daemon die because a device could not be found. And somehow the code look not really 'secure/save', because the code only try to get the parent device from the gdl and not from tdl. Could be a littlebit racy. I take a look at this. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #25 from thoenig@novell.com 2006-11-11 14:34 MST ------- Danny, we should really make HAL to issue such warnings using syslog. It would have spared us a lot of time. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #26 from balta@o2online.de 2006-11-11 14:40 MST ------- I reported the bug 218184 (Comment #10) Here it seems that hald isn't crashing anymore since I upgraded to Beta2 with smart... -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #27 from dkukawka@novell.com 2006-11-11 15:02 MST ------- Could you check if this already happen with the package from http://beta.suse.com/private/dkukawka/hal/testpackages/hal-0.5.8_git20061106... ? -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #30 from balta@o2online.de 2006-11-11 15:22 MST ------- Danny, do you mean me? I've installed hal-0.5.8_git20061106-4.x86_64 and it's working now. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #31 from andreas.hanke@gmx-topmail.de 2006-11-11 15:31 MST ------- Marcel, you wrote in comment 26 that it worked for you even with stock Beta2 packages, but not for me. So your information from comment 30 doesn't really apply, sorry. I'm testing hal-0.5.8_git20061106-5.i586.rpm right now on the very same machine where stock Beta2 had the problem. So far it looks good, but I have rebooted only 5 times and would like to test it more. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #32 from balta@o2online.de 2006-11-11 16:10 MST ------- sry for my english... I justed wanted to know if I should try hal-0.5.8_git20061106-5.x86_64.rpm, even if it is working since update to beta2 with hal-0.5.8_git20061106-4.x86_64... I hate english ;-) -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 dkukawka@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEEDINFO |RESOLVED Info Provider|mboman@novell.com | Resolution| |FIXED ------- Comment #33 from dkukawka@novell.com 2006-11-11 16:22 MST ------- mboman also could no longer reproduce the bug. If the bug occours anymore, open the bug. I submitted a new package to STABLE. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #34 from andreas.hanke@gmx-topmail.de 2006-11-11 16:37 MST ------- I have tested the test package hal-0.5.8_git20061106-5.i586.rpm by rebooting the system 30 times after installing it. There was not a single failure. For verification, I downgraded hal to the stock Beta2 package and then it failed again on the first attempt already. So assuming that the new hal submission has the patch from the test package in it, the bug is fixed. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 ------- Comment #35 from peter@suntel.com.tr 2006-11-11 17:37 MST ------- I have also upgraded to your package and after 10 reboots have as yet been unable to reproduce a failure.. Looks good.. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 markus.kriewald@web.de changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |markus.kriewald@web.de ------- Comment #36 from markus.kriewald@web.de 2006-11-16 14:07 MST ------- *** Bug 220912 has been marked as a duplicate of this bug. *** -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
https://bugzilla.novell.com/show_bug.cgi?id=217563 behlert@novell.com changed: What |Removed |Added ---------------------------------------------------------------------------- Status|RESOLVED |CLOSED -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug, or are watching someone who is.
participants (1)
-
bugzilla_noreply@novell.com