-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 12/14/2010 09:51 AM, Jeff Mahoney wrote:
On 12/14/2010 07:11 AM, Kay Sievers wrote:
On Tue, 2010-12-14 at 11:59 +0100, Peter Czanik wrote:
On 12/14/2010 11:42 AM, Kay Sievers wrote:
Without systemd the same system easily survives stress testing (cd /usr/src/linux && make -j 100 :-) ) without locking up.
Yeah, isolated computational load is a very different situation from booting up -- where we have heavy parallel kernel module load, device initialization, and service startup going on.
If possible, try, if removing: quiet and adding: systemd.log_level=debug systemd.log_target=kmsg to the kernel commandline reveals something on the console.
It might slow down the bootup enough, so that it works-- or it might show where it hangs.
Hehe. Once I use the above settings, systemd seems to boot perfectly, at least for the last two boots. Without it the machine hangs with or without a kernel panic message on screen during the boot or right after the login: prompt is printed.
Yeah, a few people have seen this. It's likely a bug in the kernel in combination with some specific hardware.
Seems, the massive parallel work uncovers some races here, which we didn't trigger with the old bootup logic.
Is there any output on the console when the kernel panic happens? Can you take a picture with a camera of the screen? Would be good to find out which kernel module makes the machine crash.
Might also worth trying to add: systemd.unit=multi-user.target which will be "runlevel 3", and check if that already crashes.
I've seen a panic a bunch of times during startup and suspected it was related to the per-tty auto task groups patch. Mike removed that patch and replaced it with the per-session task groups patch which also fixes several bugs in the original patch. All of my crashes were related to cgroups or scheduling and they seem to have been fixed with the latest -rc5-based factory kernel.
Seems I spoke too soon. Here are the same Oopses I was seeing before. The following occured during boot. [ 2.212682] BUG: unable to handle kernel paging request at 000000040005047a [ 2.213066] IP: [<ffffffff8125df63>] rb_next+0x23/0x50 [ 2.213066] PGD 0 [ 2.213066] Oops: 0000 [#1] PREEMPT SMP [ 2.213066] last sysfs file: /sys/module/ipv6/parameters/disable [ 2.213066] CPU 0 [ 2.213066] Modules linked in: autofs4 ext4 jbd2 crc16 dm_snapshot dm_mod fan processor pata_acpi thermal thermal_sys [ 2.213066] [ 2.213066] Pid: 282, comm: systemd-cgroups Not tainted 2.6.37-rc5-desktop #13 /Bochs [ 2.213066] RIP: 0010:[<ffffffff8125df63>] [<ffffffff8125df63>] rb_next+0x23/0x50 [ 2.213066] RSP: 0018:ffff88003bc5fda0 EFLAGS: 00010006 [ 2.213066] RAX: 000000040005046a RBX: ffff880037f2ba00 RCX: 0000007000000000 [ 2.213066] RDX: 0000007000000000 RSI: ffff880037f2ba00 RDI: ffff880037f2ba10 [ 2.213066] RBP: ffff88003bc5fdd8 R08: 0000000000000001 R09: 0000000000000000 [ 2.213066] R10: ffff880037949158 R11: 0000000000000001 R12: ffff88003bcb0900 [ 2.213066] R13: ffff880037f2ba10 R14: 0000000000000000 R15: ffff88003fc12640 [ 2.213066] FS: 00007f489101d7a0(0000) GS:ffff88003fc00000(0000) knlGS:0000000000000000 [ 2.213066] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [ 2.213066] CR2: 000000040005047a CR3: 0000000001a03000 CR4: 00000000000006f0 [ 2.213066] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 2.213066] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [ 2.213066] Process systemd-cgroups (pid: 282, threadinfo ffff88003bc5e000, task ffff8800379fc6c0) [ 2.213066] Stack: [ 2.213066] ffffffff8104dc78 ffff88003fc12640 ffff88003fc12640 ffff8800379fcaa8 [ 2.213066] 0000000000000000 ffffffff8160b120 ffff8800379fc9b8 ffff88003bc5fea8 [ 2.213066] ffffffff81511c4f ffff880037e98bc0 0000000000000092 0000000300121a65 [ 2.213066] Call Trace: [ 2.213066] [<ffffffff8104dc78>] pick_next_task_fair+0x178/0x180 [ 2.213066] [<ffffffff81511c4f>] thread_return+0x430/0x6a1 [ 2.213066] [<ffffffff8105af0e>] do_exit+0x5fe/0x8d0 [ 2.213066] [<ffffffff8105b471>] do_group_exit+0x51/0xc0 [ 2.213066] [<ffffffff8105b4f2>] sys_exit_group+0x12/0x20 [ 2.213066] [<ffffffff81002f4b>] system_call_fastpath+0x16/0x1b [ 2.213066] [<00007f48902c9b28>] 0x7f48902c9b28 [ 2.213066] Code: 85 d2 75 f4 f3 c3 f3 c3 48 8b 17 31 c0 48 89 d1 48 83 e1 fc 48 39 cf 74 37 48 8b 47 08 48 85 c0 75 09 eb 1a 0f 1f 40 00 48 89 d0 <48> 8b 50 10 48 85 d2 75 f4 f3 c3 66 90 48 8b 11 48 89 cf 48 89 [ 2.213066] RIP [<ffffffff8125df63>] rb_next+0x23/0x50 [ 2.213066] RSP <ffff88003bc5fda0> [ 2.213066] CR2: 000000040005047a [ 2.213066] ---[ end trace d28fc3a29d424b70 ]--- ... and this one occured during reboot: [ 73.191301] general protection fault: 0000 [#1] PREEMPT SMP [ 73.192023] last sysfs file: /sys/devices/pci0000:00/0000:00:03.0/class [ 73.192023] CPU 0 [ 73.192023] Modules linked in: edd ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit af_packet ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw xt_NOTRACK ipt_REJECT iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_defrag_ipv4 ip_tables xt_conntrack nf_conntrack ip6table_filter ip6_tables x_tables mperf ppdev parport_pc parport sr_mod cdrom sg 8139too i2c_piix4 floppy pcspkr button 8139cp autofs4 ext4 jbd2 crc16 dm_snapshot dm_mod fan processor pata_acpi thermal thermal_sys [ 73.192023] [ 73.192023] Pid: 1, comm: systemd Not tainted 2.6.37-rc5-desktop #13 /Bochs [ 73.192023] RIP: 0010:[<ffffffff8125dc65>] [<ffffffff8125dc65>] rb_erase+0xd5/0x300 [ 73.192023] RSP: 0018:ffff88003ce63be0 EFLAGS: 00010086 [ 73.192023] RAX: f000ff53f000ff53 RBX: ffff88003c49ba10 RCX: ffff88003c49ba10 [ 73.192023] RDX: ffff88003c49ba10 RSI: ffff88003bcaf928 RDI: ffff880000000000 [ 73.192023] RBP: ffff88003bcaf928 R08: 0000000000000001 R09: ffff880037824400 [ 73.192023] R10: ffff88003c0884e0 R11: 0000000000000001 R12: ffff88003bcaf900 [ 73.192023] R13: ffff88003c49b210 R14: 0000000000000000 R15: ffff88003fc12640 [ 73.192023] FS: 00007f74804757a0(0000) GS:ffff88003fc00000(0000) knlGS:0000000000000000 [ 73.192023] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 73.192023] CR2: 00000000007c32b0 CR3: 000000003cbe8000 CR4: 00000000000006f0 [ 73.192023] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [ 73.192023] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [ 73.192023] Process systemd (pid: 1, threadinfo ffff88003ce62000, task ffff88003ce60040) [ 73.192023] Stack: [ 73.192023] ffff88003c49b200 ffff88003ce63c28 ffffffff8104dc43 ffff88003fc12640 [ 73.192023] ffff88003fc12640 ffff88003ce60428 0000000000000000 ffffffff8160b120 [ 73.192023] ffff88003ce60040 ffff88003ce63cf8 ffffffff81511c4f ffff88003ce63d28 [ 73.192023] Call Trace: [ 73.192023] [<ffffffff8104dc43>] pick_next_task_fair+0x143/0x180 [ 73.192023] [<ffffffff81511c4f>] thread_return+0x430/0x6a1 [ 73.192023] [<ffffffff8151248d>] schedule_timeout+0x28d/0x310 [ 73.192023] [<ffffffff81511300>] wait_for_common+0xc0/0x150 [ 73.192023] [<ffffffff810cb831>] synchronize_rcu+0x41/0x50 [ 73.192023] [<ffffffff810a40d6>] cgroup_diput+0x36/0xf0 [ 73.192023] [<ffffffff81161001>] d_kill+0x41/0x70 [ 73.192023] [<ffffffff81161640>] dput+0x60/0x150 [ 73.192023] [<ffffffff8115a9f0>] do_rmdir+0xa0/0x130 [ 73.192023] [<ffffffff81002f4b>] system_call_fastpath+0x16/0x1b [ 73.192023] [<00007f747ed05b67>] 0x7f747ed05b67 [ 73.192023] Code: 83 f8 01 74 55 5b 5d c3 48 83 c8 01 48 89 ee 48 89 07 48 83 23 fe 48 89 df e8 68 fd ff ff 48 8b 7b 10 48 8b 47 10 48 85 c0 74 09 <f6> 00 01 0f 84 9b 01 00 00 48 8b 57 08 48 85 d2 74 0c 48 8b 0a [ 73.192023] RIP [<ffffffff8125dc65>] rb_erase+0xd5/0x300 [ 73.192023] RSP <ffff88003ce63be0> [ 73.192023] ---[ end trace d05e11dcc0577c32 ]--- [ 73.192023] note: systemd[1] exited with preempt_count 2 - -Jeff - -- Jeff Mahoney SUSE Labs -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.16 (GNU/Linux) Comment: Using GnuPG with SUSE - http://enigmail.mozdev.org/ iEYEARECAAYFAk0HjZsACgkQLPWxlyuTD7LdJACfWnzEtVLGgU8eWc2LkXW6RjkW DwEAoItzf9zlYQizXEdgyQKXvODH/4YO =i3fJ -----END PGP SIGNATURE----- -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org For additional commands, e-mail: opensuse-factory+help@opensuse.org