[opensuse-factory] segfault on zypper dup completion

I just did a dup to a 32 bit Beta 2 P4/Celeron system last dup'd about 6 weeks ago (host t2240). It appeared to complete normally, but after the zypper ps warning there was a segmentation fault line. Tail of dmesg: [ 37.342287] Installing knfsd (copyright (C) 1996 okir@monad.swb.de). [ 38.104167] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory [ 38.113952] NFSD: starting 90-second grace period [ 44.610044] eth0: no IPv6 routers present [ 475.004020] ------------[ cut here ]------------ [ 475.004041] WARNING: at /home/abuild/rpmbuild/BUILD/kernel-desktop-3.4.2/linux-3.4/net/sched/sch_generic.c:256 dev_watchdog+0x1e8/0x200() [ 475.004046] Hardware name: [ 475.004049] NETDEV WATCHDOG: eth0 (e100): transmit queue 0 timed out [ 475.004052] Modules linked in: snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc dm_mod sr_mod snd_intel8x0 cdrom snd_ac97_codec ac97_bus snd_pcm snd_timer snd soundcore snd_page_alloc iTCO_wdt iTCO_vendor_support microcode pcspkr e100 i2c_i801 shpchp pci_hotplug floppy i915 drm_kms_helper drm i2c_algo_bit video button scsi_dh_rdac scsi_dh_alua scsi_dh_hp_sw scsi_dh_emc scsi_dh fan processor thermal thermal_sys ata_generic ata_piix sata_sil [ 475.004113] Pid: 0, comm: swapper/0 Not tainted 3.4.2-1-desktop #1 [ 475.004117] Call Trace: [ 475.004138] [<c02054b9>] try_stack_unwind+0x199/0x1b0 [ 475.004149] [<c02041d7>] dump_trace+0x47/0xf0 [ 475.004155] [<c020551b>] show_trace_log_lvl+0x4b/0x60 [ 475.004161] [<c0205548>] show_trace+0x18/0x20 [ 475.004169] [<c06f02fe>] dump_stack+0x6d/0x72 [ 475.004178] [<c0233188>] warn_slowpath_common+0x78/0xb0 [ 475.004185] [<c0233253>] warn_slowpath_fmt+0x33/0x40 [ 475.004194] [<c0636ef8>] dev_watchdog+0x1e8/0x200 [ 475.004204] [<c0240808>] run_timer_softirq+0xf8/0x300 [ 475.004211] [<c02398a1>] __do_softirq+0x81/0x1b0 [ 475.004217] [<c02040b6>] do_softirq+0x76/0xb0 [ 475.004224] ---[ end trace e91e2e809858261d ]--- [ 475.021328] e100 0000:01:02.0: eth0: NIC Link is Up 100 Mbps Full Duplex [ 2774.999044] end_request: I/O error, dev fd0, sector 0 [ 3268.406459] udevd[12045]: starting version 182 [ 4264.313985] nfsd: last server has exited, flushing export cache [ 4269.974101] NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory [ 4269.974153] NFSD: starting 90-second grace period [ 4875.923898] zypper[2291]: segfault at 0 ip 08140fd8 sp bfcdcd80 error 4 in zypper[8048000+151000] -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

On 2012/08/24 00:44 (GMT-0400) Felix Miata composed:
DejaVu on a 1200MHz Pentium III Coppermine with same last line in messages. Previous update was about 8 weeks ago, with zypper running on 3.4.2-1 desktop. cpinfo flags: fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pse36 mmx fxsr sse up -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

On Wednesday 2012-08-29 18:59, Cristian Rodríguez wrote:
No it's not a kernel bug. #include <stdio.h> int main(void) { printf("%d\n", *(int*)1ULL); return 0; } The report varies with chosen address; (0x1ULLL): [507330.709345] a.out[14257]: segfault at 1 ip 000000000040052d sp 00007fff7d83ae10 error 4 in a.out[400000+1000] (0x8000000000000000ULL): [507297.331443] a.out[14238] general protection ip:400532 sp:7fffe3ebe5d0 error:0 in a.out[400000+1000] in neither case the kernel is at fault. JFYI, on x86, /* * Page fault error code bits: * * bit 0 == 0: no page found 1: protection fault * bit 1 == 0: read access 1: write access * bit 2 == 0: kernel-mode access 1: user-mode access * bit 3 == 1: use of reserved bit detected * bit 4 == 1: fault was an instruction fetch */ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

Hi, 2012/8/24 Felix Miata <mrmazda@earthlink.net>:
Did you encounter this issue before? Can you please provide /var/log/message and lspci -nvv ? Thanks! Regards, David
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

On 2012/08/30 11:52 (GMT+0800) David Chang composed:
Did you encounter this issue before?
Host t2240: http://lists.opensuse.org/opensuse-factory/2012-08/msg00572.html 3c9424e7-ec4f-4814-afdf-2c18951bf474__www.smolts.org = pub_36ce35ea-9740-4427-be66-ab61e6898dc6 http://lists.opensuse.org/opensuse-factory/2012-08/msg00674.html 0b32d2d3-cdb9-4658-8bab-a7fd48cbb962__www.smolts.org = pub_f981a1dd-69e0-4a87-b8c7-a2413c2bd4c8
Can you please provide /var/log/message and lspci -nvv ? Thanks!
From the latter host: http://fm.no-ip.com/Tmp/SUSE/Factory/lspci-nvv-gx150-122.txt http://fm.no-ip.com/Tmp/SUSE/Factory/messages-gx150-122.txt -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

Hi, 2012/8/30 Felix Miata <mrmazda@earthlink.net>:
Sorry, I am not sure what does meaning of those link?
Could you help to file a bug for this issue? Thanks! Regards, David
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

On 2012/08/30 17:05 (GMT+0800) David Chang composed:
2012/8/30 Felix Miata<mrmazda@earthlink.net>:
[Host gx150:]
Sorry, I am not sure what does meaning of those link?
Hardware info: http://en.opensuse.org/Smolt
Got it, but can not access the messages file.
Fixed restrictive permissions inherited from /var/log/ not noticed in migrating files to web server.
Could you help to file a bug for this issue? Thanks!
Without a reliable recreate scenario I usually don't file a bug, discussing here if necessary until that does happen, if it happens. I don't yet know whether this might be bash, zypper/libzypp, kernel, or something else. -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

Hi, 2012/8/30 Felix Miata <mrmazda@earthlink.net>:
Hardware info: http://en.opensuse.org/Smolt
Understood, thanks!
Fixed restrictive permissions inherited from /var/log/ not noticed in migrating files to web server.
I did not see the segfault message from the log. Does it belongs to the issue machine? (host t2240?)
As per the call trace message : [ 475.004049] NETDEV WATCHDOG: eth0 (e100): transmit queue 0 timed out it looks probably relative to net device. But the last message: [ 4875.923898] zypper[2291]: segfault at 0 ip 08140fd8 sp bfcdcd80 error 4 in zypper[8048000+151000] not sure what happen with zypper. Do you still encounter this issue? It would be good, if we can get the whole message log. Thanks! Regards, David
-- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org

On 2012/08/31 18:11 (GMT+0800) David Chang composed:
Felix Miata:
Fixed restrictive permissions inherited from /var/log/ not noticed in migrating files to web server.
I did not see the segfault message from the log. Does it belongs to the issue machine? (host t2240?)
http://fm.no-ip.com/Tmp/SUSE/Factory/lspci-nvv-gx150-122.txt http://fm.no-ip.com/Tmp/SUSE/Factory/messages-gx150-122.txt http://fm.no-ip.com/Tmp/SUSE/Factory/lspci-nvv-t2240-122.txt http://fm.no-ip.com/Tmp/SUSE/Factory/messages-t2240-122.txt
Both machines use e100. CMOS battery was dead in tt2240. I just changed it.
It looks to me like /var/log/messages is not capturing everything that it should. Maybe it's related to zypper/rpm mishandling immutable /etc/fonts/suse-post=user.conf trying to update fontconfig or immutable /boot/message or /etc/adjtime from whatever packages are trying to change them?
Do you still encounter this issue?
Each machine did this exactly once only that I ever noticed. I've not seen it on my other 10 or so 12.2 machines.
It would be good, if we can get the whole message log. Thanks!
Above links to each machine go back more than two months. -- "The wise are known for their understanding, and pleasant words are persuasive." Proverbs 16:21 (New Living Translation) Team OS/2 ** Reg. Linux User #211409 ** a11y rocks! Felix Miata *** http://fm.no-ip.com/ -- To unsubscribe, e-mail: opensuse-factory+unsubscribe@opensuse.org To contact the owner, e-mail: opensuse-factory+owner@opensuse.org
participants (4)
-
Cristian Rodríguez
-
David Chang
-
Felix Miata
-
Jan Engelhardt