[Bug 811667] New: Kernel BUG "bad page map" / "bad page state" / "invalid opcode"
https://bugzilla.novell.com/show_bug.cgi?id=811667 https://bugzilla.novell.com/show_bug.cgi?id=811667#c0 Summary: Kernel BUG "bad page map" / "bad page state" / "invalid opcode" Classification: openSUSE Product: openSUSE 12.3 Version: Final Platform: x86-64 OS/Version: SUSE Other Status: NEW Severity: Normal Priority: P5 - None Component: Kernel AssignedTo: kernel-maintainers@forge.provo.novell.com ReportedBy: conrad-novell.com@quisquis.de QAContact: qa-bugs@suse.de Found By: --- Blocker: --- User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.32 (KHTML, like Gecko) Chrome/27.0.1425.0 Safari/537.32 SUSE/27.0.1425.0 Upgraded a server from openSUSE-12.2 to -12.3 yesterday. About 8 hours later, a kernel bug occurred. The machine has been running fine for months with openSUSE-12.2, so I don't think it's a hardware problem. 2013-03-25T22:40:04.172345+01:00 q018 kernel: [28184.984884] postgres[15945]: segfault at fffffffe90a26cfc ip 0 00000000063cc22 sp 00007fff2a936b30 error 5 in postgres[400000+531000] 2013-03-25T22:40:04.172349+01:00 q018 kernel: [28184.985135] BUG: Bad page map in process postgres pte:8000000086f6c067 pmd:84400067 2013-03-25T22:40:04.172353+01:00 q018 kernel: [28184.985143] page:ffffea0001d85fa0 count:2 mapcount:-1 mapping:ffff88008dcfcec0 index:0x6092 2013-03-25T22:40:04.172357+01:00 q018 kernel: [28184.985147] page flags: 0x20000000020038(uptodate|dirty|lru|mappedtodisk) 2013-03-25T22:40:04.172359+01:00 q018 kernel: [28184.985160] addr:00007f196bf2c000 vm_flags:000000fb anon_vma: (null) mapping:ffff880117f65ef8 index:17f 2013-03-25T22:40:04.172363+01:00 q018 kernel: [28184.985171] vma->vm_ops->fault: shm_fault+0x0/0x20 2013-03-25T22:40:04.172366+01:00 q018 kernel: [28184.985178] vma->vm_file->f_op->mmap: shm_mmap+0x0/0x70 2013-03-25T22:40:04.172368+01:00 q018 kernel: [28184.985184] Pid: 15945, comm: postgres Not tainted 3.7.10-1.1-default #1 2013-03-25T22:40:04.172371+01:00 q018 kernel: [28184.985188] Call Trace: 2013-03-25T22:40:04.172373+01:00 q018 kernel: [28184.985214] [<ffffffff81004728>] dump_trace+0x78/0x2c0 2013-03-25T22:40:04.172376+01:00 q018 kernel: [28184.985228] [<ffffffff8153b44e>] dump_stack+0x69/0x6f 2013-03-25T22:40:04.172378+01:00 q018 kernel: [28184.985243] [<ffffffff8112d6e2>] print_bad_pte+0x1e2/0x270 2013-03-25T22:40:04.172381+01:00 q018 kernel: [28184.985256] [<ffffffff8112ffae>] unmap_page_range+0x66e/0x780 2013-03-25T22:40:04.172383+01:00 q018 kernel: [28184.985269] [<ffffffff81130b02>] unmap_vmas+0x42/0x90 2013-03-25T22:40:04.172386+01:00 q018 kernel: [28184.985280] [<ffffffff81138218>] exit_mmap+0x88/0x160 2013-03-25T22:40:04.172388+01:00 q018 kernel: [28184.985292] [<ffffffff81041369>] mmput+0x59/0xf0 2013-03-25T22:40:04.172391+01:00 q018 kernel: [28184.985304] [<ffffffff810494b8>] exit_mm+0x108/0x130 2013-03-25T22:40:04.172393+01:00 q018 kernel: [28184.985316] [<ffffffff81049641>] do_exit+0x161/0x8f0 2013-03-25T22:40:04.172396+01:00 q018 kernel: [28184.985326] [<ffffffff8104a108>] do_group_exit+0x38/0xa0 2013-03-25T22:40:04.172399+01:00 q018 kernel: [28184.985337] [<ffffffff81058804>] get_signal_to_deliver+0x1b4/0x5e0 2013-03-25T22:40:04.172402+01:00 q018 kernel: [28184.985349] [<ffffffff8100231e>] do_signal+0x4e/0x990 2013-03-25T22:40:04.172404+01:00 q018 kernel: [28184.985361] [<ffffffff81002d18>] do_notify_resume+0x98/0xc0 2013-03-25T22:40:04.172407+01:00 q018 kernel: [28184.985375] [<ffffffff81547bbc>] retint_signal+0x48/0x8c 2013-03-25T22:40:04.172409+01:00 q018 kernel: [28184.985410] [<000000000063cc22>] 0x63cc21 2013-03-25T22:40:04.172411+01:00 q018 kernel: [28184.985414] Disabling lock debugging due to kernel taint 2013-03-25T22:40:04.172414+01:00 q018 kernel: [28184.985896] BUG: Bad page state in process postgres pfn:86f6c 2013-03-25T22:40:04.172417+01:00 q018 kernel: [28184.985902] page:ffffea0001d85fa0 count:0 mapcount:-1 mapping:ffff88008dcfcec0 index:0x6092 2013-03-25T22:40:04.172419+01:00 q018 kernel: [28184.985906] page flags: 0x20000000020018(uptodate|dirty|mappedtodisk) 2013-03-25T22:40:04.172422+01:00 q018 kernel: [28184.985913] Modules linked in: joydev ipt_MASQUERADE xt_TCPMSS nf_conntrack_netbios_ns nf_conntrack_broadcast xt_nat xt_pkttype xt_LOG xt_limit ts_bm xt_length iptable_nat nf_nat_ipv4 nf_nat xt_recent xt_string ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT iptable_raw xt_CT ip6table_mangle nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ip6table_filter ip6_tables xt_tcpudp iptable_filter ip_tables x_tables cpufreq_conservative cpufreq_userspace cpufreq_powersave radeon ttm drm_kms_helper drm acpi_cpufreq mperf i2c_algo_bit sp5100_tco button ohci_hcd kvm_amd pcspkr k10temp kvm pata_atiixp shpchp pci_hotplug edac_core edac_mce_amd ehci_hcd microcode tg3 usbcore usb_common i2c_piix4 sg autofs4 reiserfs dm_mod linear processor thermal_sys scsi_dh_alua scsi_dh_hp_sw scsi_dh_emc scsi_dh_rdac scsi_dh ata_generic 2013-03-25T22:40:04.172427+01:00 q018 kernel: [28184.986006] Pid: 15945, comm: postgres Tainted: G B 3.7.10-1.1-default #1 2013-03-25T22:40:04.172429+01:00 q018 kernel: [28184.986009] Call Trace: 2013-03-25T22:40:04.172432+01:00 q018 kernel: [28184.986022] [<ffffffff81004728>] dump_trace+0x78/0x2c0 2013-03-25T22:40:04.172434+01:00 q018 kernel: [28184.986033] [<ffffffff8153b44e>] dump_stack+0x69/0x6f 2013-03-25T22:40:04.172437+01:00 q018 kernel: [28184.986044] [<ffffffff8153df5b>] bad_page+0xea/0xfe 2013-03-25T22:40:04.172439+01:00 q018 kernel: [28184.986055] [<ffffffff8111144c>] free_pages_prepare+0xfc/0x110 2013-03-25T22:40:04.172442+01:00 q018 kernel: [28184.986067] [<ffffffff8111149b>] free_hot_cold_page+0x3b/0x160 2013-03-25T22:40:04.172444+01:00 q018 kernel: [28184.986077] [<ffffffff8111183e>] free_hot_cold_page_list+0x3e/0x90 2013-03-25T22:40:04.172447+01:00 q018 kernel: [28184.986088] [<ffffffff81115cbc>] release_pages+0x1dc/0x220 2013-03-25T22:40:04.172449+01:00 q018 kernel: [28184.986101] [<ffffffff811423b5>] free_pages_and_swap_cache+0x95/0xc0 2013-03-25T22:40:04.172452+01:00 q018 kernel: [28184.986118] [<ffffffff8112dfb4>] tlb_flush_mmu+0x54/0x90 2013-03-25T22:40:04.172454+01:00 q018 kernel: [28184.986129] [<ffffffff8112e006>] tlb_finish_mmu+0x16/0x50 2013-03-25T22:40:04.172457+01:00 q018 kernel: [28184.986139] [<ffffffff81138241>] exit_mmap+0xb1/0x160 2013-03-25T22:40:04.172459+01:00 q018 kernel: [28184.986149] [<ffffffff81041369>] mmput+0x59/0xf0 2013-03-25T22:40:04.172462+01:00 q018 kernel: [28184.986160] [<ffffffff810494b8>] exit_mm+0x108/0x130 2013-03-25T22:40:04.172464+01:00 q018 kernel: [28184.986171] [<ffffffff81049641>] do_exit+0x161/0x8f0 2013-03-25T22:40:04.172467+01:00 q018 kernel: [28184.986181] [<ffffffff8104a108>] do_group_exit+0x38/0xa0 2013-03-25T22:40:04.172469+01:00 q018 kernel: [28184.986191] [<ffffffff81058804>] get_signal_to_deliver+0x1b4/0x5e0 2013-03-25T22:40:04.172472+01:00 q018 kernel: [28184.986202] [<ffffffff8100231e>] do_signal+0x4e/0x990 2013-03-25T22:40:04.172474+01:00 q018 kernel: [28184.986213] [<ffffffff81002d18>] do_notify_resume+0x98/0xc0 2013-03-25T22:40:04.172477+01:00 q018 kernel: [28184.986227] [<ffffffff81547bbc>] retint_signal+0x48/0x8c 2013-03-25T22:40:04.172479+01:00 q018 kernel: [28184.986243] [<000000000063cc22>] 0x63cc21 2013-03-25T22:40:04.271678+01:00 q018 kernel: [28185.086765] ------------[ cut here ]------------ 2013-03-25T22:40:04.271703+01:00 q018 kernel: [28185.086803] kernel BUG at /home/abuild/rpmbuild/BUILD/kernel-default-3.7.10/linux-3.7/mm/filemap.c:133! 2013-03-25T22:40:04.271706+01:00 q018 kernel: [28185.086854] invalid opcode: 0000 [#1] SMP 2013-03-25T22:40:04.271708+01:00 q018 kernel: [28185.086884] Modules linked in: joydev ipt_MASQUERADE xt_TCPMSS nf_conntrack_netbios_ns nf_conntrack_broadcast xt_nat xt_pkttype xt_LOG xt_limit ts_bm xt_length iptable_nat nf_nat_ipv4 nf_nat xt_recent xt_string ip6t_REJECT nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT iptable_raw xt_CT ip6table_mangle nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ip6table_filter ip6_tables xt_tcpudp iptable_filter ip_tables x_tables cpufreq_conservative cpufreq_userspace cpufreq_powersave radeon ttm drm_kms_helper drm acpi_cpufreq mperf i2c_algo_bit sp5100_tco button ohci_hcd kvm_amd pcspkr k10temp kvm pata_atiixp shpchp pci_hotplug edac_core edac_mce_amd ehci_hcd microcode tg3 usbcore usb_common i2c_piix4 sg autofs4 reiserfs dm_mod linear processor thermal_sys scsi_dh_alua scsi_dh_hp_sw scsi_dh_emc scsi_dh_rdac scsi_dh ata_generic 2013-03-25T22:40:04.271710+01:00 q018 kernel: [28185.087306] CPU 0 2013-03-25T22:40:04.271711+01:00 q018 kernel: [28185.087318] Pid: 1792, comm: postgres Tainted: G B 3.7.10-1.1-default #1 HP ProLiant MicroServer 2013-03-25T22:40:04.271713+01:00 q018 kernel: [28185.087388] RIP: 0010:[<ffffffff8110a42a>] [<ffffffff8110a42a>] __delete_from_page_cache+0x14a/0x150 2013-03-25T22:40:04.271715+01:00 q018 kernel: [28185.087449] RSP: 0018:ffff8800d79f7c28 EFLAGS: 00010046 2013-03-25T22:40:04.271716+01:00 q018 kernel: [28185.087478] RAX: 0000000000000000 RBX: ffffea0002b85fa0 RCX: 00000000ffffffe8 2013-03-25T22:40:04.271718+01:00 q018 kernel: [28185.087507] RDX: 0000000000000018 RSI: 0000000000000017 RDI: ffff88011ffec6c0 2013-03-25T22:40:04.271719+01:00 q018 kernel: [28185.087536] RBP: ffff880117f65ef8 R08: fec0000000000000 R09: a8000ae17f600000 2013-03-25T22:40:04.271720+01:00 q018 kernel: [28185.087565] R10: 57ffdf1e83585fd8 R11: 0000000000000210 R12: 0000000000000000 2013-03-25T22:40:04.271722+01:00 q018 kernel: [28185.087593] R13: ffff8800d79f7ce8 R14: 0000000000000007 R15: 000000000000017f 2013-03-25T22:40:04.271723+01:00 q018 kernel: [28185.087624] FS: 00007f197157a7c0(0000) GS:ffff88011fc00000(0000) knlGS:00000000f73b86c0 2013-03-25T22:40:04.271725+01:00 q018 kernel: [28185.087672] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 2013-03-25T22:40:04.271726+01:00 q018 kernel: [28185.087700] CR2: 0000000000b4bfa0 CR3: 00000000d787e000 CR4: 00000000000007f0 2013-03-25T22:40:04.271727+01:00 q018 kernel: [28185.087729] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 2013-03-25T22:40:04.271729+01:00 q018 kernel: [28185.087758] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 2013-03-25T22:40:04.271730+01:00 q018 kernel: [28185.087788] Process postgres (pid: 1792, threadinfo ffff8800d79f6000, task ffff8801138ec580) 2013-03-25T22:40:04.271731+01:00 q018 kernel: [28185.087835] Stack: 2013-03-25T22:40:04.271733+01:00 q018 kernel: [28185.087858] 000000000000017f ffffea0002b85fa0 ffff880117f65ef8 ffffffff8110a599 2013-03-25T22:40:04.271734+01:00 q018 kernel: [28185.087913] ffffea0002b85fa0 ffff880117f65ef8 ffffffffffffffff ffffffff81116b95 2013-03-25T22:40:04.271736+01:00 q018 kernel: [28185.087966] ffffea0002b85fd8 ffff8800d79f7d58 0000000000000000 ffffffff81121a4f 2013-03-25T22:40:04.271737+01:00 q018 kernel: [28185.088020] Call Trace: 2013-03-25T22:40:04.271739+01:00 q018 kernel: [28185.088067] [<ffffffff8110a599>] delete_from_page_cache+0x39/0x80 2013-03-25T22:40:04.271740+01:00 q018 kernel: [28185.088104] [<ffffffff81116b95>] truncate_inode_page+0x55/0x80 2013-03-25T22:40:04.271741+01:00 q018 kernel: [28185.088140] [<ffffffff81121a4f>] shmem_undo_range+0x32f/0x780 2013-03-25T22:40:04.271743+01:00 q018 kernel: [28185.088174] [<ffffffff81121eab>] shmem_truncate_range+0xb/0x30 2013-03-25T22:40:04.271744+01:00 q018 kernel: [28185.088208] [<ffffffff811220c7>] shmem_evict_inode+0xd7/0x130 2013-03-25T22:40:04.271746+01:00 q018 kernel: [28185.088245] [<ffffffff8117f5f3>] evict+0xa3/0x190 2013-03-25T22:40:04.271747+01:00 q018 kernel: [28185.088280] [<ffffffff8117ba48>] d_kill+0xe8/0x140 2013-03-25T22:40:04.271748+01:00 q018 kernel: [28185.088316] [<ffffffff8117c112>] dput+0xd2/0x1b0 2013-03-25T22:40:04.271750+01:00 q018 kernel: [28185.088351] [<ffffffff81168208>] __fput+0x148/0x230 2013-03-25T22:40:04.271751+01:00 q018 kernel: [28185.088385] [<ffffffff81063237>] task_work_run+0x97/0xd0 2013-03-25T22:40:04.271752+01:00 q018 kernel: [28185.088422] [<ffffffff81002d09>] do_notify_resume+0x89/0xc0 2013-03-25T22:40:04.271754+01:00 q018 kernel: [28185.088462] [<ffffffff8154f5aa>] int_signal+0x12/0x17 2013-03-25T22:40:04.271755+01:00 q018 kernel: [28185.088505] [<00007f196f696157>] 0x7f196f696156 2013-03-25T22:40:04.271757+01:00 q018 kernel: [28185.088531] Code: 85 f6 0f 88 22 ff ff ff 48 89 df e8 c1 a3 05 00 e9 15 ff ff ff 0f 1f 40 00 be 17 00 00 00 48 89 df e8 4b be 01 00 e9 34 ff ff ff <0f> 0b 0f 1f 40 00 48 83 ec 28 83 e2 fd 4c 89 6c 24 18 49 89 fd 2013-03-25T22:40:04.271759+01:00 q018 kernel: [28185.088718] RIP [<ffffffff8110a42a>] __delete_from_page_cache+0x14a/0x150 2013-03-25T22:40:04.271760+01:00 q018 kernel: [28185.088751] RSP <ffff8800d79f7c28> 2013-03-25T22:40:04.271761+01:00 q018 kernel: [28185.089058] ---[ end trace 97ae46d0bbac0f94 ]--- /proc/cpuinfo: processor : 0 vendor_id : AuthenticAMD cpu family : 16 model : 6 model name : AMD Turion(tm) II Neo N40L Dual-Core Processor stepping : 3 microcode : 0x10000c8 cpu MHz : 800.000 cache size : 1024 KB physical id : 0 siblings : 2 core id : 0 cpu cores : 2 apicid : 0 initial apicid : 0 fpu : yes fpu_exception : yes cpuid level : 5 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt nodeid_msr hw_pstate npt lbrv svm_lock nrip_save bogomips : 2995.15 TLB size : 1024 4K pages clflush size : 64 cache_alignment : 64 address sizes : 48 bits physical, 48 bits virtual power management: ts ttp tm stc 100mhzsteps hwpstate processor : 1 vendor_id : AuthenticAMD cpu family : 16 model : 6 model name : AMD Turion(tm) II Neo N40L Dual-Core Processor stepping : 3 microcode : 0x10000c8 cpu MHz : 800.000 cache size : 1024 KB physical id : 0 siblings : 2 core id : 1 cpu cores : 2 apicid : 1 initial apicid : 1 fpu : yes fpu_exception : yes cpuid level : 5 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid pni monitor cx16 popcnt lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt nodeid_msr hw_pstate npt lbrv svm_lock nrip_save bogomips : 2995.15 TLB size : 1024 4K pages clflush size : 64 cache_alignment : 64 address sizes : 48 bits physical, 48 bits virtual power management: ts ttp tm stc 100mhzsteps hwpstate Reproducible: Didn't try Steps to Reproduce: I'll keep an eye on the logs and report back if this happens again. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c1
Michal Hocko
[28184.985135] BUG: Bad page map in process postgres pte:8000000086f6c067 pmd:84400067 [28184.985143] page:ffffea0001d85fa0 count:2 mapcount:-1 mapping:ffff88008dcfcec0 index:0x6092 [28184.985147] page flags: 0x20000000020038(uptodate|dirty|lru|mappedtodisk) [28184.985160] addr:00007f196bf2c000 vm_flags:000000fb anon_vma: (null) mapping:ffff880117f65ef8 index:17f [28184.985171] vma->vm_ops->fault: shm_fault+0x0/0x20 [28184.985178] vma->vm_file->f_op->mmap: shm_mmap+0x0/0x70
Which is an output from print_bad_pte called during page table tear down. Interesting thing is that the page ref count is still 2. The page is associated with shmem backed memory and it is still on the LRU list. Even more interesting thing is that page->mapping != vma->vm_file->f_mapping (ffff88008dcfcec0 vs ffff880117f65ef8) Later on we tried to remove a shmem file (because the last reference to the file was dropped) and we encountered a mapped page while trying to remove it from the page cache. This shouldn't happen, however, because truncate_inode_page unmaps the page first before it gets down to delete_from_page_cache. It is holding page lock while it is doing that. So somebody much be racing with this removal without holding page lock. [...]
[28185.086803] kernel BUG at /home/abuild/rpmbuild/BUILD/kernel-default-3.7.10/linux-3.7/mm/filemap.c:133! [28185.086854] invalid opcode: 0000 [#1] SMP [28185.086884] Modules linked in: [...] [28185.087306] CPU 0 [28185.087318] Pid: 1792, comm: postgres Tainted: G B 3.7.10-1.1-default #1 HP ProLiant MicroServer [28185.087388] RIP: 0010:[<ffffffff8110a42a>] [<ffffffff8110a42a>] __delete_from_page_cache+0x14a/0x150 [28185.087449] RSP: 0018:ffff8800d79f7c28 EFLAGS: 00010046 [28185.087478] RAX: 0000000000000000 RBX: ffffea0002b85fa0 RCX: 00000000ffffffe8 [28185.087507] RDX: 0000000000000018 RSI: 0000000000000017 RDI: ffff88011ffec6c0 [28185.087536] RBP: ffff880117f65ef8 R08: fec0000000000000 R09: a8000ae17f600000 [28185.087565] R10: 57ffdf1e83585fd8 R11: 0000000000000210 R12: 0000000000000000 [28185.087593] R13: ffff8800d79f7ce8 R14: 0000000000000007 R15: 000000000000017f [28185.087624] FS: 00007f197157a7c0(0000) GS:ffff88011fc00000(0000) knlGS:00000000f73b86c0 [28185.087672] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [28185.087700] CR2: 0000000000b4bfa0 CR3: 00000000d787e000 CR4: 00000000000007f0 [28185.087729] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [28185.087758] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [28185.087788] Process postgres (pid: 1792, threadinfo ffff8800d79f6000, task ffff8801138ec580) [28185.087835] Stack: [28185.087858] 000000000000017f ffffea0002b85fa0 ffff880117f65ef8 ffffffff8110a599 [28185.087913] ffffea0002b85fa0 ffff880117f65ef8 ffffffffffffffff ffffffff81116b95 [28185.087966] ffffea0002b85fd8 ffff8800d79f7d58 0000000000000000 ffffffff81121a4f [28185.088020] Call Trace: [28185.088067] [<ffffffff8110a599>] delete_from_page_cache+0x39/0x80 [28185.088104] [<ffffffff81116b95>] truncate_inode_page+0x55/0x80 [28185.088140] [<ffffffff81121a4f>] shmem_undo_range+0x32f/0x780 [28185.088174] [<ffffffff81121eab>] shmem_truncate_range+0xb/0x30 [28185.088208] [<ffffffff811220c7>] shmem_evict_inode+0xd7/0x130 [28185.088245] [<ffffffff8117f5f3>] evict+0xa3/0x190 [28185.088280] [<ffffffff8117ba48>] d_kill+0xe8/0x140 [28185.088316] [<ffffffff8117c112>] dput+0xd2/0x1b0 [28185.088351] [<ffffffff81168208>] __fput+0x148/0x230 [28185.088385] [<ffffffff81063237>] task_work_run+0x97/0xd0 [28185.088422] [<ffffffff81002d09>] do_notify_resume+0x89/0xc0 [28185.088462] [<ffffffff8154f5aa>] int_signal+0x12/0x17 [28185.088505] [<00007f196f696157>] 0x7f196f696156 [28185.088531] Code: 85 f6 0f 88 22 ff ff ff 48 89 df e8 c1 a3 05 00 e9 15 ff ff ff 0f 1f 40 00 be 17 00 00 00 48 89 df e8 4b be 01 00 e9 34 ff ff ff <0f> 0b 0f 1f 40 00 48 83 ec 28 83 e2 fd 4c 89 6c 24 18 49 89 fd [28185.088718] RIP [<ffffffff8110a42a>] __delete_from_page_cache+0x14a/0x150 [28185.088751] RSP <ffff8800d79f7c28> [28185.089058] ---[ end trace 97ae46d0bbac0f94 ]---
The mapping is in RBP register (from disassemle of __delete_from_page_cache) and this matches vma->vm_file->f_mapping. OK, this all sounds fishy but I would like to know more - what is the page->mapping and who uses that shmem segment. I have recently seen DRM code doing some nasty things about mappings (see bug 807850). This doesn't look like the same issue - at least not now - but who knows. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c2
--- Comment #2 from Peter Conrad
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c3
--- Comment #3 from Michal Hocko
Thanks for the detailed explanation.
I'm not familiar with kdump and the machine has been stable for a week now, so I don't know if it's worth the trouble.
All you need is to setup kdump in Yast (System tab -> Kernel Kdump). It will store the kernel memory image when the kernel crashes and this would help us examine the postmortem condition. -- Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c4
--- Comment #4 from Peter Conrad
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c5
--- Comment #5 from Peter Conrad
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c6
--- Comment #6 from Michal Hocko
https://bugzilla.novell.com/show_bug.cgi?id=811667
https://bugzilla.novell.com/show_bug.cgi?id=811667#c7
Peter Conrad
participants (1)
-
bugzilla_noreply@novell.com