What | Removed | Added |
---|---|---|
CC | ailiopoulos@suse.com |
The first oops excerpt from the logs is the only reliable hint, the rest are just side-effects of the bug: Mar 01 15:55:44 ost kernel: BUG: unable to handle page fault for address: 000000000000394e Mar 01 15:55:44 ost kernel: #PF: supervisor read access in kernel mode Mar 01 15:55:44 ost kernel: #PF: error_code(0x0000) - not-present page Mar 01 15:55:44 ost kernel: PGD 0 P4D 0 Mar 01 15:55:44 ost kernel: Oops: 0000 [#2] PREEMPT SMP PTI Mar 01 15:55:44 ost kernel: CPU: 8 PID: 19366 Comm: tlp Tainted: P D OE 6.7.6-1-default #1 openSUSE Tumbleweed cd9816be5099dbe04750b2583fe34462de6dcdca Mar 01 15:55:44 ost kernel: Hardware name: ASUSTeK COMPUTER INC. Zephyrus M GU502GV_GU502GV/GU502GV, BIOS GU502GV.305 09/25/2019 Mar 01 15:55:44 ost kernel: RIP: 0010:simple_xattr_get+0x31/0xa0 Mar 01 15:55:44 ost kernel: Code: 00 00 41 56 49 89 ce 41 55 4c 8d 6f 08 41 54 49 89 d4 55 48 89 f5 53 48 89 fb 4c 89 ef e8 57 54 88 00 48 8b 1b 48 85 db 74 1b <48> 8b 7b 18 48 89 ee e8 63 75 85 00 85 c0 78 27 74 2b 48 8b 5b 08 Mar 01 15:55:44 ost kernel: RSP: 0018:ffffb415ca8c3b20 EFLAGS: 00010206 Mar 01 15:55:44 ost kernel: RAX: 0000000000000200 RBX: 0000000000003936 RCX: 0000000000000000 Mar 01 15:55:44 ost kernel: RDX: 0000000000000000 RSI: ffffffff99fc5446 RDI: ffff8fb6c38dac20 Mar 01 15:55:44 ost kernel: RBP: ffffffff99fc5446 R08: 0000000000000000 R09: 0000000000000000 Mar 01 15:55:44 ost kernel: R10: ffffffff99fc5446 R11: 0000000000000000 R12: 0000000000000000 Mar 01 15:55:44 ost kernel: R13: ffff8fb6c38dac20 R14: 0000000000000000 R15: ffff8fb795c31900 Mar 01 15:55:44 ost kernel: FS: 00007f55d60db600(0000) GS:ffff8fba2dc00000(0000) knlGS:0000000000000000 Mar 01 15:55:44 ost kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Mar 01 15:55:44 ost kernel: CR2: 000000000000394e CR3: 00000004278bc004 CR4: 00000000003706f0 Mar 01 15:55:44 ost kernel: Call Trace: Mar 01 15:55:44 ost kernel: <TASK> Mar 01 15:55:44 ost kernel: __vfs_getxattr+0x7f/0xb0 Mar 01 15:55:44 ost kernel: cap_inode_need_killpriv+0x1e/0x30 Mar 01 15:55:44 ost kernel: security_inode_need_killpriv+0x2d/0x50 Mar 01 15:55:44 ost kernel: dentry_needs_remove_privs+0x32/0x60 Mar 01 15:55:44 ost kernel: do_truncate+0x5b/0xd0 Mar 01 15:55:44 ost kernel: path_openat+0xfc3/0x1200 Mar 01 15:55:44 ost kernel: do_filp_open+0xb8/0x160 Mar 01 15:55:44 ost kernel: do_sys_openat2+0x91/0xc0 Mar 01 15:55:44 ost kernel: __x64_sys_openat+0x57/0xa0 Mar 01 15:55:44 ost kernel: do_syscall_64+0x61/0xe0 Mar 01 15:55:44 ost kernel: entry_SYSCALL_64_after_hwframe+0x6e/0x76 I think this happens during ltp trying to open some sysfs file with O_TRUNC. The failure is during rbtree_simple_xattr_cmp when strcmp attempts to deref xattr->name. xattr is at 0x0000000000003936 so accessing xattr->name is causing the oops. Not sure why xattrs->rb_root would be pointing there, this should have been initialized to null. If you could setup kdump [1] and reproduce the issue (with the latest stable kernel) there may be more hints there for analysis. Also please set /proc/sys/kernel/panic_on_oops = 1. [1] https://doc.opensuse.org/documentation/leap/tuning/html/book-tuning/cha-tuning-kexec.html#cha-tuning-kdump-basic