[Bug 1096401] New: snapper: segfault found in openQA tests
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Bug ID: 1096401 Summary: snapper: segfault found in openQA tests Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: Other OS: Other Status: NEW Severity: Normal Priority: P5 - None Component: Basesystem Assignee: bnc-team-screening@forge.provo.novell.com Reporter: dimstar@opensuse.org QA Contact: qa-bugs@suse.de Found By: --- Blocker: --- While checking some logs on openQA, trying to find why so many tests are failing, I stumbled over a segfault of snapper: Jun 07 02:18:14 susetest org.opensuse.Snapper[719]: terminate called after throwing an instance of 'boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::lock_error>
' Jun 07 02:18:14 susetest org.opensuse.Snapper[719]: what(): boost: mutex lock failed in pthread_mutex_lock: Invalid argument Jun 07 02:18:14 susetest systemd[1]: Started Process Core Dump (PID 20684/UID 0). Jun 07 02:18:15 susetest systemd-coredump[20685]: Process 18707 (snapperd) of user 0 dumped core.
TIME PID UID GID SIG COREFILE EXE Thu 2018-06-07 02:09:35 EDT 2722 0 0 6 present /bin/login Thu 2018-06-07 02:14:34 EDT 14648 0 0 6 present /usr/sbin/snapperd Thu 2018-06-07 02:15:58 EDT 16462 0 0 6 present /usr/sbin/snapperd Thu 2018-06-07 02:18:15 EDT 18707 0 0 6 present /usr/sbin/snapperd This can explain why openQA has a hard time passing the tests, as the system is pre-occupied writing coredumps The three snaperd coredumps are available in the openQA provided tarball https://openqa.opensuse.org/tests/688332/file/gimp-problem_detection_logs.ta... (directory coredumps) -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Dominique Leuenberger <dimstar@opensuse.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Component|Basesystem |YaST2 Assignee|bnc-team-screening@forge.pr |yast2-maintainers@suse.de |ovo.novell.com | QA Contact|qa-bugs@suse.de |jsrain@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Dominique Leuenberger <dimstar@opensuse.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |arvin@suse.de -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 http://bugzilla.opensuse.org/show_bug.cgi?id=1096401#c1 --- Comment #1 from Dominique Leuenberger <dimstar@opensuse.org> --- [Current thread is 1 (Thread 0x7f73167c4940 (LWP 18707))] (gdb) bt #0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:51 #1 0x00007f731533ada1 in __GI_abort () at abort.c:79 #2 0x00007f731596ce83 in __gnu_cxx::__verbose_terminate_handler () at ../../../../libstdc++-v3/libsupc++/vterminate.cc:95 #3 0x00007f7315972dd8 in __cxxabiv1::__terminate(void (*)()) () at ../../../../libstdc++-v3/libsupc++/eh_terminate.cc:47 #4 0x00007f7315971ec9 in __cxa_call_terminate (ue_header=ue_header@entry=0x55cdabbc7630) at ../../../../libstdc++-v3/libsupc++/eh_call.cc:54 #5 0x00007f7315972815 in __gxx_personality_v0 () at ../../../../libstdc++-v3/libsupc++/eh_personality.cc:676 #6 0x00007f73156d1a43 in _Unwind_RaiseException_Phase2 () at ../../../libgcc/unwind.inc:64 #7 0x00007f73156d22c5 in _Unwind_Resume () at ../../../libgcc/unwind.inc:241 #8 0x00007f7316369610 in __gnu_cxx::new_allocator<char>::deallocate (this=<optimized out>, __p=<optimized out>) at /usr/include/c++/7/ext/new_allocator.h:125 #9 std::allocator_traits<std::allocator<char> >::deallocate (__a=..., __n=<optimized out>, __p=<optimized out>) at /usr/include/c++/7/bits/alloc_traits.h:462 #10 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_destroy (__size=<optimized out>, this=<optimized out>) at /usr/include/c++/7/bits/basic_string.h:226 #11 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::_M_dispose (this=<optimized out>) at /usr/include/c++/7/bits/basic_string.h:221 #12 std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >::~basic_string (this=<optimized out>, __in_chrg=<optimized out>) at /usr/include/c++/7/bits/basic_string.h:647 #13 snapper::logStreamClose (level=level@entry=snapper::MILESTONE, file=file@entry=0x7f731639ee27 "Snapper.cc", line=line@entry=136, func=func@entry=0x7f731639f448 <snapper::Snapper::~Snapper()::__FUNCTION__> "~Snapper", stream=stream@entry=0x55cdabbc0e60) at Log.cc:65 #14 0x00007f7316339794 in snapper::Snapper::~Snapper (this=0x7f730c000e80, __in_chrg=<optimized out>) at Snapper.cc:136 #15 0x000055cda9d8aeeb in MetaSnapper::~MetaSnapper (this=0x55cdabbb7c10, __in_chrg=<optimized out>) at MetaSnapper.cc:105 #16 0x000055cda9d8affc in __gnu_cxx::new_allocator<std::_List_node<MetaSnapper>
::destroy<MetaSnapper> (this=<optimized out>, __p=<optimized out>) at /usr/include/c++/7/ext/new_allocator.h:140 #17 std::allocator_traits<std::allocator<std::_List_node<MetaSnapper> > ::destroy<MetaSnapper> (__a=..., __p=<optimized out>) at /usr/include/c++/7/bits/alloc_traits.h:487 #18 std::__cxx11::_List_base<MetaSnapper, std::allocator<MetaSnapper> ::_M_clear (this=0x55cda9fa5ba0 <meta_snappers>) at /usr/include/c++/7/bits/list.tcc:76 #19 std::__cxx11::_List_base<MetaSnapper, std::allocator<MetaSnapper> ::~_List_base (this=0x55cda9fa5ba0 <meta_snappers>, __in_chrg=<optimized out>) at /usr/include/c++/7/bits/stl_list.h:442 #20 std::__cxx11::list<MetaSnapper, std::allocator<MetaSnapper> >::~list (this=0x55cda9fa5ba0 <meta_snappers>, __in_chrg=<optimized out>) at /usr/include/c++/7/bits/stl_list.h:733 #21 MetaSnappers::~MetaSnappers (this=0x55cda9fa5ba0 <meta_snappers>, __in_chrg=<optimized out>) at MetaSnapper.cc:184 #22 0x00007f731533c431 in __run_exit_handlers (status=0, listp=0x7f73156bb718 <__exit_funcs>, run_list_atexit=run_list_atexit@entry=true, run_dtors=run_dtors@entry=true) at exit.c:108 #23 0x00007f731533c52a in __GI_exit (status=<optimized out>) at exit.c:139 #24 0x00007f7315324a8e in __libc_start_main (main=0x55cda9d6f080 <main(int, char**)>, argc=1, argv=0x7ffd1fd5e648, init=<optimized out>, fini=<optimized out>, rtld_fini=<optimized out>, stack_end=0x7ffd1fd5e638) at ../csu/libc-start.c:342 #25 0x000055cda9d6f96a in _start () at ../sysdeps/x86_64/start.S:120
-- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 http://bugzilla.opensuse.org/show_bug.cgi?id=1096401#c2 --- Comment #2 from Dominique Leuenberger <dimstar@opensuse.org> --- Not sure if this did happen before - but one notable change in this snapshot would be kernel 4.16.13; might be related? -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 http://bugzilla.opensuse.org/show_bug.cgi?id=1096401#c3 Dominique Leuenberger <dimstar@opensuse.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Priority|P5 - None |P1 - Urgent CC| |dleuenberger@suse.com, | |mlin@suse.com Flags| |SHIP_STOPPER?(dleuenberger@ | |suse.com) --- Comment #3 from Dominique Leuenberger <dimstar@opensuse.org> --- This is now blocking further Tumbleweed Snapshots. Happens in all updated scnearios coming from 42.1, 42.2 and 42.3 -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Dominique Leuenberger <dleuenberger@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Flags|SHIP_STOPPER?(dleuenberger@ |SHIP_STOPPER+ |suse.com) | -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Dominique Leuenberger <dimstar@opensuse.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |kernel-maintainers@forge.pr | |ovo.novell.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 Oliver Kurz <okurz@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |okurz@suse.com -- You are receiving this mail because: You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1096401 http://bugzilla.opensuse.org/show_bug.cgi?id=1096401#c8 Oliver Kurz <okurz@suse.com> changed: What |Removed |Added ---------------------------------------------------------------------------- Blocker|--- |Yes Severity|Normal |Critical --- Comment #8 from Oliver Kurz <okurz@suse.com> --- Under my assessment we actually have quite some "unstable" openQA tests which I tried to investigate and in the end I always come back to finding the crashes mentioned in this bug so marking as "blocker". https://progress.opensuse.org/issues/35302 is an example of an "openQA test" issue where we track more "unstable tests". Maybe we can come up with some kind of "workaround" by waiting longer after seeing these coredumps in tests. That is to be handled in https://progress.opensuse.org/issues/35302 -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@novell.com