(In reply to Stefan Schubert from comment #9) > [���] > > The three snaperd coredumps are available in the openQA provided tarball > > https://openqa.opensuse.org/tests/688332/file/gimp-problem_detection_logs. > > tar.xz > Hm, this file does not exist anymore. Could you please point us to it again ? openQA cleans up logs and then older openQA jobs after some time. I wonder why the one-month job already deleted its jobs but as a hint: Jobs are kept around longer than logs so the job https://openqa.opensuse.org/tests/688332 itself is still there. From there we can find the link to the always "latest": https://openqa.opensuse.org/tests/latest?flavor=NET&machine=64bit&arch=x86_64&distri=opensuse&version=Tumbleweed&test=update_Leap_42.3_gnome#previous this points me to the two latest failures https://openqa.opensuse.org/tests/700735 failing in gnucash and https://openqa.opensuse.org/tests/700420 failing in chromium. I think in both cases we would be able to find these logs. > Additional it would be nice if you point us to the point where it happens in > this test. (The test is quite huge :-)) > > > > (directory coredumps) In the aforementioned test failures I can find references to the core dump, e.g. in https://openqa.opensuse.org/tests/700735/file/gnucash-journal.log I see ``` Jul 03 04:10:40 susetest org.opensuse.Snapper[729]: terminate called after throwing an instance of 'boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::lock_error> >' Jul 03 04:10:40 susetest org.opensuse.Snapper[729]: what(): boost: mutex lock failed in pthread_mutex_lock: Invalid argument Jul 03 04:10:40 susetest systemd[1]: Created slice system-systemd\x2dcoredump.slice. Jul 03 04:10:40 susetest systemd[1]: Started Process Core Dump (PID 7421/UID 0). Jul 03 04:10:40 susetest systemd-coredump[7422]: Process 5743 (snapperd) of user 0 dumped core. ``` so to identify further at which point in time within the test this happens we can take a look at the timestamp. https://openqa.opensuse.org/tests/700735#step/keymap_or_locale/10 is at 03:56 so before, https://openqa.opensuse.org/tests/700735#step/consoletest_finish/17 is at 04:17 after that. Taking a look into the journal again I can find the lines ``` Jul 03 04:09:56 susetest systemd[1]: Started The Salt Minion. Jul 03 04:10:19 susetest salt-minion[7008]: [ERROR ] The Salt Master has cached the public key for this node, this salt minion will wait for 10 seconds before attempting to re-authenticate Jul 03 04:10:29 susetest salt-minion[7008]: [ERROR ] The Salt Master has cached the public key for this node, this salt minion will wait for 10 seconds before attempting to re-authenticate ``` that is just 11 seconds before the above error message at 04:10:40 and the next message after the snapper error messages is ``` Jul 03 04:11:03 susetest systemd[1]: Stopping The Salt Master Server... ``` that is 23 seconds later. With this https://openqa.opensuse.org/tests/700735#step/salt/14 is the step just in before within the test module "salt", https://openqa.opensuse.org/tests/700735#step/salt/24 is the step afterwards. I have a suspicion that the snapper core dump is related to installing packages which should also trigger snapper to create snapshots, right? E.g. from https://openqa.opensuse.org/tests/700735/file/gnucash-y2logs_clone.tar.bz2 we can also find the zypper.log stating: ``` 2018-07-03 04:09:39 <2> susetest(6262) [PLUGIN] PluginScript.cc(~PluginDumpStderr):75 ! INFO:root:creating post snapshot 2018-07-03 04:09:39 <2> susetest(6262) [PLUGIN] PluginScript.cc(~PluginDumpStderr):75 ! DEBUG:root:created post snapshot 8 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(receive):426 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py <-PluginFrame[ACK](0){0} 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginExecutor.cc(send):94 --------------- send PluginFrame[COMMITEND](0){4556} 2018-07-03 04:09:39 <1> susetest(6262) [zypp] PathInfo.cc(dirForEach):553 readdir /etc/products.d 2018-07-03 04:09:39 <1> susetest(6262) [zypp] TargetImpl.cc(buildCache):850 Read cookie: /var/cache/zypp/solv/@System/cookie{- 0644 0/0 size 52} 2018-07-03 04:09:39 <1> susetest(6262) [zypp] TargetImpl.cc(buildCache):857 Read cookie: /var/cache/zypp/solv/@System/cookie says: outdated 2018-07-03 04:09:39 <1> susetest(6262) [zypp::exec++] ExternalProgram.cc(start_program):252 Executing 'rpmdb2solv' '-r' '/' '-X' '-A' '-p' '/etc/products.d' '/var/cache/zypp/solv/@System/solv' '-o' '/var/cache/zypp/solv/@System/solvP20RfW' 2018-07-03 04:09:39 <1> susetest(6262) [zypp::exec++] ExternalProgram.cc(start_program):415 pid 6943 launched 2018-07-03 04:09:39 <1> susetest(6262) [zypp::exec++] ExternalProgram.cc(checkStatus):516 Pid 6943 successfully completed 2018-07-03 04:09:39 <1> susetest(6262) [zypp] PathInfo.cc(rename):701 rename /var/cache/zypp/solv/@System/solvP20RfW -> /var/cache/zypp/solv/@System/solv 2018-07-03 04:09:39 <1> susetest(6262) [zypp] PathInfo.cc(chmod):1051 chmod /var/cache/zypp/solv/@System/solv 00644 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginExecutor.cc(load):53 +++++++++++++++ load /usr/lib/zypp/plugins/system{d 0755 0/0} 2018-07-03 04:09:39 <1> susetest(6262) [zypp] PathInfo.cc(dirForEach):553 readdir /usr/lib/zypp/plugins/system 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginExecutor.cc(load):80 --------------- load /usr/lib/zypp/plugins/system{d 0755 0/0} 2018-07-03 04:09:39 <1> susetest(6262) [zypp] TargetImpl.cc(commit):1338 TargetImpl::commit(<pool>, CommitPolicy( DownloadInHeaps )) returns: CommitResult (total 40, done 40, error 0, skipped 0, updateMessages 0) 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginExecutor.cc(send):85 +++++++++++++++ send PluginFrame[PLUGINEND](0){0} 2018-07-03 04:09:39 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(send):272 PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py ->send PluginFrame[PLUGINEND](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(receive):426 PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py <-PluginFrame[ACK](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(send):272 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py ->send PluginFrame[PLUGINEND](0){0} 2018-07-03 04:09:40 <2> susetest(6262) [PLUGIN] PluginScript.cc(~PluginDumpStderr):75 ! INFO:root:PLUGINEND 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(receive):426 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py <-PluginFrame[ACK](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginExecutor.cc(send):94 --------------- send PluginFrame[PLUGINEND](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(close):229 Close:PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(send):272 PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py ->send PluginFrame[_DISCONNECT](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(receive):426 PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py <-PluginFrame[ACK](1){10} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(close):251 PluginScript[6281] /usr/lib/zypp/plugins/commit/btrfs-defrag-plugin.py -> [0] Disconnect 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(close):229 Close:PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(send):272 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py ->send PluginFrame[_DISCONNECT](0){0} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(receive):426 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py <-PluginFrame[ACK](1){10} 2018-07-03 04:09:40 <1> susetest(6262) [zypp::plugin++] PluginScript.cc(close):251 PluginScript[6283] /usr/lib/zypp/plugins/commit/snapper.py -> [0] Disconnect ``` so could that be https://bugzilla.suse.com/show_bug.cgi?id=1099494 "zypper calls snapper, snapshot creation failed while snapper already running"? Keep in mind that the test module "gnucash" failing is merely a coincidence. https://openqa.opensuse.org/tests/700420/file/chromium-journal.log is the journal from the second log and it shows the same symptoms, snapper core dumps next to salt services while the failing test module is actually chromium.