[Bug 1178453] New: [Build 20201103] gnuhealth_client crashes on start
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453

Bug ID: 1178453
Summary: [Build 20201103] gnuhealth_client crashes on start
Classification: openSUSE
Product: openSUSE Tumbleweed
Version: Current
Hardware: Other
URL: https://openqa.opensuse.org/tests/1460183/modules/gnuhealth_client_first_time/steps/2
OS: Other
Status: NEW
Severity: Normal
Priority: P5 - None
Component: Other
Assignee: screening-team-bugs@suse.de
Reporter: dimstar@opensuse.org
QA Contact: qa-bugs@suse.de
Found By: openQA
Blocker: Yes

## Observation

openQA test in scenario opensuse-Tumbleweed-DVD-x86_64-gnuhealth@64bit fails in
[gnuhealth_client_first_time](https://openqa.opensuse.org/tests/1460183/modules/gnuhealth_client_first_tim...)

## Test suite description

Maintainer: okurz@suse.de
Test scenario for gnuhealth software stack

## Reproducible

Fails since (at least) Build [20201103](https://openqa.opensuse.org/tests/1458960)

## Expected result

Last good: [20201030](https://openqa.opensuse.org/tests/1456811) (or more recent)

## Further details

Always latest result in this scenario: [latest](https://openqa.opensuse.org/tests/latest?arch=x86_64&distri=opensuse&flavor=DVD&machine=64bit&test=gnuhealth&version=Tumbleweed)

--
You are receiving this mail because:
You are on the CC list for the bug.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c4
Axel Braun
The gnuhealth-client is non-distributable, as it contains a binary blob, the camera plugin.
It comes with a __pycache__ directory, which obviously caused the problem. I have removed it and rebuilt, and the client comes up normally. Thanks for the hint!
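As a rough illustration of that kind of cleanup, stale bytecode caches can be stripped from an unpacked source tree before it is repackaged. This is a minimal sketch, not the actual change made to the package; the function name `strip_pycache` and the directory argument are assumptions.

```shell
# Hypothetical helper: delete every __pycache__ directory below the given tree.
strip_pycache() {
    tree="${1:-.}"
    # -prune stops find from descending into a directory it is about to delete.
    find "$tree" -type d -name __pycache__ -prune -exec rm -rf {} +
}
```

Called as `strip_pycache path/to/unpacked/source`, it leaves all other files untouched.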
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c6
--- Comment #6 from Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c9
--- Comment #9 from Axel Braun
This is more likely an issue with the openQA worker; OpenCV does runtime dispatching for CPU-feature-dependent code.
Can you explain this in a way a 10yr old blonde understands it? ;-) Who needs to look into this?
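To make "runtime dispatching" a little more concrete: the program carries several variants of the same routine and, when it runs, picks one based on which instruction-set extensions the CPU reports. The sketch below mimics that decision in shell. It is only an illustration of the idea, not OpenCV's actual dispatcher; the function name `pick_code_path` and the choice of flags (`avx2`, `sse4_2`, `sse2`) are assumptions.

```shell
# Hypothetical illustration of runtime dispatch: choose the "best" code path
# that the CPU flags passed as $1 allow. This is not OpenCV's real logic.
pick_code_path() {
    flags=" $1 "
    # Try the most capable variant first, fall back step by step.
    for feature in avx2 sse4_2 sse2; do
        case "$flags" in
            *" $feature "*) echo "$feature"; return 0 ;;
        esac
    done
    echo generic
}

# On Linux, the flags the (virtual) CPU advertises can be read from /proc/cpuinfo:
# pick_code_path "$(grep -m1 '^flags' /proc/cpuinfo | cut -d: -f2)"
```

Because the choice depends on what the machine advertises, the same binary can take a different code path on a different host or VM, which is how a crash can show up on one openQA worker and not on another.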
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c11
--- Comment #11 from Oliver Kurz
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c12
Axel Braun
https://openqa.opensuse.org/tests/1476138#step/gnuhealth_client_first_time/1 is strange though. It was another retriggered test which ran with `QEMUCPU=qemu64` and it showed the client just fine *confused*
Yes, and in the current build 20201115 it runs fine as well.
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c13
Oliver Kurz
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c14
Axel Braun
but that is with the workaround applied. I removed that again as Stephan Kulow suggested in https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/11414#issuecomment-728756213 to keep it failing, so as not to hide the product issue that we cannot easily work around.
axel.braun@gmx.de, assigning back to you as https://github.com/os-autoinst/os-autoinst-distri-opensuse/pull/11414#issuecomment-728717005 convinced me: If it doesn't run on certain CPUs, it should be recompiled to do so.
Hm, there is basically nothing I can do from the gnuhealth side. And I can't judge whether the openQA worker is the problem (maybe submitting invalid data?) or whether opencv processes this data incorrectly. Or it is even a different problem. As we will probably release a new gnuhealth version soon (planned for this month), I would really like to get this fixed, or the error removed by setting the host CPU as suggested originally. What can we do?
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c15
Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
Axel Braun
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c17
Stefan Brüns
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c19
Oliver Kurz
Sorry I haven't followed up on this.
I tried enabling the test suite in OpenCV, and the gapi test is also segfaulting on OBS. Unfortunately the test suite has too many external dependencies to be useful without much more work, so I have kept it disabled for now.
The gapi code seems to trigger some error, probably related to qemu/kvm, in its dispatch code (so either CPU dispatching chooses the wrong path, or the dispatched code is failing).
I will try to create a trivial reproducer in the coming days.
@dimstar, @fokurz - for runtime dispatched code it would be useful to have different (emulated) hardware, both on OBS (internal testsuite) and on openQA. Do we have something in place for this? I.e. request a host which has only SSE2.
Hm, I think what's possible is to run `osc build` locally or within suitably configured VMs. For openQA we can configure the VMs that are used for tests on the fly. E.g. with

```
openqa-clone-job --skip-chained-deps --within-instance https://openqa.opensuse.org 1583395 TEST=gnuhealth_only_sse2 QEMUCPU=base,sse2 _GROUP=0 BUILD=debug_bsc1178453
```

I could create https://openqa.opensuse.org/t1583917 which (hopefully) will spawn a test job on a machine that has only sse2 enabled. We can tweak that with the "QEMUCPU" parameter, which is passed directly to `qemu -cpu …`. Everyone with operator permissions on an openQA instance can run the same. You already have the necessary permissions.
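To compare several emulated CPU models in one go, the clone command above could be generated per model. The following is a minimal sketch under stated assumptions: the helper name `print_clone_cmds` and the `TEST=gnuhealth_<model>` naming scheme are made up for illustration, while the job id, host and other settings are copied from the command above. It only prints the commands so they can be reviewed before anything is triggered.

```shell
# Hypothetical helper: print (do not run) one openqa-clone-job command per CPU model.
# Usage: print_clone_cmds <job-id> <qemu-cpu-model>...
print_clone_cmds() {
    job_id="$1"; shift
    for cpu in "$@"; do
        # Turn e.g. "base,sse2" into "base_sse2" for use in the test name.
        suffix=$(printf '%s' "$cpu" | tr ',' '_')
        printf 'openqa-clone-job --skip-chained-deps --within-instance https://openqa.opensuse.org %s TEST=gnuhealth_%s QEMUCPU=%s _GROUP=0 BUILD=debug_bsc1178453\n' \
            "$job_id" "$suffix" "$cpu"
    done
}

print_clone_cmds 1583395 base,sse2 qemu64
```

Printing instead of executing keeps the sketch harmless; piping the output to `sh` would actually schedule the jobs and needs operator permissions on the instance.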
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c22
--- Comment #22 from Axel Braun
(In reply to Stefan Brüns from comment #17)
....
I could create https://openqa.opensuse.org/t1583917 which (hopefully) will spawn a test job on a machine that has only sse2 enabled. We can tweak that with the "QEMUCPU" parameter which is directly passed to `qemu -cpu …`
The above test fails as well. From the other two options mentioned in https://github.com/opencv/opencv/issues/19020#issuecomment-758142627:
- disable LTO
- disable dispatching in OpenCV (cmake ... -DCPU_DISPATCH=), but with SSE2 baseline performance results are not really good
(third option - code change - is not something we can consider here I assume)
we have only 'disable LTO' left as option?
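For reference, the "disable dispatching" option could be tried roughly as sketched below. The `-DCPU_DISPATCH=` setting is the one quoted from the upstream issue; pinning `-DCPU_BASELINE=SSE2` alongside it, the helper name `opencv_no_dispatch_cmd`, and the source path are assumptions for illustration. The helper only prints the command line so it can be reviewed first. The "disable LTO" option is a packaging change rather than a cmake switch (openSUSE spec files commonly use `%define _lto_cflags %{nil}` for this) and is not shown here.

```shell
# Hypothetical helper: print (do not run) a cmake command line for an OpenCV
# build with runtime dispatching switched off and an SSE2 baseline (assumed).
opencv_no_dispatch_cmd() {
    src_dir="$1"
    # An empty CPU_DISPATCH list means no dispatched variants are built.
    printf 'cmake %s -DCPU_BASELINE=SSE2 -DCPU_DISPATCH=\n' "$src_dir"
}

opencv_no_dispatch_cmd ../opencv
```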
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c23
--- Comment #23 from Stefan Brüns
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
Guillaume GARDET
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453
http://bugzilla.opensuse.org/show_bug.cgi?id=1178453#c29
--- Comment #29 from OBSbugzilla Bot