[Bug 1215337] New: python-onnx does not build s390x

https://bugzilla.suse.com/show_bug.cgi?id=1215337 Bug ID: 1215337 Summary: python-onnx does not build s390x Classification: openSUSE Product: openSUSE Tumbleweed Version: Current Hardware: S/390-64 OS: openSUSE Tumbleweed Status: NEW Severity: Normal Priority: P5 - None Component: Development Assignee: screening-team-bugs@suse.de Reporter: azouhr@opensuse.org QA Contact: qa-bugs@suse.de CC: ada.lovelace@gmx.de, ihno@suse.com Target Milestone: --- Found By: --- Blocker: --- python-onnx does not build on openSUSE:Factory:zSystems for quite some time: osc buildhistory openSUSE:Factory:zSystems python-onnx standard s390x TIME SRCMD5 VER-REL.BUILD# REV DURATION 2019-11-26 15:59:51 c7251471230f79907f69762d42f620a6 1.6.0-1.1 1 414 2019-12-24 10:29:50 c7251471230f79907f69762d42f620a6 1.6.0-1.2 1 821 2020-01-07 20:22:00 c7251471230f79907f69762d42f620a6 1.6.0-1.3 1 138 2020-01-08 17:02:51 0147179dbf202fdba63d0887ccf75400 1.6.0-2.1 2 192 2020-01-11 08:08:49 0147179dbf202fdba63d0887ccf75400 1.6.0-2.2 2 137 2020-01-21 18:59:45 0147179dbf202fdba63d0887ccf75400 1.6.0-2.3 2 135 2020-02-05 07:12:05 0147179dbf202fdba63d0887ccf75400 1.6.0-2.4 2 668 2020-02-07 20:58:34 0147179dbf202fdba63d0887ccf75400 1.6.0-2.5 2 178 2020-02-16 16:40:23 0147179dbf202fdba63d0887ccf75400 1.6.0-2.6 2 280 2020-02-18 13:51:04 0147179dbf202fdba63d0887ccf75400 1.6.0-2.7 2 227 2020-02-19 17:01:33 0147179dbf202fdba63d0887ccf75400 1.6.0-2.8 2 264 This package is needed in order to prepare a container for onnx-mlir which is the platform that also supports IBM Telum processors on Mainframe. -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c2 --- Comment #2 from Berthold Gunreben <azouhr@opensuse.org> --- For the time being, I just disabled the tests that brake the build. Please have a look at osc rdiff openSUSE:Factory python-onnx openSUSE:Factory:zSystems I would think that IBM should be interested in debugging this themselves. One thing that surprised me is, that the build on my notebook with qemu was almost as fast as the one on real hardware in OBS. Notebook: 7817s Mainframe: 7657s I used this command to build locally: osc build --vm-type=qemu --vm-memory=12G standard s390x and of course, the notebook is rather powerful with 16 virtual cores (8 cores, SMT2, Ryzen 6900HS) -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c4 --- Comment #4 from Berthold Gunreben <azouhr@opensuse.org> --- Created attachment 869863 --> https://bugzilla.suse.com/attachment.cgi?id=869863&action=edit buildlog with failing tests -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c9 --- Comment #9 from Berthold Gunreben <azouhr@opensuse.org> --- (In reply to Christian Goll from comment #6)
Does it still not build as I am getting now: 2023-09-26 13:54:55 58e929acfff8575cc616eb942c686a64 1.12.0-2.1 2 7688
Can you please also re-check after https://build.opensuse.org/request/show/1116892 was accepted.
yes, the build worked because I disabled the failing checks. I removed the package from openSUSE:Factory:zSystems to let you see the real results, and also to pick up the new package after the SR has been accepted. I see that there has not been a test of s390x from within the SR, therefore, I have some doubts that the situation will change. -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c19 --- Comment #19 from Sarah Kriesch <ada.lovelace@gmx.de> --- Latest Assertion Errors: 465s] [31m[1m_ TestExternalDataToArray_0_protobuf.test_save_model_with_external_data_multiple_times _[0m [ 465s] [gw0] linux -- Python 3.10.14 /usr/bin/python3.10 [ 465s] [1m[31mtest/test_external_data.py[0m:685: in test_save_model_with_external_data_multiple_times [ 465s] np.testing.assert_allclose([90m[39;49;00m [ 465s] [1m[31m/usr/lib64/python3.10/contextlib.py[0m:79: in inner [ 465s] [94mreturn[39;49;00m func(*args, **kwds)[90m[39;49;00m [ 465s] [1m[31mE AssertionError: [0m [ 465s] [1m[31mE Not equal to tolerance rtol=1e-07, atol=0[0m [ 465s] [1m[31mE [0m [ 465s] [1m[31mE x and y nan location mismatch:[0m [ 465s] [1m[31mE x: array([[[ 6.166898e+34, 4.026997e-26, 1.645485e+22, ...,[0m [ 465s] [1m[31mE 1.920033e+25, 2.967944e-20, -4.614014e-04],[0m [ 465s] [1m[31mE [-1.356084e+15, -8.810646e-38, -2.061762e+04, ...,...[0m [ 465s] [1m[31mE y: array([[[5.322033e-01, 9.073346e-01, 5.053618e-01, ..., 6.152101e-01,[0m [ 465s] [1m[31mE 6.564350e-01, 4.549692e-01],[0m [ 465s] [1m[31mE [6.703315e-01, 4.256554e-01, 5.766872e-01, ..., 9.197403e-01,...[0m [ 465s] [31m[1m_____ TestExternalDataToArray_0_protobuf.test_to_array_with_external_data ______[0m [ 465s] [gw0] linux -- Python 3.10.14 /usr/bin/python3.10 [ 465s] [1m[31mtest/test_external_data.py[0m:664: in test_to_array_with_external_data [ 465s] np.testing.assert_allclose(loaded_large_data, [96mself[39;49;00m.large_data)[90m[39;49;00m [ 465s] [1m[31m/usr/lib64/python3.10/contextlib.py[0m:79: in inner [ 465s] [94mreturn[39;49;00m func(*args, **kwds)[90m[39;49;00m [ 465s] [1m[31mE AssertionError: [0m [ 465s] [1m[31mE Not equal to tolerance rtol=1e-07, atol=0[0m [ 465s] [1m[31mE [0m [ 465s] [1m[31mE x and y nan location mismatch:[0m [ 465s] [1m[31mE x: array([[[ 5.798805e-30, -1.165122e+02, 8.127961e+20, ...,[0m [ 465s] [1m[31mE 1.806727e-37, -8.921661e-06, 5.233179e-20],[0m [ 465s] [1m[31mE [-5.477265e+32, -4.729377e+18, -1.624872e+24, ...,...[0m [ 465s] [1m[31mE y: array([[[0.002852, 0.527004, 0.746832, ..., 0.459877, 0.340009,[0m [ 465s] [1m[31mE 0.630724],[0m [ 465s] [1m[31mE [0.135593, 0.767637, 0.135425, ..., 0.046752, 0.25354 ,...[0m [ 465s] [31m[1m_ TestExternalDataToArray_1_textproto.test_save_model_with_external_data_multiple_times _[0m [ 465s] [gw0] linux -- Python 3.10.14 /usr/bin/python3.10 [ 465s] [1m[31mtest/test_external_data.py[0m:685: in test_save_model_with_external_data_multiple_times [ 465s] np.testing.assert_allclose([90m[39;49;00m [ 465s] [1m[31m/usr/lib64/python3.10/contextlib.py[0m:79: in inner [ 465s] [94mreturn[39;49;00m func(*args, **kwds)[90m[39;49;00m [ 465s] [1m[31mE AssertionError: [0m [ 465s] [1m[31mE Not equal to tolerance rtol=1e-07, atol=0[0m [ 465s] [1m[31mE [0m [ 465s] [1m[31mE x and y nan location mismatch:[0m [ 465s] [1m[31mE x: array([[[ 1.639122e+29, 7.927940e-02, -1.220582e-34, ...,[0m [ 465s] [1m[31mE 5.746708e+14, 3.756967e+37, -3.229969e+24],[0m [ 465s] [1m[31mE [-1.614809e-31, 2.402144e-37, 8.529220e-11, ...,...[0m [ 465s] [1m[31mE y: array([[[9.063177e-01, 5.410980e-02, 1.856786e-01, ..., 3.320491e-01,[0m [ 465s] [1m[31mE 6.167372e-01, 4.964211e-01],[0m [ 465s] [1m[31mE [3.072628e-01, 9.829561e-01, 7.018124e-02, ..., 1.915139e-01,... [ 465s] [31m[1m__________________ test_make_tensor_raw[TensorProto.FLOAT16] ___________________[0m [ 465s] [gw0] linux -- Python 3.10.14 /usr/bin/python3.10 [ 465s] [1m[31mtest/helper_test.py[0m:928: in test_make_tensor_raw [ 465s] np.testing.assert_equal(np_array, numpy_helper.to_array(tensor))[90m[39;49;00m [ 465s] [1m[31m/usr/lib64/python3.10/contextlib.py[0m:79: in inner [ 465s] [94mreturn[39;49;00m func(*args, **kwds)[90m[39;49;00m [ 465s] [1m[31mE AssertionError: [0m [ 465s] [1m[31mE Arrays are not equal[0m [ 465s] [1m[31mE [0m [ 465s] [1m[31mE Mismatched elements: 6 / 6 (100%)[0m [ 465s] [1m[31mE Max absolute difference: 1.146[0m [ 465s] [1m[31mE Max relative difference: 1576.[0m [ 465s] [1m[31mE x: array([[ 1.309 , -0.0258, 1.1455],[0m [ 465s] [1m[31mE [ 0.3464, 0.774 , -0.7744]], dtype=float16)[0m [ 465s] [1m[31mE y: array([[ 1.060e+00, -3.735e-03, -1.278e-03],[0m [ 465s] [1m[31mE [-2.199e-04, 1.633e-01, 2.102e-01]], dtype=float16)[0m -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c24 --- Comment #24 from Sarah Kriesch <ada.lovelace@gmx.de> --- The highest available Python version in openSUSE and Fedora is 3.13. That is the build log with excluding 3.10: https://build.opensuse.org/package/live_build_log/home:AdaLovelace:branches:... That is the build log including Python 3.10, Python 3.11, Python 3.12 and Python 3.13: https://build.opensuse.org/package/live_build_log/openSUSE:Factory:zSystems/... Our Python Onnx Maintainers are mainlining/updating continuously. We are using onnx 1.16.0 at the moment. So, do you mean I should exclude Python 3.10-3.12? What is supported on your "supported" Enterprise distributions for AI on IBM Z? -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c25 --- Comment #25 from Sarah Kriesch <ada.lovelace@gmx.de> --- I want to test your referenced patch. -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c26 --- Comment #26 from Sarah Kriesch <ada.lovelace@gmx.de> --- @Andreas This bug report has been forwarded to you via the IBM Bugzilla. Therefore, you can get an openSUSE VM for development (not only Fedora) by IBM. Fedora is the partner community of openSUSE, and we are supporting each other. Please fix the bug on openSUSE Tumbleweed. As soon as python-onnx is working on openSUSE, it will be enabled for builds on Fedora also again. -- You are receiving this mail because: You are on the CC list for the bug.

https://bugzilla.suse.com/show_bug.cgi?id=1215337 https://bugzilla.suse.com/show_bug.cgi?id=1215337#c33 --- Comment #33 from OBSbugzilla Bot <bwiedemann+obsbugzillabot@suse.com> --- This is an autogenerated message for OBS integration: This bug (1215337) was mentioned in https://build.opensuse.org/request/show/1183184 Factory / python-onnx -- You are receiving this mail because: You are on the CC list for the bug.
participants (1)
-
bugzilla_noreply@suse.com