Bug ID | 1178359 |
---|---|
Summary | Unexplainable high load average |
Classification | openSUSE |
Product | openSUSE Distribution |
Version | Leap 15.2 |
Hardware | Other |
OS | openSUSE Leap 15.2 |
Status | NEW |
Severity | Normal |
Priority | P5 - None |
Component | Other |
Assignee | screening-team-bugs@suse.de |
Reporter | itteam@smartodds.co.uk |
QA Contact | qa-bugs@suse.de |
Found By | --- |
Blocker | --- |
Created attachment 843219 [details]
fig1
Overview:
We've witnessed an unusually high system load average on several recent Leap
15.2 virtual machine builds which reside under a Nutanix AHV hypervisor.
Steps to reproduce:
It's difficult to reproduce as it seems to start after several days of uptime,
if at all. But I have seen this happen on 4 systems so far.
Actual results:
On an affected VM, after some time (days or weeks), the system load average
starts jumping up in steps. On one system, the load average rose from ~0.01
(normal) to ~0.5, then to ~1.0, and so on over a period of several days, and
it is now ~5.5. The attached image (fig1) shows this behaviour over time. It
is a chart of the 5-minute load average (note: the unit on the chart is %,
where 100% = nproc). A reboot clears the problem, although I have yet to see
whether it returns.
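For readers comparing the chart with raw load figures, the conversion described in the note can be sketched as follows. This is only an illustration: `load_percent` is a hypothetical helper name, and the core count of 4 is an assumed example value, since the VMs in this report have varying core counts.

```python
def load_percent(load_avg, nproc):
    """Express a load average as a percentage, where 100% equals nproc."""
    if nproc <= 0:
        raise ValueError("nproc must be positive")
    return 100.0 * load_avg / nproc


# Assumed example: a load average of 5.5 on a hypothetical 4-core VM.
example = load_percent(5.5, 4)
print(f"{example:.1f}%")
```

On a live system the inputs could come from `os.getloadavg()` and `os.cpu_count()`.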
Some output from another VM, which is as close to a bare 15.2 installation as
we have:
# cat /proc/loadavg
2.04 2.01 2.00 1/228 4319
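For reference, the fields in that line are the 1-, 5- and 15-minute load averages, the number of currently runnable scheduling entities over the total number of scheduling entities, and the most recently assigned PID. A minimal sketch of parsing it, where `LoadAvg` and `parse_loadavg` are illustrative names rather than part of any existing tool:

```python
from typing import NamedTuple


class LoadAvg(NamedTuple):
    one: float       # 1-minute load average
    five: float      # 5-minute load average
    fifteen: float   # 15-minute load average
    runnable: int    # currently runnable scheduling entities
    total: int       # scheduling entities that currently exist
    last_pid: int    # PID of the most recently created process


def parse_loadavg(text):
    """Parse one line in the format of /proc/loadavg."""
    one, five, fifteen, entities, last_pid = text.split()
    runnable, total = entities.split("/")
    return LoadAvg(float(one), float(five), float(fifteen),
                   int(runnable), int(total), int(last_pid))


# The line quoted in this report.
sample = parse_loadavg("2.04 2.01 2.00 1/228 4319")
print(sample)
```

In the quoted sample only one entity is runnable, which is most likely the `cat` process itself, while all three averages sit at about 2.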
As the output below shows, no processes are listed as waiting to run (r) or
in uninterruptible sleep (b), nothing is waiting for IO, and there is no swap
activity.
# vmstat 1
procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu-----
r b swpd free buff cache si so bi bo in cs us sy id wa st
0 0 19200 248944 1060 1337940 0 0 3 27 11 20 1 0 99 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 349 260 1 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 318 249 0 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 338 254 0 0 100 0 0
0 0 19200 248976 1060 1337940 0 0 0 0 287 246 0 1 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 262 214 1 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 320 248 0 1 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 315 243 1 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 284 236 0 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 362 276 0 0 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 377 270 0 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 335 233 0 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 316 271 0 0 99 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 321 258 0 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 297 228 0 0 99 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 319 241 0 1 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 309 250 1 0 100 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 328 249 0 0 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 339 267 0 0 99 0 0
0 0 19200 248912 1060 1337940 0 0 0 0 360 245 0 0 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 288 238 0 0 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 338 246 0 0 100 0 0
0 0 19200 248944 1060 1337940 0 0 0 0 348 263 1 0 100 0 0
0 0 19200 248660 1060 1338132 0 0 0 1288 326 288 0 0 100 0 0
0 0 19200 248660 1060 1338132 0 0 0 0 341 258 0 1 100 0 0
0 0 19200 248692 1060 1338132 0 0 0 0 369 267 0 0 100 0 0
0 0 19200 248692 1060 1338132 0 0 0 0 330 265 0 0 100 0 0
0 0 19200 248660 1060 1338132 0 0 0 0 337 263 0 0 99 0 0
0 0 19200 248692 1060 1338132 0 0 0 0 331 258 0 0 100 0 0
0 0 19200 248660 1060 1338132 0 0 0 0 309 223 0 0 100 0 0
0 0 19200 248912 1060 1338132 0 0 0 0 306 262 0 0 100 0 0
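Because the Linux load average counts tasks in uninterruptible sleep (state D) as well as runnable tasks (state R), one way to cross-check the vmstat columns is to count per-thread states directly from procfs. The following is a rough sketch under the assumption of a standard Linux /proc layout. `task_states` and its `proc_root` parameter are illustrative names, and this is not a tool that was run for this report.

```python
import glob
import os
from collections import Counter


def task_states(proc_root="/proc"):
    """Count thread states found in <proc_root>/<pid>/task/<tid>/stat."""
    counts = Counter()
    pattern = os.path.join(proc_root, "[0-9]*", "task", "[0-9]*", "stat")
    for path in glob.glob(pattern):
        try:
            with open(path, errors="replace") as fh:
                data = fh.read()
        except OSError:
            continue  # the task exited between listing and reading
        # The command name is wrapped in parentheses and may itself contain
        # spaces or ")", so split on the last ")"; the state letter follows.
        parts = data.rsplit(")", 1)
        if len(parts) != 2:
            continue
        fields = parts[1].split()
        if fields:
            counts[fields[0]] += 1
    return counts


if __name__ == "__main__":
    states = task_states()
    print("runnable (R):", states["R"])
    print("uninterruptible (D):", states["D"])
```

Sampling this repeatedly on an affected VM would show whether any R or D tasks appear that the one-second vmstat samples miss.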
Expected results:
The system load average stays within expected levels, which allows for
effective monitoring.
Build:
Linux 5.3.18-lp152.26-default #1 SMP Mon Jun 29 14:58:38 UTC 2020 (2a0430f) x86_64 x86_64 x86_64 GNU/Linux
Nutanix AHV virtual machine, Intel(R) Xeon(R) Gold 6150 CPU, variable
memory/core count for VMs.
Additional Builds and platforms:
We have not witnessed this on Leap 15.1.