[Bug 1171807] Random crashes: "general protection fault: 0000 [#1] SMP PTI"
http://bugzilla.suse.com/show_bug.cgi?id=1171807
http://bugzilla.suse.com/show_bug.cgi?id=1171807#c5

--- Comment #5 from Reg Proctor <novell@regproctor.com> ---

The power is rock solid: the machine is on a UPS and has a good-quality power supply with more than adequate wattage.

The memory is harder to judge. The machine can sit for days without crashing and then need 3-4 boot attempts before it is up and running again (or boot up just fine without crashing), so I can't say whether there is a memory problem or not. It has passed every memory test I have run without ever reporting a problem.

I do run my machine a little faster than stock, but I don't do any custom fiddling. I just let the BIOS know that I have a water-cooled CPU and it "ups" some parameters so the CPU can run a little faster.

I do have an unusual video card. I used to have a Radeon 5870, which has 6 DP outputs, and I recently replaced it with a Radeon 7870, which is just a slightly newer model of the same thing. It is still an old model; those cards are expensive, so second-hand and not the latest is the way to go.

My motherboard is an ASUS X99-E WS, which is not specifically mentioned as made for Linux, so there could be an issue there. It is several years old now, though, and I would expect Linux to have drivers for all of its hardware by now.

I should also remove a USB stick that I tend to leave attached, since it could be a source of a hardware error.

I used to have these 3 additional boot parameters: acpi_enforce_resources=lax, pci=noaer, radeon.audio=0. When I put in the new video card, the Radeon 7870, I played with them and it seemed that I could drop pci=noaer and radeon.audio=0. Since you are mentioning AER, maybe I'll have to add pci=noaer back in. That said, I looked up AER after you mentioned it, and it seems to me that pci=noaer is a band-aid solution; what I should really do is run some diagnostics to find out which device is causing the AER problems, if that's what's happening.
However, I have no idea how to go about that. I'm more than happy to do the work to figure this out, but I think I'm going to need some guidance on which tools are available and which ones I should use. Any chance you could give me a starting point? I am a software developer (golang and PHP mostly these days, assembly in a past life too long ago to matter), so I am reasonably tech savvy.

-- 
You are receiving this mail because:
You are on the CC list for the bug.