Lockups with SuSE 9.3 + Quadro FX 3400 + HP xw9300 + 7676
Hi there,

Well, I'm at it again. You may remember my troubles getting the NVidia drivers to work on an HP xw9300 workstation with SuSE 9.2 back in April. The machine is back in my office, and I'm giving it another shot. I'm also posting this message in the nvnews forums, but I was hoping I might get some help here too (thanks in advance, Kevin and the rest of you).

I'm trying to get the 7676 x86-64 driver running on SuSE 9.3 on an HP xw9300 (dual Opteron 250s, 8 GB RAM, nForce Professional 2200 chipset). I've been experiencing two different lockups. One involves only an Xid:

Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 25, L1 -> L0
Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 13, 0005 beef3097 00004097 00001748 00000000 00000002

This error locks up my OpenGL app momentarily, but the X server has always recovered so far.

The other error starts with this message in the log:

Aug 24 17:05:26 ekk2 kernel: NVRM: not remapping 0x1000 bytes, 0x3c00000 total
Aug 24 17:05:26 ekk2 kernel: NVRM: VM: nv_vm_malloc_pages: failed to sg map pages
Aug 24 17:05:26 ekk2 kernel: ----------- [cut here ] --------- [please bite here ] ---------
Aug 24 17:05:26 ekk2 kernel: Kernel BUG at pageattr:154
Aug 24 17:05:26 ekk2 kernel: invalid operand: 0000 [1] SMP
Aug 24 17:05:26 ekk2 kernel: CPU 1

This error is more serious (and it seems to be more frequent too), as the X server never recovers and I have to shut down remotely. For some reason nvidia-bug-report.sh never finishes after the second error either.

I've attached a bug report generated after the Xid error (bug report at http://www.nvnews.net/vbulletin/attachment.php?attachmentid=13077). Please let me know if you have any insight.

Thank you,
Ken
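For anyone comparing the two failure modes in a longer log, here is a minimal sketch (not part of the original report) that counts NVRM Xid events and kernel BUG assertions in syslog-style lines. The function name `summarize`, the `SAMPLE` list, and the regular expressions are illustrative assumptions based only on the line format quoted in this message; they are not an NVIDIA or SuSE tool.

```python
import re
from collections import Counter

# Assumed line shapes, taken from the log excerpts in this thread:
#   Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 13, 0005 beef3097 ...
#   Aug 24 17:05:26 ekk2 kernel: Kernel BUG at pageattr:154
XID_RE = re.compile(r"NVRM: Xid: (\d+),")
BUG_RE = re.compile(r"Kernel BUG at (\S+)")


def summarize(lines):
    """Return (xid_counts, bug_counts) for an iterable of log lines."""
    xids = Counter()   # Xid number -> occurrences
    bugs = Counter()   # "file:line" of the BUG -> occurrences
    for line in lines:
        match = XID_RE.search(line)
        if match:
            xids[int(match.group(1))] += 1
            continue
        match = BUG_RE.search(line)
        if match:
            bugs[match.group(1)] += 1
    return xids, bugs


# Hypothetical sample built from the lines quoted in this message.
SAMPLE = [
    "Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 25, L1 -> L0",
    "Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 13, 0005 beef3097 00004097 00001748 00000000 00000002",
    "Aug 24 17:05:26 ekk2 kernel: NVRM: VM: nv_vm_malloc_pages: failed to sg map pages",
    "Aug 24 17:05:26 ekk2 kernel: Kernel BUG at pageattr:154",
]

xid_counts, bug_counts = summarize(SAMPLE)
print("Xid events:", dict(xid_counts))
print("Kernel BUGs:", dict(bug_counts))
```

To run it over a real log, pass an open file object instead of `SAMPLE`, for example `summarize(open("/var/log/messages"))`; that path is an assumption about where this system writes kernel messages.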
Hi Ken,

It looks like you might be hitting the bug where the nvidia driver tries to modify the low memory. There is a patch that tries to fix this here:

http://lkml.org/lkml/2005/7/19/115

As far as I can tell, this patch hasn't been included in mainline, but your symptoms look very similar.

peter
On Thursday 25 August 2005 04:00, Peter Buckingham wrote:
> Hi Ken,
> It looks like you might be hitting the bug where the nvidia driver tries to modify the low memory. There is a patch to try to fix this here:
> http://lkml.org/lkml/2005/7/19/115
> as far as i can tell this patch hasn't been included in the mainline, but your symptoms look very similar.
That problem applies only to i386; x86-64 is not affected.

-Andi
We had a similar problem on our xw8200, but when we switched from 6GB of memory to 8GB it suddenly worked fine. Our xw9300 dual Opteron has been much more stable than the 8000 series. We've even had problems with Windows on the 8000 line. I tried newer Nvidia drivers, but that didn't help at all. One of our Windows admins added more memory to take it up to 8GB, and all of a sudden it worked. He didn't do anything else to the system.

David W. Stadden, UNIX/Linux Systems Administrator
Raytheon Space & Airborne Systems

In reply to: Ken Siersma <siersmak@ekkinc.com>, to suse-amd64@suse.com, 08/24/2005 06:19 PM
Subject: [suse-amd64] Lockups with SuSE 9.3 + Quadro FX 3400 + HP xw9300 + 7676
participants (4)
- Andi Kleen
- David W Stadden
- Ken Siersma
- Peter Buckingham