We had a similar problem but when we switched from 6GB of memory to 8GB of
memory it suddenly worked fine, on our xw8200.
Our xw9300 dual opteron has been much more stable than the 8000 series.
We've even had problems with Windows on the 8000 line.
I tried newer Nvidia drivers but that didn't help at all. One of our
Windows admins added more memory to take it up to 8GB and all of
a sudden it worked. He didn't do anything else to the system.
David W. Stadden, UNIX/Linux Systems Administrator
Raytheon Space & Airborne Systems
Ken Siersma
To
suse-amd64@suse.com
08/24/2005 06:19 cc
PM
Subject
[suse-amd64] Lockups with SuSE 9.3
+ Quadro FX 3400 + HP xw9300 + 7676
Hi there,
Well I'm at it again, you may remember my troubles with getting the
NVidia drivers to work on an HP xw9300 workstation with SuSE 9.2 back in
April. The machine is back in my office again, and I'm giving it
another shot. I'm also posting this message in the nvnews forums, but I
was hoping I might get some help here too (Thanks in advance Kevin and
the rest of you):
I'm trying to get the 7676 x86-64 driver running on SuSE 9.3 on an HP
xw9300 (dual Opteron 250s, 8 GB RAM, nForce Professional 2200 chipset.
I've been experiencing two different lockups. One involves only an Xid:
Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 25, L1 -> L0
Aug 24 16:53:16 ekk2 kernel: NVRM: Xid: 13, 0005 beef3097 00004097
00001748 00000000 00000002
This error locks up my opengl app momentarily, but the X server has
always recovered so far.
The other error I get starts with this message in the log:
Aug 24 17:05:26 ekk2 kernel: NVRM: not remapping 0x1000 bytes, 0x3c00000
total
Aug 24 17:05:26 ekk2 kernel: NVRM: VM: nv_vm_malloc_pages: failed to sg
map pages
Aug 24 17:05:26 ekk2 kernel: ----------- [cut here ] --------- [please
bite here ] ---------
Aug 24 17:05:26 ekk2 kernel: Kernel BUG at pageattr:154
Aug 24 17:05:26 ekk2 kernel: invalid operand: 0000 [1] SMP
Aug 24 17:05:26 ekk2 kernel: CPU 1
This error is more serious (and it seems to be more frequent too), as
the X server never recovers and I have to shut down remotely. For some
reason nvidia-bug-report.sh never finishes with the second error either.
I've attached a bug report generated after the Xid error (bug report at
http://www.nvnews.net/vbulletin/attachment.php?attachmentid=13077).
Please let me know if you have any insight.
Thank you,
Ken
--
Check the List-Unsubscribe header to unsubscribe
For additional commands, email: suse-amd64-help@suse.com