Mailinglist Archive: opensuse-amd64 (130 mails)

< Previous Next >
RE: [suse-amd64] Tyan K8S >4GB
  • From: "Santiago Flores" <santi@xxxxxxxxxx>
  • Date: Mon, 15 Sep 2003 15:06:27 +0000 (UTC)
  • Message-id: <DFEFIPHEDHBDBJKCDBOBIEFODDAA.santi@xxxxxxxxxx>
LSI Logic 320-4x
http://www.lsilogic.com/products/stor_prod/raid/3204x.html

-----Original Message-----
From: Andreas Jaeger [mailto:aj@xxxxxxx]
Sent: Monday, September 15, 2003 8:00 AM
To: Santiago Flores
Cc: suse-amd64@xxxxxxxx
Subject: Re: [suse-amd64] Tyan K8S >4GB


"Santiago Flores" <santi@xxxxxxxxxx> writes:

> Thanks so much for looking at the outputs. I will grab the new kernel.
> Interesting developments are as follows:
>
> I final got a RAID card I had been waiting for. I installed the card and

Which RAID card is this?

Andreas

> attempted to boot the system. The system would not post. I removed the new
> controller and everything else. Eventually got down to one CPU and one
DIMM.
> The board died. We are in the process of RMAing it. Hopefully this will
> solve all of the problems we were seeing. Thanks so much for everyone's
> help.
>
> If any info is needed on the LSI 320-4x on an TYAN S2880 with 2 procs and
> 6GB DDR, I should be able to tell soon.
>
> Thanks!
>
> Santiago
>
> -----Original Message-----
> From: Andreas Jaeger [mailto:aj@xxxxxxx]
> Sent: Saturday, September 13, 2003 7:50 AM
> To: Santiago Flores
> Cc: suse-amd64@xxxxxxxx
> Subject: Re: [suse-amd64] Tyan K8S >4GB
>
>
>
> Looking at this one, this might be broken hardware - or an too old
> kernel. Can you try the latest ones from
> ftp.suse.com/pub/suse/x86-64/supplementary/kernel
>
> Do they work better?
>
> Andreas
>
>> MCG_STATUS: unrecoverable
>> Northbridge Machine Check exception f435a00077080a13 0
>> Lost at least one NB error condition
>> Uncorrectable condition
>> Unrecoverable condition
>> Northbridge status f435a00077080a13
>> ECC syndrome bits 776b
>> extended error chipkill ecc error
>> link number 0
>> uncorrected ecc error
>> error address valid
>> error enable
>> error uncorrected
>> error overflow
>> previous error lost
>> error address 0000000100100048
>> Address: 0000000100100048
>> MCE at EIP ffffffff8010ce2f ESP 100efd43fd8
>> CPU 1: Machine Check Exception: 0000000000000000
>> Kernel panic: Unable to continue
>> In idle task - not syncing
>> NMI Watchdog detected LOCKUP on CPU1, eip ffffffff801191cc, registers:
>> CPU 1
>> Pid: 0, comm: swapper Not tainted
>> RIP: 0010:[<ffffffff801191cc>]{.text.lock.smp+23}
>> RSP: 0018:00000100efd43d88 EFLAGS: 00000086
>> RAX: 0000000000000000 RBX: ffffffff802e43da RCX: 0000000000000000
>> RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffffff80119060
>> RBP: 0000000000000005 R08: 0000000000000001 R09: 0000000000000000
>> R10: 0000000000000000 R11: ffffffff803e55b0 R12: 0000000000000411
>> R13: 0000000000000010 R14: 0000000000000000 R15: 0000000000000001
>> FS: 0000000000000000(0000) GS:ffffffff804bb880(0000)
> knlGS:0000000000000000
>> CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
>> CR2: 0000000000000000 CR3: 00000000ef956000 CR4: 00000000000006e0
>>
>> Call Trace: <EOE> [<ffffffff80119060>]{stop_this_cpu+0}
>> [<ffffffff801190a9>]{smp_send_stop+25}
> [<ffffffff8012204d>]{panic+285}
>> [<ffffffff801225fe>]{__call_console_drivers+62}
> [<ffffffff8011c4f1>]{check_k8_nb+625}
>> [<ffffffff8011c164>]{generic_machine_check+404}
> [<ffffffff8011c206>]{do_machine_check+86}
>> [<ffffffff8010ce10>]{default_idle+0}
> [<ffffffff8010f7c2>]{error_exit+0}
>> [<ffffffff8010ce10>]{default_idle+0}
> [<ffffffff8010ce2f>]{default_idle+31}
>> [<ffffffff8010ce9a>]{cpu_idle+42}
>> Process swapper (pid: 0, stackpage=100efd43000)
>> Stack: 00000100efd43d88 0000000000000018 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> 0000000000000000 0000000000000000 0000000000000000
0000000000000000
>> Call Trace: <EOE> [<ffffffff80119060>]{stop_this_cpu+0}
>> [<ffffffff801190a9>]{smp_send_stop+25}
> [<ffffffff8012204d>]{panic+285}
>> [<ffffffff801225fe>]{__call_console_drivers+62}
> [<ffffffff8011c4f1>]{check_k8_nb+625}
>> [<ffffffff8011c164>]{generic_machine_check+404}
> [<ffffffff8011c206>]{do_machine_check+86}
>> [<ffffffff8010ce10>]{default_idle+0}
> [<ffffffff8010f7c2>]{error_exit+0}
>> [<ffffffff8010ce10>]{default_idle+0}
> [<ffffffff8010ce2f>]{default_idle+31}
>> [<ffffffff8010ce9a>]{cpu_idle+42}
>>
>> Code: f3 90 7e f5 e9 13 fe ff ff 90 90 90 90 90 90 90 90 90 90 90
>> console shuts up ...
>>
> Andreas

Andreas
--
Andreas Jaeger, aj@xxxxxxx, http://www.suse.de/~aj
SuSE Linux AG, Deutschherrnstr. 15-19, 90429 N├╝rnberg, Germany
GPG fingerprint = 93A3 365E CE47 B889 DF7F FED1 389A 563C C272 A126


< Previous Next >
Follow Ups
References