Maschine Code Exception mit F8 64bit

ensc

New member
Themenstarter
Registriert
12 Dez. 2007
Beiträge
4
,

mein gerade erst erworbenes NI04YGE stirbt im laufenden Betrieb (unter X mit intel und vesa Treiber reproduzierbar nach spaetestens 5 Minuten, beim Booten einmal gesehen) mit

Code:
CPU 1: Machine Check Exception:                4 Bank 2: b200000000010014
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor

CPU 1: Machine Check Exception:                4 Bank 5: b200003008000e0f
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
(ueber serielle Konsole erhalten, unter Fedora 8 64 bit).

Weiss jemand, welche Komponenten damit gemeint sind und ob das wirklich ein HARDWARE ERROR ist?

Das Notebook wurde vom Haendler mit einer anderen CPU (T7500 statt T7300) geupgraded.

UPDATE: neueres mcelog decodiert das zu

Code:
# mcelog --core2 --dmi 
MCE 0
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 BANK 2 TSC 3bd9608a3f
MCG status:MCIP 
MCi status:
Uncorrected error
Error enabled
Processor context corrupt
MCA: Data TLB Level-0 Error
STATUS b200000000010014 MCGSTATUS 4

MCE 1
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 BANK 5 TSC 3bd960946e
MCG status:MCIP 
MCi status:
Uncorrected error
Error enabled
Processor context corrupt
MCA: BUS Generic Generic Generic Other-transaction Request-timeout Error
BQ_DCU_READ_TYPE BQ_ERR_SINGLE_TYPE BQ_ERR_SINGLE_TYPE
response parity error bus BINIT
STATUS b200003008000e0f MCGSTATUS 4

MCE 2
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 BANK 5 TSC 3bd94c83bb
RIP !INEXACT! 10:ffffffff8100aef7 
MCG status:RIPV MCIP 
MCi status:
Uncorrected error
Error enabled
Processor context corrupt
MCA: BUS Generic Generic Generic Other-transaction Request-timeout Error
BQ_DCU_READ_TYPE <25:3> <25:3> external BINIT response parity error
STATUS b200001806000e0f MCGSTATUS 5

MCE 3
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 BANK 2 TSC 3be40a022e
MCG status:MCIP 
MCi status:
Error overflow
Uncorrected error
Error enabled
Processor context corrupt
MCA: Data TLB Level-0 Error
STATUS f200000000010014 MCGSTATUS 4

MCE 4
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 0 BANK 5 TSC 3be3f5a486
RIP !INEXACT! 33:41669c 
MCG status:RIPV MCIP 
MCi status:
Error overflow
Uncorrected error
Error enabled
Processor context corrupt
MCA: BUS Generic Generic Generic Other-transaction Request-timeout Error
BQ_DCU_READ_TYPE <25:3> <25:3> external BINIT response parity error
STATUS f200001806000e0f MCGSTATUS 5

MCE 5
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 1 BANK 5 TSC 3be40a0950
MCG status:MCIP 
MCi status:
Error overflow
Uncorrected error
Error enabled
Processor context corrupt
MCA: BUS Generic Generic Generic Other-transaction Request-timeout Error
BQ_DCU_READ_TYPE BQ_ERR_SINGLE_TYPE BQ_ERR_SINGLE_TYPE
response parity error bus BINIT
STATUS f200003008000e0f MCGSTATUS 4

knoppix (von c't 26/2007 Notfall CD) bleibt bei Starting udev hot-plug hardware detection haengen.

Wenn DualCore im BIOS ausgeschaltet ist, scheint dieser Fehler nicht mehr vorzukommen und Knoppix startet.
 
> Lass doch mal einen Memory Test laufen...!

Laeuft problemlos durch...
 
Da kann so ziemlich alles defekt sein.
Ist aber mit hoher Wahrscheinlichkeit wirklich ein Hardwareproblem,
ich würde mich also wieder an den Händler wenden.
Laut google tritt das Problem häufig auf, wenn die CPU nicht genügend Strom bekommt oder starke Spannungsschwankungen auftreten.
Wenn dein Händler den Prozessor gewechselt hat, würde ich aufgrund des "Processor Context Error" mehr auf eine fehlerhafte oder unsachgemäß eingebaute CPU tippen.

edit: Vorher aber mal Kernel und BIOS updaten!
Dazu auch:
D. Probable causes

Here is my top cause list. In general, this is not the CPU . This is:
1. Most probably, overclocking or high temperature in the case/near the RAM, CPU or chipset.
2. Very probably, the RAM chips or settings (mem testing tools may report no error but this is not sufficient, Linux/users can stress more memory than any tool).
3. Probably, the PSU that is insufficient (especially when the error occurs when accessing disks or after having added a hardware, or plugging a USB device).
4. Dometimes, cheap hardware on chipset/mobo.
5. Rarely, the BIOS wrongly configures something (my case since manually I managed to stabilize the box!).

E. How to fix it?

Iterate tries and tests!

1. Check the temperature of the CPU, chipset, RAM, the whole case. Do not overclock.
2. Check your memory with memtest. Try to lower memory banks speed, deactivate dual channel mode. Test with different RAM.
3. Remove some disks/PCI cards, unplug USB devices, use self powered USB device or a powered hub. Test with a different graphic card - not a gamer one. Test with a different (more powerfull) power supply unit.
4. Reduce speed of core components in the BIOS, increase voltage, increase delays of RAM/PCI components (I'm here ;-).
5. Test with a different mobo, a different CPU. If you arrive here, you are near to change the whole PC, and you should consider it to spare time, money, and have newer and better hardware. I'm not here :-o
 
  • ok1.de
  • ok2.de
  • thinkstore24.de
  • Preiswerte-IT - Gebrauchte Lenovo Notebooks kaufen

Werbung

Zurück
Oben