- Thread Author
- #1
NOTE: Sorry for the long post, trying to be thorough.
Just for reference, these are the specs of the machine. Everything is running stock, it's a work machine:
OS: Win 10 Pro
CPU: i7-6700K
Motherboard: ASUS Z170-A LGA 1151
RAM: 2x G.SKILL Ripjaws V Series 16GB 288-Pin DDR4
Using onboard GPU
Combination of SSDs (EVOs) and HDDs (WD Blue and Gold)
So this computer starting BSODing yesterday/night before in the middle of the night at 11pm (per event log), after having been untouched since I left at 6pm. It BSOD'd frequently in the morning when I arrived (every 30 seconds to 5 minutes) and worsened to the point where it could barely get to the log on screen.
For most of my morning troubleshooting I assumed it was due to a Win 10 update and a driver conflict. The faulting module in the memory dump was intelppm, and the minidumps showed a 124 error (WHEA_UNCORRECTABLE_ERROR (124)), so I figured the chipset and/or CPU drivers were to blame.
However, this is what I've done so far, so now I'm looking for confirmation if I should try an RMA for the CPU (using onboard gpu, so no dedicated gfx card to test):
However, with the above tests I'm pretty sure I (mostly) ruled out the RAM, ruled out the hard drives, ruled out the OS, ruled out the PSU. Even though it looks completely certain that it's the CPU, I'm just finding that hard to believe. And if it is the CPU, has anyone experienced something like this? The PC has had a rock solid uptime of about 10 months, treated very well in a clean environment, CPU always ran cool, max temp it has reached is 61C.
Anyway, any thoughts would be appreciated. I don't know how to go about and test the CPU for the purpose of an RMA process without an OS, so any clues on that would be great too!
Included are Sysinfo file for specs and one of the mini dumps in case that confirms anything (though to me the 124 error is probably way too generic).
The CPU I suspect is a 6700K, so the current 6700 that's working is a bit of a step down, but no biggie for my normal workload. Also I haven't bothered putting the second stick of RAM back in, so the specs only show 1 stick at 16 GB.
Just for reference, these are the specs of the machine. Everything is running stock, it's a work machine:
OS: Win 10 Pro
CPU: i7-6700K
Motherboard: ASUS Z170-A LGA 1151
RAM: 2x G.SKILL Ripjaws V Series 16GB 288-Pin DDR4
Using onboard GPU
Combination of SSDs (EVOs) and HDDs (WD Blue and Gold)
So this computer starting BSODing yesterday/night before in the middle of the night at 11pm (per event log), after having been untouched since I left at 6pm. It BSOD'd frequently in the morning when I arrived (every 30 seconds to 5 minutes) and worsened to the point where it could barely get to the log on screen.
For most of my morning troubleshooting I assumed it was due to a Win 10 update and a driver conflict. The faulting module in the memory dump was intelppm, and the minidumps showed a 124 error (WHEA_UNCORRECTABLE_ERROR (124)), so I figured the chipset and/or CPU drivers were to blame.
However, this is what I've done so far, so now I'm looking for confirmation if I should try an RMA for the CPU (using onboard gpu, so no dedicated gfx card to test):
- The PC has 2x sticks of RAM, so I tried each individually, no change.
- I disconnected all hard disks except the primary, no change.
- I replaced the main disk with another disk with a Win 10 install on it, no change, same BSOD.
- I put in a fresh SSD and loaded the Win 10 installer from a new Win 10 Pro Install disc. It BSOD's TWICE loading the CD interface (Win 10 not even installed yet). I didn't even know you could BSOD at this point...
- The third try it did load the CD interface and got about halfway through the install of Win 10 until it BSOD'd.
- I replaced the PSU with a known good, no change.
- I managed to find another spare machine with a 6700 (non-K) chip, and put that in my machine. Problem gone. I can now boot back into the normal Win 10 install, as well as the secondary test install disk I put in, completely stable for over 18 hours now. It basically went from 30x or so BSOD's an hour to nothing with this switch.
However, with the above tests I'm pretty sure I (mostly) ruled out the RAM, ruled out the hard drives, ruled out the OS, ruled out the PSU. Even though it looks completely certain that it's the CPU, I'm just finding that hard to believe. And if it is the CPU, has anyone experienced something like this? The PC has had a rock solid uptime of about 10 months, treated very well in a clean environment, CPU always ran cool, max temp it has reached is 61C.
Anyway, any thoughts would be appreciated. I don't know how to go about and test the CPU for the purpose of an RMA process without an OS, so any clues on that would be great too!
Included are Sysinfo file for specs and one of the mini dumps in case that confirms anything (though to me the 124 error is probably way too generic).
The CPU I suspect is a 6700K, so the current 6700 that's working is a bit of a step down, but no biggie for my normal workload. Also I haven't bothered putting the second stick of RAM back in, so the specs only show 1 stick at 16 GB.