Disk01 (New Member, joined Nov 13, 2015, 6 messages)
Hey, hopefully someone here will be able to help me out.

I bought a new GPU, a GTX 980, and after 3 days I got my first BSOD. A few days later I got another BSOD in WoW after playing for only a few minutes. After that, the first game I started after booting up my PC would completely freeze the system after about 5 minutes of gaming.
I waited about 10-15 minutes and, since I didn't get a BSOD, I just restarted my system. After the restart, the same game that had previously crashed would work flawlessly with no problems, until the next time I shut down my PC and powered it up again.
This only ever seemed to happen after a PC boot. After the system freeze and forced restart, I could play the same game for hours on end with no issues whatsoever.

Here are the 2 BSODs that I got, in order.
First one:

Code:
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

Use !analyze -v to get detailed debugging information.

BugCheck 124, {4, fffffa800f5cc038, 0, 0}

Probably caused by : GenuineIntel

Followup:     MachineOwner
---------

2: kd> !analyze -v
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000004, PCI Express Error
Arg2: fffffa800f5cc038, Address of the WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000

Debugging Details:
------------------


SYSTEM_SKU:  To be filled by O.E.M.

SYSTEM_VERSION:  3.0

BIOS_DATE:  05/29/2015

BASEBOARD_PRODUCT:  X99S GAMING 7 (MS-7885)

BASEBOARD_VERSION:  3.0

BUGCHECK_P1: 4

BUGCHECK_P2: fffffa800f5cc038

BUGCHECK_P3: 0

BUGCHECK_P4: 0

BUGCHECK_STR:  0x124_GenuineIntel

CPU_COUNT: c

CPU_MHZ: ce4

CPU_VENDOR:  GenuineIntel

CPU_FAMILY: 6

CPU_MODEL: 3f

CPU_STEPPING: 2

CUSTOMER_CRASH_COUNT:  1

DEFAULT_BUCKET_ID:  WIN7_DRIVER_FAULT

PROCESS_NAME:  Wow-64.exe

CURRENT_IRQL:  a

ANALYSIS_VERSION: 10.0.10240.9 amd64fre

STACK_TEXT:
fffff880`0c41f968 fffff800`03218a3b : 00000000`00000124 00000000`00000004 fffffa80`0f5cc038 00000000`00000000 : nt!KeBugCheckEx
fffff880`0c41f970 fffff800`02da7e63 : 00000000`00000001 fffffa80`0f5b3b10 00000000`00000000 fffffa80`0f5b31b0 : hal!HalBugCheckSystem+0x1e3
fffff880`0c41f9b0 fffff880`00e09bcf : fffffa80`00000750 fffffa80`0f5b3b10 00000000`00000000 fffffa80`0f5cbab0 : nt!WheaReportHwError+0x263
fffff880`0c41fa10 fffff880`00e095f6 : 00000000`00000000 fffff880`0c41fb60 fffffa80`0f698d80 fffff880`0c41fae0 : pci!ExpressRootPortAerInterruptRoutine+0x27f
fffff880`0c41fa70 fffff800`02c8a05c : 00000000`0d839f00 fffff880`0c41fb60 fffffa80`0f698d80 00000000`0d68f001 : pci!ExpressRootPortInterruptRoutine+0x36
fffff880`0c41fae0 000007fe`f6f8ed3c : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiInterruptDispatch+0x16c
00000000`0030ee50 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : 0x000007fe`f6f8ed3c


STACK_COMMAND:  kb

FOLLOWUP_NAME:  MachineOwner

MODULE_NAME: GenuineIntel

IMAGE_NAME:  GenuineIntel

DEBUG_FLR_IMAGE_TIMESTAMP:  0

IMAGE_VERSION:

FAILURE_BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

PRIMARY_PROBLEM_CLASS:  X64_0x124_GenuineIntel_PCIEXPRESS

ANALYSIS_SOURCE:  KM

FAILURE_ID_HASH_STRING:  km:x64_0x124_genuineintel_pciexpress

FAILURE_ID_HASH:  {3fd82133-403a-028a-6939-36a9eccfc17d}

Followup:     MachineOwner

Second one:

Code:
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

Use !analyze -v to get detailed debugging information.

BugCheck 124, {4, fffffa800f62a038, 0, 0}

Probably caused by : GenuineIntel

Followup:     MachineOwner
---------

2: kd> !analyze -v
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000004, PCI Express Error
Arg2: fffffa800f62a038, Address of the WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000

Debugging Details:
------------------


SYSTEM_SKU:  To be filled by O.E.M.

SYSTEM_VERSION:  3.0

BIOS_DATE:  05/29/2015

BASEBOARD_PRODUCT:  X99S GAMING 7 (MS-7885)

BASEBOARD_VERSION:  3.0

BUGCHECK_P1: 4

BUGCHECK_P2: fffffa800f62a038

BUGCHECK_P3: 0

BUGCHECK_P4: 0

BUGCHECK_STR:  0x124_GenuineIntel

CPU_COUNT: c

CPU_MHZ: ce4

CPU_VENDOR:  GenuineIntel

CPU_FAMILY: 6

CPU_MODEL: 3f

CPU_STEPPING: 2

CUSTOMER_CRASH_COUNT:  1

DEFAULT_BUCKET_ID:  WIN7_DRIVER_FAULT

PROCESS_NAME:  svchost.exe

CURRENT_IRQL:  a

ANALYSIS_VERSION: 10.0.10240.9 amd64fre

STACK_TEXT:
fffff880`0dbff378 fffff800`03609a3b : 00000000`00000124 00000000`00000004 fffffa80`0f62a038 00000000`00000000 : nt!KeBugCheckEx
fffff880`0dbff380 fffff800`03198e63 : 00000000`00000001 fffffa80`0f5dbb10 00000000`00000000 fffffa80`0f5db1b0 : hal!HalBugCheckSystem+0x1e3
fffff880`0dbff3c0 fffff880`00f9fbcf : fffffa80`00000750 fffffa80`0f5dbb10 00000000`00000000 fffffa80`0f629ab0 : nt!WheaReportHwError+0x263
fffff880`0dbff420 fffff880`00f9f5f6 : 00000000`00000000 fffff880`0dbff570 fffffa80`0f70dd80 fffff880`0dbff4f0 : pci!ExpressRootPortAerInterruptRoutine+0x27f
fffff880`0dbff480 fffff800`0307b05c : 00000000`00000000 00000000`00000001 fffffa80`0f70dd80 00000000`00000001 : pci!ExpressRootPortInterruptRoutine+0x36
fffff880`0dbff4f0 fffff800`030534a4 : 00000000`00000000 fffff800`0307ae67 fffffa80`0f63aa88 00000000`00000000 : nt!KiInterruptDispatch+0x16c
fffff880`0dbff680 fffff800`0338c50f : 00000000`00000000 fffff880`0dbffb60 00000000`00000000 00001f80`00380200 : nt!KeFlushProcessWriteBuffers+0x88
fffff880`0dbff6f0 fffff800`0338cb6d : 00000000`06992690 fffff800`0337786a 00000000`00000000 00000000`00000286 : nt!ExpQuerySystemInformation+0x13af
fffff880`0dbffaa0 fffff800`0307e493 : 00000000`00000000 fffff880`0dbffb60 ffffffff`fffe7960 000007fe`f1a00b90 : nt!NtQuerySystemInformation+0x4d
fffff880`0dbffae0 00000000`7703dd1a : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiSystemServiceCopyEnd+0x13
00000000`031df4c8 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : 0x7703dd1a


STACK_COMMAND:  kb

FOLLOWUP_NAME:  MachineOwner

MODULE_NAME: GenuineIntel

IMAGE_NAME:  GenuineIntel

DEBUG_FLR_IMAGE_TIMESTAMP:  0

IMAGE_VERSION:

FAILURE_BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

PRIMARY_PROBLEM_CLASS:  X64_0x124_GenuineIntel_PCIEXPRESS

ANALYSIS_SOURCE:  KM

FAILURE_ID_HASH_STRING:  km:x64_0x124_genuineintel_pciexpress

FAILURE_ID_HASH:  {3fd82133-403a-028a-6939-36a9eccfc17d}

Followup:     MachineOwner

Now, I sent the GPU in for service, but apparently they couldn't reproduce the errors. In the meantime I borrowed an HD 5850 from a friend, and in the same games it has no problems, BSODs or freezes. I never had any BSODs or issues of this sort before I got the GTX 980, so I don't really know what to make of it.

I put 2 files up: one taken with the GTX 980 using the tool provided by Sevenforums, and the other taken with my current config (the HD 5850) using the diagnostics tool from here. I added the CPU-Z output, but sadly the RAMMon program won't start after the install for some reason.

My GPU (GTX 980) is still in service, but they plan to send it back to me on Monday, so it would be nice to know by then whether it's really something caused by my system or they just didn't test it well enough.

Thanks.
 


Hi,
I need to re-install the debugger (fresh install of Win 10 1511), but while I do, can you post the make and model of your PSU please?
 


Code:
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

Use !analyze -v to get detailed debugging information.

BugCheck 124, {4, fffffa800f62a038, 0, 0}

Probably caused by : GenuineIntel

Followup: MachineOwner
---------
Hi,
As you're probably aware, Bugcheck 124 usually means a hardware error occurred. The dump file also tells us that the error was down to an 'uncorrectable PCI Express error'.
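
For anyone who wants to pull those two facts out of a dump automatically, here is a minimal Python sketch. It is only an illustration, not part of any debugger tooling: the function names are made up for this example, and the lookup table contains only the one error source that the dumps in this thread actually confirm (Arg1 = 4, "PCI Express Error"). Any other value is reported as unknown rather than guessed.

```python
import re

# Only the mapping for 4 is confirmed by the dumps in this thread
# ("Arg1: 0000000000000004, PCI Express Error"). Other values are
# deliberately left out and reported as unknown.
WHEA_ERROR_SOURCES = {0x4: "PCI Express Error"}

# Matches the summary line, e.g. "BugCheck 124, {4, fffffa800f62a038, 0, 0}"
SUMMARY_RE = re.compile(r"BugCheck\s+([0-9A-Fa-f]+),\s*\{([^}]*)\}")


def parse_bugcheck_summary(line):
    """Return (code, [arg1, arg2, arg3, arg4]) as integers (values are hex)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a BugCheck summary line: %r" % line)
    code = int(match.group(1), 16)
    args = [int(part.strip(), 16) for part in match.group(2).split(",")]
    return code, args


def describe(line):
    """Summarise a BugCheck line in one human-readable sentence."""
    code, args = parse_bugcheck_summary(line)
    if code != 0x124:
        return "bugcheck 0x%X (not WHEA_UNCORRECTABLE_ERROR)" % code
    source = WHEA_ERROR_SOURCES.get(args[0], "unknown error source")
    return "WHEA_UNCORRECTABLE_ERROR (0x124), source: %s, record at 0x%x" % (
        source,
        args[1],
    )


print(describe("BugCheck 124, {4, fffffa800f62a038, 0, 0}"))
```

Note that the second argument is just the address of the WHEA_ERROR_RECORD inside that particular dump, so it differs between crashes even when the cause is the same.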
I noticed that your BIOS is the original version that came with the board. There have been several major updates, some specific to GPUs (one even specifically mentions the Titan, which the 980 isn't far off), as well as stability updates. Use the support page to update the BIOS, although please double-check I have the correct page for your mobo:
Link Removed

You're also missing the Intel chipset drivers, which are pretty important, especially the chipset inf and engine. Use your support page to install them:
Motherboard - The world leader in motherboard design

You're also running a couple of apps well known for causing BSODs in Win 7:
Daemon Tools (lite?)
MSI Afterburner
I'm not saying they are the cause, just consider them if the BSODs continue.

Thanks for supplying the PSU info. I did wonder if your issue was down to low power, but you should be fine with the one mentioned.

If you get the card back and it still BSODs even after the BIOS updates, check your CPU:
Link Removed

I guess all you can do for now is update the BIOS and install the drivers until you get the card back, but once you do, post any new dump files.
 


Thanks for looking in.

First things first: the BIOS is the latest one. When I bought the mobo I had the option for them to update it to the latest version prior to shipping, which they did.
As far as the Intel chipset driver goes, I got the latest update and installed it at the start of this month using the MSI Live Update tool, so I'm not really sure. Does it say that it's not installed?

A thing I forgot to mention is that my CPU is overclocked to 4.2 GHz at 1.121 V, but I also tried without any OC and the system still froze, if that's any help. Right now I'm worried that the overclock might be the cause, but if it is, the timing is really weird because I never had any issues until I installed the new GPU.

Edit: Looks like the CPU diagnostics test passed with my current config.
 


First things first, the BIOS is the latest one and when I bought the mobo I had the option for them to update it to the latest version prior to shipping it, which they did.
Yup, I took another look at the dump file and it was the way it was being displayed.

As far as the intel chipset driver goes, I got the latest update and installed it at the start of this month, using the MSI Live Update tool, so I'm not really sure..does it say that it's not installed ?!
I couldn't see the driver in the Driver list in the dump file but checking the other files I see it is indeed installed.

A thing I forgot to mention is that my CPU is overclocked to 4.2ghz at 1.121v but I also tried without any OC and the system still froze, so if that's any help. Right now, i'm worried that it might be the cause, but if it is, it's some really weird timing because I never had any issues until I installed the new GPU.
Well, in any case we always recommend you disable the overclock until the BSODs have stopped. When you say you already tried disabling the overclock but the machine still froze, that's quite different from a BSOD and could be caused by something else. I would disable the overclock and just run a vanilla setup for now, or at least when the card comes back. See if the freezing still occurs, and if it does we'll have a look-see.
 


Ok, will do. Cheers.
 


See if the freezing still occurs and if does we'll have a look see.

Still getting the same freezes I'm afraid and a new BSOD. No overclocking, same freezes/bsods.

Code:
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error
source that reported the error. Parameter 2 holds the address of the
WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000004, PCI Express Error
Arg2: fffffa800f5d2038, Address of the WHEA_ERROR_RECORD structure.
Arg3: 0000000000000000
Arg4: 0000000000000000

Debugging Details:
------------------


SYSTEM_SKU:  To be filled by O.E.M.

SYSTEM_VERSION:  3.0

BIOS_DATE:  05/29/2015

BASEBOARD_PRODUCT:  X99S GAMING 7 (MS-7885)

BASEBOARD_VERSION:  3.0

BUGCHECK_P1: 4

BUGCHECK_P2: fffffa800f5d2038

BUGCHECK_P3: 0

BUGCHECK_P4: 0

BUGCHECK_STR:  0x124_GenuineIntel

CPU_COUNT: c

CPU_MHZ: ce4

CPU_VENDOR:  GenuineIntel

CPU_FAMILY: 6

CPU_MODEL: 3f

CPU_STEPPING: 2

CUSTOMER_CRASH_COUNT:  1

DEFAULT_BUCKET_ID:  WIN7_DRIVER_FAULT

PROCESS_NAME:  System

CURRENT_IRQL:  a

ANALYSIS_VERSION: 10.0.10240.9 amd64fre

STACK_TEXT:
fffff880`02f9a938 fffff800`0401ba3b : 00000000`00000124 00000000`00000004 fffffa80`0f5d2038 00000000`00000000 : nt!KeBugCheckEx
fffff880`02f9a940 fffff800`041dfe63 : 00000000`00000001 fffffa80`0f5b9b10 00000000`00000000 fffffa80`0f5b91b0 : hal!HalBugCheckSystem+0x1e3
fffff880`02f9a980 fffff880`00fcabcf : fffffa80`00000750 fffffa80`0f5b9b10 00000000`00000000 fffffa80`0f5d1ab0 : nt!WheaReportHwError+0x263
fffff880`02f9a9e0 fffff880`00fca5f6 : 00000000`00000000 fffff880`02f9ab30 fffffa80`0f6bed80 fffff880`02f9aab0 : pci!ExpressRootPortAerInterruptRoutine+0x27f
fffff880`02f9aa40 fffff800`040c205c : fffff880`02f72180 fffff880`02f9ab30 fffffa80`0f6bed80 fffffa80`0eea8001 : pci!ExpressRootPortInterruptRoutine+0x36
fffff880`02f9aab0 fffff800`040bdee2 : fffff880`02f72180 fffff880`00000001 00000000`00000001 fffff800`00000000 : nt!KiInterruptDispatch+0x16c
fffff880`02f9ac40 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KiIdleLoop+0x32


STACK_COMMAND:  kb

FOLLOWUP_NAME:  MachineOwner

MODULE_NAME: GenuineIntel

IMAGE_NAME:  GenuineIntel

DEBUG_FLR_IMAGE_TIMESTAMP:  0

IMAGE_VERSION:

FAILURE_BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

BUCKET_ID:  X64_0x124_GenuineIntel_PCIEXPRESS

PRIMARY_PROBLEM_CLASS:  X64_0x124_GenuineIntel_PCIEXPRESS

ANALYSIS_SOURCE:  KM

FAILURE_ID_HASH_STRING:  km:x64_0x124_genuineintel_pciexpress

FAILURE_ID_HASH:  {3fd82133-403a-028a-6939-36a9eccfc17d}

Followup:     MachineOwner
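
A side note on reading these dumps: the CPU_* fields are printed in hexadecimal, so `c` and `ce4` are not garbage. The short Python sketch below decodes them. It is only an illustration; the function name and the choice of which fields to decode are assumptions made for this example, and the sample text is copied from the dump above.

```python
# Fields that the dump prints as bare hex digits (no 0x prefix).
HEX_FIELDS = ("CPU_COUNT", "CPU_MHZ", "CPU_FAMILY", "CPU_MODEL", "CPU_STEPPING")


def decode_cpu_fields(dump_text):
    """Return {field_name: decimal_value} for the hex CPU_* fields in a dump."""
    decoded = {}
    for raw_line in dump_text.splitlines():
        name, sep, value = raw_line.partition(":")
        name = name.strip()
        if sep and name in HEX_FIELDS:
            decoded[name] = int(value.strip(), 16)
    return decoded


sample = """CPU_COUNT: c
CPU_MHZ: ce4
CPU_FAMILY: 6
CPU_MODEL: 3f
CPU_STEPPING: 2"""

for name, value in decode_cpu_fields(sample).items():
    print(name, "=", value)
```

So the dump reports 12 logical processors and 3300 MHz. That figure is what the debugger recorded and may not reflect any overclock that was applied at the time.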

There was a new BIOS update for my mobo, but that didn't seem to do much to fix this issue. So far I've tested the RAM again with memtest for 8+ hours, and it passed all tests. I also ran the Intel diagnostics tool again with the 980, and it passed as well.

In the meantime I was able to get a 270X GPU from a friend, which I also tested and didn't run into any issues with, same as with the HD 5850. The difference between the AMD GPUs that I tested and the Nvidia one is that the Nvidia one requires a 6-pin and an 8-pin power connector, while the other two only need 2x 6-pin. Could the problem be a faulty PSU? If it was anything else, it would've happened with the other 2 GPUs as well, wouldn't it?
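
To put rough numbers on the connector difference, here is a small Python sketch. The wattages are the PCI Express specification ceilings per power source (75 W from the slot, 75 W per 6-pin, 150 W per 8-pin), not measured draw, and the function name is made up for this example. It only shows how much each connector layout is allowed to pull, which says nothing by itself about whether the PSU is actually at fault.

```python
# Specification ceilings per power source, in watts. These are upper limits
# a card may draw, not what any particular card actually draws.
SLOT_W = 75
SIX_PIN_W = 75
EIGHT_PIN_W = 150


def max_board_power(six_pin=0, eight_pin=0):
    """Upper bound on board power for a given set of auxiliary connectors."""
    return SLOT_W + six_pin * SIX_PIN_W + eight_pin * EIGHT_PIN_W


print("6-pin + 8-pin:", max_board_power(six_pin=1, eight_pin=1), "W")
print("2x 6-pin:", max_board_power(six_pin=2), "W")
```

The 6-pin plus 8-pin layout allows up to 300 W against 225 W for 2x 6-pin, so the 980 can load the PSU's PCIe cables harder than either AMD card could.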

But yeah, so far it still freezes in the first game after startup, then works flawlessly in the same games after the forced restart.

I'm open to any advice or further tests that I can try to identify the problem. Thanks.
 


Hmm... is it possible to try your 980 in your friend's machine? It could be down to a faulty PSU, a faulty GPU or a driver issue.

I'm tempted to think it's a driver-related issue because the card works fine after a forced restart. Uninstall your current GPU driver using this application (you will need to use it in safe mode), as it will also remove any bits of driver left behind by previous installs:
Display Driver Uninstaller (DDU) V15.7.0.1 Released. - Wagnardmobile Forum

Re-install the latest WHQL driver:
Drivers - Download NVIDIA Drivers
 


Well, when I sent the GPU to be serviced, both the shop I bought it from and the warranty place tested it and reported no issues, so unless they are doing it on purpose... That's why I'm starting to think that it might be the PSU.
As far as DDU goes, I've used it every single time I installed different drivers, so that should be fine. I'm running WHQL 359.0, which is their latest.

While searching for related problems, I ran into two theories: one about a faulty architecture in socket 2011 mobos (and potentially 2011-v3), and another pointing at the PSU, where the freeze at startup supposedly happens because the PSU needs to heat up or something... idk.
I'll see if I can't find a different PSU to test it with, but that might take some time.
 


I'll see if I can't find a different PSU to test it with, but that might take some time.
Now that would be really good. I hope you manage to source one soon.
 

