Folding@Home keeps crashing

Discussion in 'bit-tech Folding Team' started by maestro0428, 8 Sep 2013.

  1. maestro0428

    maestro0428 Master Modder

    Joined:
    24 Jun 2010
    Posts:
    382
    Likes Received:
    25
    Help! One of my workstations keeps crashing while folding. I am only folding on the CPU (medium power, 7 threads). This has been happening on and off for months and I just can't figure it out. All other testing shows the system is rock solid.

    System specs

    Xeon 1245v2/CoolerMaster Hyper 212 EVO
    32GB Corsair ram
    Samsung 840 Pro 128GB
    ASRock Extreme 4M
    Nvidia Quadro K600

    F@H log
    *********************** Log Started 2013-09-08T16:02:19Z ***********************
    16:02:19:************************* Folding@home Client *************************
    16:02:19: Website: http://folding.stanford.edu/
    16:02:19: Copyright: (c) 2009-2013 Stanford University
    16:02:19: Author: Joseph Coffland <joseph@cauldrondevelopment.com>
    16:02:19: Args: --open-web-control
    16:02:19: Config: C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient/config.xml
    16:02:19:******************************** Build ********************************
    16:02:19: Version: 7.3.6
    16:02:19: Date: Feb 18 2013
    16:02:19: Time: 15:25:17
    16:02:19: SVN Rev: 3923
    16:02:19: Branch: fah/trunk/client
    16:02:19: Compiler: Intel(R) C++ MSVC 1500 mode 1200
    16:02:19: Options: /TP /nologo /EHa /Qdiag-disable:4297,4103,1786,279 /Ox -arch:SSE
    16:02:19: /QaxSSE2,SSE3,SSSE3,SSE4.1,SSE4.2 /Qopenmp /Qrestrict /MT /Qmkl
    16:02:19: Platform: win32 XP
    16:02:19: Bits: 32
    16:02:19: Mode: Release
    16:02:19:******************************* System ********************************
    16:02:19: CPU: Intel(R) Xeon(R) CPU E3-1230 V2 @ 3.30GHz
    16:02:19: CPU ID: GenuineIntel Family 6 Model 58 Stepping 9
    16:02:19: CPUs: 8
    16:02:19: Memory: 15.96GiB
    16:02:19: Free Memory: 14.21GiB
    16:02:19: Threads: WINDOWS_THREADS
    16:02:19: Has Battery: false
    16:02:19: On Battery: false
    16:02:19: UTC offset: -5
    16:02:19: PID: 2324
    16:02:19: CWD: C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient
    16:02:19: OS: Windows 7 Professional
    16:02:19: OS Arch: AMD64
    16:02:19: GPUs: 1
    16:02:19: GPU 0: NVIDIA:3 GK107 [Quadro K600]
    16:02:19: CUDA: 3.0
    16:02:19: CUDA Driver: 5050
    16:02:19:Win32 Service: false
    16:02:19:***********************************************************************
    16:02:19:<config>
    16:02:19: <!-- Folding Slot Configuration -->
    16:02:19: <power v='full'/>
    16:02:19:
    16:02:19: <!-- Network -->
    16:02:19: <proxy v=':8080'/>
    16:02:19:
    16:02:19: <!-- User Information -->
    16:02:19: <user v='Maestro0428'/>
    16:02:19:
    16:02:19: <!-- Folding Slots -->
    16:02:19: <slot id='1' type='CPU'/>
    16:02:19:</config>
    16:02:19:Trying to access database...
    16:02:19:Successfully acquired database lock
    16:02:19:Enabled folding slot 01: READY cpu:8
    16:02:19:WU01:FS01:Starting
    16:02:19:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe" -dir 01 -suffix 01 -version 703 -lifeline 2324 -checkpoint 15 -np 8
    16:02:19:WU01:FS01:Started FahCore on PID 3576
    16:02:19:WU01:FS01:Core PID:3352
    16:02:19:WU01:FS01:FahCore 0xa3 started
    16:02:19:WU01:FS01:0xa3:
    16:02:19:WU01:FS01:0xa3:*------------------------------*
    16:02:19:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
    16:02:19:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
    16:02:19:WU01:FS01:0xa3:
    16:02:19:WU01:FS01:0xa3:preparing to commence simulation
    16:02:19:WU01:FS01:0xa3:- Ensuring status. Please wait.
    16:02:22:3:127.0.0.1:New Web connection
    16:02:28:WU01:FS01:0xa3:- Looking at optimizations...
    16:02:28:WU01:FS01:0xa3:- Working with standard loops on this execution.
    16:02:28:WU01:FS01:0xa3:- Previous termination of core was improper.
    16:02:28:WU01:FS01:0xa3:- Going to use standard loops.
    16:02:28:WU01:FS01:0xa3:- Files status OK
    16:02:29:WU01:FS01:0xa3:- Expanded 3851658 -> 4394468 (decompressed 114.0 percent)
    16:02:29:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3851658 data_size=4394468, decompressed_data_size=4394468 diff=0
    16:02:29:WU01:FS01:0xa3:- Digital signature verified
    16:02:29:WU01:FS01:0xa3:
    16:02:29:WU01:FS01:0xa3:project: 8578 (Run 1, Clone 8, Gen 2)
    16:02:29:WU01:FS01:0xa3:
    16:02:29:WU01:FS01:0xa3:Entering M.D.
    16:02:32:FS01:Shutting core down
    16:02:35:WU01:FS01:0xa3:Mapping NT from 8 to 8
    16:02:35:WU01:FS01:0xa3:Completed 0 out of 500000 steps (0%)
    16:02:39:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
    16:02:39:WU01:FS01:Starting
    16:02:39:WARNING:WU01:FS01:Changed SMP threads from 8 to 7 this can cause some work units to fail
    16:02:39:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe" -dir 01 -suffix 01 -version 703 -lifeline 2324 -checkpoint 15 -np 7
    16:02:39:WU01:FS01:Started FahCore on PID 5844
    16:02:39:WU01:FS01:Core PID:5860
    16:02:39:WU01:FS01:FahCore 0xa3 started
    16:02:39:WU01:FS01:0xa3:
    16:02:39:WU01:FS01:0xa3:*------------------------------*
    16:02:39:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
    16:02:39:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
    16:02:39:WU01:FS01:0xa3:
    16:02:39:WU01:FS01:0xa3:preparing to commence simulation
    16:02:39:WU01:FS01:0xa3:- Looking at optimizations...
    16:02:39:WU01:FS01:0xa3:- Files status OK
    16:02:39:WU01:FS01:0xa3:- Expanded 3851658 -> 4394468 (decompressed 114.0 percent)
    16:02:39:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3851658 data_size=4394468, decompressed_data_size=4394468 diff=0
    16:02:39:WU01:FS01:0xa3:- Digital signature verified
    16:02:39:WU01:FS01:0xa3:
    16:02:39:WU01:FS01:0xa3:project: 8578 (Run 1, Clone 8, Gen 2)
    16:02:39:WU01:FS01:0xa3:
    16:02:39:WU01:FS01:0xa3:Assembly optimizations on if available.
    16:02:39:WU01:FS01:0xa3:Entering M.D.
    16:02:45:WU01:FS01:0xa3:Mapping NT from 7 to 7
    16:02:46:WU01:FS01:0xa3:Completed 0 out of 500000 steps (0%)
    16:13:21:WU01:FS01:0xa3:Completed 5000 out of 500000 steps (1%)
    16:24:03:WU01:FS01:0xa3:Completed 10000 out of 500000 steps (2%)
    16:34:45:WU01:FS01:0xa3:Completed 15000 out of 500000 steps (3%)
    16:45:26:WU01:FS01:0xa3:Completed 20000 out of 500000 steps (4%)
    16:56:07:WU01:FS01:0xa3:Completed 25000 out of 500000 steps (5%)
    17:06:48:WU01:FS01:0xa3:Completed 30000 out of 500000 steps (6%)
    17:17:28:WU01:FS01:0xa3:Completed 35000 out of 500000 steps (7%)
    17:24:26:FS01:Shutting core down
    17:24:34:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)
    17:24:34:WU01:FS01:Starting
    17:24:34:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe" -dir 01 -suffix 01 -version 703 -lifeline 2324 -checkpoint 15 -np 7
    17:24:34:WU01:FS01:Started FahCore on PID 5596
    17:24:34:WU01:FS01:Core PID:6016
    17:24:34:WU01:FS01:FahCore 0xa3 started
    17:24:35:WU01:FS01:0xa3:
    17:24:35:WU01:FS01:0xa3:*------------------------------*
    17:24:35:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
    17:24:35:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
    17:24:35:WU01:FS01:0xa3:
    17:24:35:WU01:FS01:0xa3:preparing to commence simulation
    17:24:35:WU01:FS01:0xa3:- Looking at optimizations...
    17:24:35:WU01:FS01:0xa3:- Files status OK
    17:24:35:WU01:FS01:0xa3:- Expanded 3851658 -> 4394468 (decompressed 114.0 percent)
    17:24:35:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3851658 data_size=4394468, decompressed_data_size=4394468 diff=0
    17:24:35:WU01:FS01:0xa3:- Digital signature verified
    17:24:35:WU01:FS01:0xa3:
    17:24:35:WU01:FS01:0xa3:project: 8578 (Run 1, Clone 8, Gen 2)
    17:24:35:WU01:FS01:0xa3:
    17:24:35:WU01:FS01:0xa3:Assembly optimizations on if available.
    17:24:35:WU01:FS01:0xa3:Entering M.D.
    17:24:41:WU01:FS01:0xa3:Using Gromacs checkpoints
    17:24:41:WU01:FS01:0xa3:Mapping NT from 7 to 7
    17:24:42:WU01:FS01:0xa3:Resuming from checkpoint
    17:24:42:WU01:FS01:0xa3:Verified 01/wudata_01.log
    17:24:42:WU01:FS01:0xa3:Verified 01/wudata_01.trr
    17:24:42:WU01:FS01:0xa3:Verified 01/wudata_01.edr
    17:24:42:WU01:FS01:0xa3:Completed 35140 out of 500000 steps (7%)
    17:25:30:Saving configuration to config.xml
    17:25:30:<config>
    17:25:30: <!-- Network -->
    17:25:30: <proxy v=':8080'/>
    17:25:30:
    17:25:30: <!-- User Information -->
    17:25:30: <user v='Maestro0428'/>
    17:25:30:
    17:25:30: <!-- Folding Slots -->
    17:25:30: <slot id='1' type='CPU'/>
    17:25:30:</config>
    17:27:06:WARNING:WU01:FS01:FahCore returned an unknown error code which probably indicates that it crashed
    17:27:06:WARNING:WU01:FS01:FahCore returned: UNKNOWN_ENUM (-1073741783 = 0xc0000029)
    17:27:06:WU01:FS01:Starting
    17:27:06:WU01:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" "C:/Users/JAllen Labs Mod/AppData/Roaming/FAHClient/cores/www.stanford.edu/~pande/Win32/AMD64/Core_a3.fah/FahCore_a3.exe" -dir 01 -suffix 01 -version 703 -lifeline 2324 -checkpoint 15 -np 7
    17:27:06:WU01:FS01:Started FahCore on PID 1276
    17:27:06:WU01:FS01:Core PID:5224
    17:27:06:WU01:FS01:FahCore 0xa3 started
    17:27:06:WU01:FS01:0xa3:
    17:27:06:WU01:FS01:0xa3:*------------------------------*
    17:27:06:WU01:FS01:0xa3:Folding@Home Gromacs SMP Core
    17:27:06:WU01:FS01:0xa3:Version 2.27 (Dec. 15, 2010)
    17:27:06:WU01:FS01:0xa3:
    17:27:06:WU01:FS01:0xa3:preparing to commence simulation
    17:27:06:WU01:FS01:0xa3:- Ensuring status. Please wait.
    17:27:16:WU01:FS01:0xa3:- Looking at optimizations...
    17:27:16:WU01:FS01:0xa3:- Working with standard loops on this execution.
    17:27:16:WU01:FS01:0xa3:- Previous termination of core was improper.
    17:27:16:WU01:FS01:0xa3:- Files status OK
    17:27:16:WU01:FS01:0xa3:- Expanded 3851658 -> 4394468 (decompressed 114.0 percent)
    17:27:16:WU01:FS01:0xa3:Called DecompressByteArray: compressed_data_size=3851658 data_size=4394468, decompressed_data_size=4394468 diff=0
    17:27:16:WU01:FS01:0xa3:- Digital signature verified
    17:27:16:WU01:FS01:0xa3:
    17:27:16:WU01:FS01:0xa3:project: 8578 (Run 1, Clone 8, Gen 2)
    17:27:16:WU01:FS01:0xa3:
    17:27:16:WU01:FS01:0xa3:Entering M.D.
    17:27:22:WU01:FS01:0xa3:Using Gromacs checkpoints
    17:27:22:WU01:FS01:0xa3:Mapping NT from 7 to 7
    17:27:23:WU01:FS01:0xa3:Resuming from checkpoint
    17:27:23:WU01:FS01:0xa3:Verified 01/wudata_01.log
    17:27:23:WU01:FS01:0xa3:Verified 01/wudata_01.trr
    17:27:23:WU01:FS01:0xa3:Verified 01/wudata_01.edr
    17:27:23:WU01:FS01:0xa3:Completed 35140 out of 500000 steps (7%)

    Windows Crash log
    Problem signature:
    Problem Event Name: APPCRASH
    Application Name: FahCore_a3.exe
    Application Version: 0.0.0.0
    Application Timestamp: 4d4720af
    Fault Module Name: ntdll.dll
    Fault Module Version: 6.1.7601.17725
    Fault Module Timestamp: 4ec49b8f
    Exception Code: c0000029
    Exception Offset: 00090812
    OS Version: 6.1.7601.2.1.0.256.48
    Locale ID: 1033
    Additional Information 1: 0a9e
    Additional Information 2: 0a9e372d3b4ad19135b953a78882e789
    Additional Information 3: 0a9e
    Additional Information 4: 0a9e372d3b4ad19135b953a78882e789


    Any ideas are greatly appreciated.

    -thanks!
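
    For reference, the two numbers on the UNKNOWN_ENUM line in the log above are the same value: the client prints the core's exit code once as a signed 32-bit integer and once as hex, and the hex form matches the "Exception Code: c0000029" in the Windows crash report. A minimal sketch of that conversion follows; the function name to_ntstatus is made up for illustration. As far as I know, 0xC0000029 is listed in Microsoft's NTSTATUS tables as STATUS_INVALID_UNWIND_TARGET, but treat that lookup as something to verify rather than a diagnosis.

```python
def to_ntstatus(exit_code: int) -> str:
    """Reinterpret a signed 32-bit process exit code as unsigned 32-bit hex."""
    # Masking with 0xFFFFFFFF keeps the low 32 bits, which turns a negative
    # two's-complement value into its unsigned equivalent.
    return f"0x{exit_code & 0xFFFFFFFF:08x}"


print(to_ntstatus(-1073741783))  # 0xc0000029 (the crash in the log)
print(to_ntstatus(102))          # 0x00000066 (the INTERRUPTED return)
```

    The hex form is the one to search for, since that is how Windows reports the exception code.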
     
  2. debs3759

    debs3759 Was that a warranty I just broke?

    Joined:
    10 Oct 2011
    Posts:
    1,769
    Likes Received:
    92
    F@H is more sensitive to system instabilities than any of the stress-testing apps. Try increasing the Vcore by a single increment, or reduce the speed of your overclock.
     
  3. maestro0428

    maestro0428 Master Modder

    Joined:
    24 Jun 2010
    Posts:
    382
    Likes Received:
    25
    As it is a Xeon, it's running at stock frequencies.
     
  4. DocJonz

    DocJonz Another CPC refugee .....

    Joined:
    24 Apr 2009
    Posts:
    1,202
    Likes Received:
    97
    INTERRUPTED (102 = 0x66) is unfortunately not a well enough defined error code to pinpoint your problem. Maybe a duff WU? Delete the files and retry. Did you note the project numbers of previous repeat failures?

    (Also note that running F@H with a prime number of threads can cause issues - I note that sometimes you use 7 threads ...)
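
    One way to keep track of project numbers alongside failures is to scan the client log for the "project:" and "FahCore returned:" lines. The following is a rough sketch only: the regular expressions are written against the log lines quoted in this thread, not against any documented log format, and the names summarize and SAMPLE are made up for illustration.

```python
import re

# Patterns based on the log lines quoted in this thread (an assumption,
# not an official format description).
PROJECT = re.compile(r"project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")
RETURNED = re.compile(
    r"^(\d\d:\d\d:\d\d):(?:WARNING:)?WU\d+:FS\d+:"
    r"FahCore returned: (\w+) \((-?\d+) = 0x[0-9a-fA-F]+\)"
)


def summarize(lines):
    """Pair each 'FahCore returned' line with the most recent project line."""
    current = None  # (project, run, clone, gen) last seen in the log
    results = []
    for raw in lines:
        line = raw.strip()
        m = PROJECT.search(line)
        if m:
            current = tuple(int(g) for g in m.groups())
            continue
        m = RETURNED.match(line)
        if m:
            time, name, code = m.groups()
            results.append((time, current, name, int(code)))
    return results


# Three lines copied from the log in the first post.
SAMPLE = [
    "    16:02:29:WU01:FS01:0xa3:project: 8578 (Run 1, Clone 8, Gen 2)",
    "    16:02:39:WU01:FS01:FahCore returned: INTERRUPTED (102 = 0x66)",
    "    17:27:06:WARNING:WU01:FS01:FahCore returned: UNKNOWN_ENUM (-1073741783 = 0xc0000029)",
]

for entry in summarize(SAMPLE):
    print(entry)
```

    To run it against a real log, pass an open file instead of SAMPLE, for example summarize(open("log.txt")) with whatever path the client's log actually lives at.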
     
  5. maestro0428

    maestro0428 Master Modder

    Joined:
    24 Jun 2010
    Posts:
    382
    Likes Received:
    25
    Thanks for the reply. I have not noted the work units (but I will now). My 3770k/660 SLI and 4770k/650 Ti Boost never crash. To be honest, this is the first time I have run into trouble in the last 8 years.

    I have removed/reinstalled (data too) several times. It doesn't seem to be hardware related... I think it's something in Windows 7 or conflicting software.

    I will try running it on 8 threads first.

    Any other ideas?
     
  6. maestro0428

    maestro0428 Master Modder

    Joined:
    24 Jun 2010
    Posts:
    382
    Likes Received:
    25
    I managed to fix it by removing F@H and data (again) and running it on full/8 threads, and it was fine for a couple of weeks. It's been down for two weeks (been too sickly to deal with it). Now it is crashing randomly again! Damn. I have never had this problem before now. I can't get it to crash under any other circumstances. I will post the crash report, but until then, if anyone has any suggestions, it would be much appreciated.
     
