Message boards : Graphics cards (GPUs) : NOELIA_SH2eq Short Work unit(s) Instantly Failing
Author | Message |
---|---|
Noelia_SH2eq work units (argprox1, argasnx1, asnmetx3, argcysx3, alaphex6, argalax3, argvalx1, asaaspx6, asnserx6, argasnx2, argargx7, alailex7) are all failing with Code (98) along with the statement: ERROR: file mdioload.cpp line 162: No CHARMM parameter file specified. Wingmen are generating the same error line. | |
ID: 36996 | Rating: 0 | |
Hi. Well, I've already completed a few of these short tasks without problems on Linux (Ubuntu 14.04). | |
ID: 36997 | Rating: 0 | |
Thanks for sharing your experience. I've found a couple of Linux wingmen who completed Noelia_SH2eq work units, yet failing work units show "process exited with code 199 (0xc7, -57)" and "FATAL : Cuda driver error 35 in file 'swanlibnv2.cpp' in line 446". | |
ID: 36998 | Rating: 0 | |
eXaPower, | |
ID: 36999 | Rating: 0 | |
Hi guys

<core_client_version>7.2.41</core_client_version>
<![CDATA[
<message>
process exited with code 199 (0xc7, -57)
</message>
<stderr_txt>
# SWAN Device 0 :
#    Name : GeForce GTX 780 Ti
#    ECC : Disabled
#    Global mem : 3071MB
#    Capability : 3.5
#    PCI ID : 0000:01:00.0
#    Device clock : 1071MHz
#    Memory clock : 3600MHz
#    Memory width : 384bit
SWAN : FATAL : Cuda driver error 700 in file 'swanlibnv2.cpp' in line 1963.
# SWAN swan_assert -57
</stderr_txt>
]]>

Other GPU cards run fine (GTX 780 ** GTX 760 ** GTX 660Ti ** GTX 750Ti).
All cards are running the 340.24 Nvidia drivers (for Linux).
All OSes have the SWAN_SYNC=0 environment variable set (Gentoo Linux).
No overclock (CPU and GPU).
All system hardware (except the GPUs) is the same (i7-4770, ASUS-Z87-A).
ONLY the 780Ti has too many failed WUs.
Other GPU projects run fine on the GTX780Ti.
PS: with the GTX 780Ti + GTX 760 in the same PC, only the 780Ti's WUs fail. | |
ID: 37437 | Rating: 0 | |
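For anyone comparing several failed tasks, the stderr report quoted above can be boiled down to the exit code and the SWAN error location. The following is only a rough sketch: the function name `summarize_stderr` and the regular expressions are illustrative assumptions based on the message format shown in this thread, not part of BOINC or the ACEMD application.

```python
import re

# Patterns modelled on the messages quoted in this thread (an assumption,
# not an official log format specification).
EXIT_CODE = re.compile(r"process exited with code (?P<code>\d+)")
SWAN_FATAL = re.compile(
    r"Cuda driver error (?P<code>\d+) in file '(?P<file>[^']+)' in line (?P<line>\d+)"
)


def summarize_stderr(text):
    """Extract the exit code and the first SWAN fatal error from a task report.

    Hypothetical helper: fields that are not found stay None.
    """
    summary = {
        "exit_code": None,
        "cuda_error": None,
        "source_file": None,
        "source_line": None,
    }
    exit_match = EXIT_CODE.search(text)
    if exit_match:
        summary["exit_code"] = int(exit_match.group("code"))
    fatal_match = SWAN_FATAL.search(text)
    if fatal_match:
        summary["cuda_error"] = int(fatal_match.group("code"))
        summary["source_file"] = fatal_match.group("file")
        summary["source_line"] = int(fatal_match.group("line"))
    return summary
```

Running it over the reports from several hosts makes it easier to see whether they all fail at the same line of `swanlibnv2.cpp` or with different driver error numbers, as the two reports in this thread (error 35 at line 446, error 700 at line 1963) appear to.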
Uhm... I think the problem is high temperature. | |
ID: 37442 | Rating: 0 | |
Nothing changes.

Sun Jul 27 20:07:58 2014
+------------------------------------------------------+
| NVIDIA-SMI 340.24     Driver Version: 340.24         |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 780 Ti  Off  | 0000:01:00.0     N/A |                  N/A |
| 55%   79C    P0    N/A /  N/A |    570MiB /  3071MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 760     Off  | 0000:04:00.0     N/A |                  N/A |
| 62%   81C    P0    N/A /  N/A |    530MiB /  2047MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Compute processes:                                               GPU Memory |
|  GPU       PID  Process name                                     Usage      |
|=============================================================================|
|    0            Not Supported                                               |
|    1            Not Supported                                               |
+-----------------------------------------------------------------------------+

CPU
coretemp-isa-0000
Adapter: ISA adapter
Physical id 0:  +65.0°C  (high = +80.0°C, crit = +100.0°C)
Core 0:         +60.0°C  (high = +80.0°C, crit = +100.0°C)
Core 1:         +65.0°C  (high = +80.0°C, crit = +100.0°C)
Core 2:         +61.0°C  (high = +80.0°C, crit = +100.0°C)
Core 3:         +61.0°C  (high = +80.0°C, crit = +100.0°C) | |
ID: 37447 | Rating: 0 | |
:O

<name>acemdshort</name>
<max_concurrent>2</max_concurrent>
<gpu_versions>
    <gpu_usage>1</gpu_usage>
    <cpu_usage>0.49</cpu_usage>
</gpu_versions>

With this configuration the GPU temperature is ~54°C.
If I increase CPU_USAGE to >= 0.5 or to 1 (keeping GPU_USAGE=1), the temperature rises above 70°C.
If I decrease CPU_USAGE to < 0.5, the temperature is very low but the WU runs very, very slowly.
With CPU_USAGE >= 0.5 the temperature increases.
Playing with GPU_USAGE changes nothing.

CPU_USAGE=0.49 ** GPU_USAGE=1

Sun Jul 27 21:50:51 2014
+------------------------------------------------------+
| NVIDIA-SMI 340.24     Driver Version: 340.24         |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 780 Ti  Off  | 0000:01:00.0     N/A |                  N/A |
| 40%   55C    P0    N/A /  N/A |    576MiB /  3071MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+
|   1  GeForce GTX 760     Off  | 0000:04:00.0     N/A |                  N/A |
| 50%   59C    P0    N/A /  N/A |    526MiB /  2047MiB |     N/A      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Compute processes:                                               GPU Memory |
|  GPU       PID  Process name                                     Usage      |
|=============================================================================|
|    0            Not Supported                                               |
|    1            Not Supported                                               |
+-----------------------------------------------------------------------------+

:O Any ideas? No problems with the GTX780 .... | |
ID: 37448 | Rating: 0 | |
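To compare temperatures across settings like the ones tried above without reading the full table each time, the readings can be pulled out in CSV form. This is a sketch under assumptions: it relies on `nvidia-smi --query-gpu=index,name,temperature.gpu --format=csv,noheader`, which may not report a temperature on every card and driver combination (many fields show N/A on the GeForce cards in this thread), and the 75°C limit is an arbitrary illustrative threshold, not a value recommended by the project. The function names are hypothetical.

```python
import subprocess

# Query string for nvidia-smi; assumes the installed driver supports --query-gpu.
QUERY = [
    "nvidia-smi",
    "--query-gpu=index,name,temperature.gpu",
    "--format=csv,noheader",
]


def parse_temps(csv_text):
    """Parse 'index, name, temperature' lines into (index, name, temp_c) tuples."""
    readings = []
    for line in csv_text.strip().splitlines():
        index, rest = line.split(",", 1)
        name, temp = rest.rsplit(",", 1)
        # int() raises ValueError if the driver reports a non-numeric value
        # such as "N/A"; that case is deliberately left unhandled in this sketch.
        readings.append((int(index), name.strip(), int(temp)))
    return readings


def hot_gpus(readings, limit_c=75):
    """Return the readings at or above limit_c (75 is an illustrative default)."""
    return [reading for reading in readings if reading[2] >= limit_c]


def read_temps():
    """Run nvidia-smi once and return the parsed readings."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_temps(result.stdout)
```

Calling `hot_gpus(read_temps())` before and after changing `cpu_usage` would show whether a card crosses the chosen limit; with the figures posted above, both cards would be flagged at 79C/81C and neither at 55C/59C.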