Advanced search

Message boards : Number crunching : All WUs failed on GTX590

Author Message
jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27381 - Posted: 22 Nov 2012 | 22:55:17 UTC

SInce yesterday all WUs are failed on my linux (ubuntu) PC :

<core_client_version>7.0.39</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
# Using device 0
SWAN: FATAL : Unable to enumerate devices
acemd.linux64.2352: swanlib_nv.c:390: error: Assertion `0' failed.

SIGABRT: abort called
Stack trace (11 frames):
../../projects/www.gpugrid.net/acemd.linux64.2352(boinc_catch_signal+0x4d)[0x482bed]
/lib/x86_64-linux-gnu/libc.so.6(+0x364a0)[0x7fcdb2a004a0]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x35)[0x7fcdb2a00425]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x17b)[0x7fcdb2a03b8b]
/lib/x86_64-linux-gnu/libc.so.6(+0x2f0ee)[0x7fcdb29f90ee]
/lib/x86_64-linux-gnu/libc.so.6(+0x2f192)[0x7fcdb29f9192]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x491b33]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x491f32]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x407e6a]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xed)[0x7fcdb29eb76d]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x407a19]

Exiting...

</stderr_txt>


?? Any idea ?
____________
Lubuntu 16.04.1 LTS x64

Profile Carlesa25
Avatar
Send message
Joined: 13 Nov 10
Posts: 328
Credit: 72,619,453
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27382 - Posted: 22 Nov 2012 | 23:42:37 UTC - in response to Message 27381.

Hello: What is the core_client 7.0.39 version ....? the 7.0.38 is beta, the current is 7.0.28.

jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27383 - Posted: 23 Nov 2012 | 8:52:20 UTC - in response to Message 27382.

Hello: What is the core_client 7.0.39 version ....? the 7.0.38 is beta, the current is 7.0.28.


Hello, see here:
http://boinc.berkeley.edu/dev/forum_thread.php?id=6698&sort=5

I must say this was running fine with this beta release , it has all begun 2 days ago approx. All WUs are CUDA 31 (GTX590 is Fermi)

I'm considering posting this also to boinc forum.
But any clue is welcome...
____________
Lubuntu 16.04.1 LTS x64

Profile [PUGLIA] kidkidkid3
Avatar
Send message
Joined: 23 Feb 11
Posts: 98
Credit: 1,285,488,396
RAC: 2,089,480
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27384 - Posted: 23 Nov 2012 | 16:29:33 UTC
Last modified: 23 Nov 2012 | 16:48:02 UTC

Hi all.
Also on GTS450 under windows 7 i have the same error :

<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Impossibile trovare il percorso specificato. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
SWAN : FATAL : Cuda driver error 2 in file 'swanlibnv2.cpp' in line 639.
Assertion failed: a, file swanlibnv2.cpp, line 59

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.

</stderr_txt>
]]>
On the same computer another GTS450 is running the same type of WU (NATHAN).
I stopped the long queue request.
I don't modify nothing.
Thanks in advance for your help.
k.
____________
Dreams do not always come true. But not because they are too big or impossible. Why did we stop believing.
(Martin Luther King)

jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27385 - Posted: 23 Nov 2012 | 17:05:39 UTC - in response to Message 27384.

Hi all.
Also on GTS450 under windows 7 i have the same error :

<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
Impossibile trovare il percorso specificato. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
SWAN : FATAL : Cuda driver error 2 in file 'swanlibnv2.cpp' in line 639.
Assertion failed: a, file swanlibnv2.cpp, line 59
...


@kidkidkid3 : This error is not the same.
I suggest you open a new thread with and also in BOINC forum (as I did) :
http://boinc.berkeley.edu/dev/index.php

____________
Lubuntu 16.04.1 LTS x64

jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27392 - Posted: 23 Nov 2012 | 20:13:16 UTC - in response to Message 27381.


<core_client_version>7.0.39</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
# Using device 0
SWAN: FATAL : Unable to enumerate devices
acemd.linux64.2352: swanlib_nv.c:390: error: Assertion `0' failed.

SIGABRT: abort called
Stack trace (11 frames):
../../projects/www.gpugrid.net/acemd.linux64.2352(boinc_catch_signal+0x4d)[0x482bed]
/lib/x86_64-linux-gnu/libc.so.6(+0x364a0)[0x7fcdb2a004a0]
/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x35)[0x7fcdb2a00425]
/lib/x86_64-linux-gnu/libc.so.6(abort+0x17b)[0x7fcdb2a03b8b]
/lib/x86_64-linux-gnu/libc.so.6(+0x2f0ee)[0x7fcdb29f90ee]
/lib/x86_64-linux-gnu/libc.so.6(+0x2f192)[0x7fcdb29f9192]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x491b33]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x491f32]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x407e6a]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xed)[0x7fcdb29eb76d]
../../projects/www.gpugrid.net/acemd.linux64.2352[0x407a19]

Exiting...

</stderr_txt>



After posting on BOINC : http://boinc.berkeley.edu/dev/forum_thread.php?id=8009#46444
, and according to ageless@http://boinc.berkeley.edu/dev/show_user.php?userid=8 who suggest this looks like a buggy app, I came to a question :
Could it be that these failing WUs are for Kepler despite they are labelled as CUDA 31 ?

____________
Lubuntu 16.04.1 LTS x64

jlhal
Send message
Joined: 1 Mar 10
Posts: 147
Credit: 1,077,535,540
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 27393 - Posted: 23 Nov 2012 | 22:10:04 UTC - in response to Message 27392.

There are also failed CUDA42 WUs

Last GOOD WU (CUDA42)
I1R114-NATHAN_RPS1120528-89-166-RND6352_0
http://www.gpugrid.net/result.php?resultid=6078829

First FAILED (CUDA42)
I3R37-NATHAN_RPS1120528-92-166-RND0417_0
process exited with code 255 (0xff, -1)
http://www.gpugrid.net/result.php?resultid=6080250


ALL FAILED CUDA31 with SWAN FATAL code 193...
ALL FAILED CUDA42 with code 255 (0xff, -1)
All with CPU time = GPU time = 0

____________
Lubuntu 16.04.1 LTS x64

Post to thread

Message boards : Number crunching : All WUs failed on GTX590

//