Message boards : Graphics cards (GPUs) : Error compiling program: nvrtc: error: invalid value for --gpu-architecture (-arch)
Author | Message |
---|---|
3090 FE. | |
ID: 57382 | Rating: 0 | rate: / Reply Quote | |
Known issue. The CUDA101 app will fail on Ampere cards. | |
ID: 57384 | Rating: 0 | rate: / Reply Quote | |
Maybe I'm missing some context, but the link shows that issue had been fixed and does not mention my error code specifically. | |
ID: 57386 | Rating: 0 | rate: / Reply Quote | |
The thread does in fact mention exactly the error message title of this thread in the latest posts. ACEMD failed: The CUDA1121 application runs fine on Ampere cards. Only when the scheduler sends a task assigned with the CUDA 101 application do the tasks fail. The issue is that the driver level does not match the CUDA101 application. Simplest solution is to remove the CUDA101 app from the scheduler and force all hosts to use the CUDA1121 application which requires minimum CUDA 11.2 level of drivers. | |
ID: 57388 | Rating: 0 | rate: / Reply Quote | |
We have now changed the scheduler, let's see if now it's better. | |
ID: 57412 | Rating: 0 | rate: / Reply Quote | |
We have now changed the scheduler, let's see if now it's better. Is this a result of the scheduler changes or something else ? The result http://gpugrid.net/result.php?resultid=32646962 failed (see below) to launch CUDA which isn't surprising as the host doesn't show a GPU. Host: http://gpugrid.net/show_host_detail.php?hostid=514156 New version of ACEMD v2.18 (cuda1121) Stderr output <core_client_version>7.16.11</core_client_version> <![CDATA[ <message> process got signal 67</message> <stderr_txt> 14:40:06 (57305): wrapper (7.7.26016): starting 14:40:06 (57305): wrapper (7.7.26016): starting 14:40:06 (57305): wrapper: running /bin/tar (xf conda-pack.tar.bz2) 14:42:47 (57305): /bin/tar exited; CPU time 127.344146 14:42:47 (57305): wrapper: running bin/acemd3 (--boinc --device 0) ACEMD failed: Error invoking kernel: CUDA_ERROR_LAUNCH_FAILED (719) 19:16:23 (57305): bin/acemd3 exited; CPU time 6047.267986 19:16:23 (57305): app exit status: 0x1 19:16:23 (57305): called boinc_finish(195) </stderr_txt> ]]> | |
ID: 57418 | Rating: 0 | rate: / Reply Quote | |
I'm seeing my Linux machines receive the cuda1121 plan class more consistently, but my Windows machines receive cuda101 - I don't think I've ever seen cuda1121 under Windows. | |
ID: 57424 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Error compiling program: nvrtc: error: invalid value for --gpu-architecture (-arch)