Author |
Message |
|
Good afternoon,
My old 2060 checked out early this week and I was able to get a 2060 Super to replace it. Windows recognizes it, and shows it running normal, but BONIC does not recognize it. I have updated the driver, but that has not helped.
Any suggestions on how to fix this?
Thanks!
|
|
|
Keith Myers Send message
Joined: 13 Dec 17 Posts: 1340 Credit: 7,652,966,070 RAC: 13,485,092 Level
Scientific publications
|
Your host shows a Radeon Pro WX 7100 card. This project requires an Nvidia card.
BOINC only recognizes the most capable card by default and sees the Radeon as more powerful than the 2060 Super and won't use it without you making changes to Boinc.
In your cc_config.xml file, you must add this to the file in the Options section.
<use_all_gpus>1</use_all_gpus>
Save the changes to the file and re-read config files in the Manager. Restart Boinc to be sure it picks up the 2060 Super.
You should see it detected as GPU1 in the beginning of the Event Log which you can view from the Manager.
As long as the Nvidia card is detected by Boinc, you should be able to get work from the project. |
|
|
|
Thanks for the reply and help, but I don't think that is the solution. I had been running for years with both the Radeon and 2060 without issue. It has only been since I replaced the 2060, this week, that the problem has occurred.
The beginning of the event log shows these line entries:
app version refers to missing GPU type NVIDIA
Application uses missing NVIDIA GPU
I don't understand how replacing a gpu prevents BONIC from seeing it when Windows sees it.
This is the cc_config.xml file:
<cc_config>
<log_flags>
<file_xfer>1</file_xfer>
<sched_ops>1</sched_ops>
<task>1</task>
<app_msg_receive>0</app_msg_receive>
<app_msg_send>0</app_msg_send>
<async_file_debug>0</async_file_debug>
<benchmark_debug>0</benchmark_debug>
<checkpoint_debug>0</checkpoint_debug>
<coproc_debug>0</coproc_debug>
<cpu_sched>0</cpu_sched>
<cpu_sched_debug>0</cpu_sched_debug>
<cpu_sched_status>0</cpu_sched_status>
<dcf_debug>0</dcf_debug>
<disk_usage_debug>0</disk_usage_debug>
<file_xfer_debug>0</file_xfer_debug>
<gui_rpc_debug>0</gui_rpc_debug>
<heartbeat_debug>0</heartbeat_debug>
<http_debug>0</http_debug>
<http_xfer_debug>0</http_xfer_debug>
<idle_detection_debug>0</idle_detection_debug>
<mem_usage_debug>0</mem_usage_debug>
<network_status_debug>0</network_status_debug>
<notice_debug>0</notice_debug>
<poll_debug>0</poll_debug>
<priority_debug>0</priority_debug>
<proxy_debug>0</proxy_debug>
<rr_simulation>0</rr_simulation>
<rrsim_detail>0</rrsim_detail>
<sched_op_debug>0</sched_op_debug>
<scrsave_debug>0</scrsave_debug>
<slot_debug>0</slot_debug>
<state_debug>0</state_debug>
<statefile_debug>0</statefile_debug>
<suspend_debug>0</suspend_debug>
<task_debug>0</task_debug>
<time_debug>0</time_debug>
<trickle_debug>0</trickle_debug>
<unparsed_xml>0</unparsed_xml>
<work_fetch_debug>0</work_fetch_debug>
</log_flags>
<options>
<abort_jobs_on_exit>0</abort_jobs_on_exit>
<allow_gui_rpc_get>0</allow_gui_rpc_get>
<allow_multiple_clients>0</allow_multiple_clients>
<allow_remote_gui_rpc>0</allow_remote_gui_rpc>
<disallow_attach>0</disallow_attach>
<dont_check_file_sizes>0</dont_check_file_sizes>
<dont_contact_ref_site>0</dont_contact_ref_site>
<lower_client_priority>0</lower_client_priority>
<dont_suspend_nci>0</dont_suspend_nci>
<dont_use_vbox>0</dont_use_vbox>
<dont_use_wsl>0</dont_use_wsl>
<exit_after_finish>0</exit_after_finish>
<exit_before_start>0</exit_before_start>
<exit_when_idle>0</exit_when_idle>
<fetch_minimal_work>0</fetch_minimal_work>
<fetch_on_update>0</fetch_on_update>
<force_auth>default</force_auth>
<http_1_0>0</http_1_0>
<http_transfer_timeout>300</http_transfer_timeout>
<http_transfer_timeout_bps>10</http_transfer_timeout_bps>
<max_event_log_lines>2000</max_event_log_lines>
<max_file_xfers>8</max_file_xfers>
<max_file_xfers_per_project>2</max_file_xfers_per_project>
<max_stderr_file_size>0.000000</max_stderr_file_size>
<max_stdout_file_size>0.000000</max_stdout_file_size>
<max_tasks_reported>0</max_tasks_reported>
<ncpus>-1</ncpus>
<no_alt_platform>0</no_alt_platform>
<use_all_gpus>1</use_all_gpus>
<no_info_fetch>0</no_info_fetch>
<no_opencl>0</no_opencl>
<no_priority_change>0</no_priority_change>
<os_random_only>0</os_random_only>
<process_priority>-1</process_priority>
<process_priority_special>-1</process_priority_special>
<proxy_info>
<socks_server_name></socks_server_name>
<socks_server_port>80</socks_server_port>
<http_server_name></http_server_name>
<http_server_port>80</http_server_port>
<socks5_user_name></socks5_user_name>
<socks5_user_passwd></socks5_user_passwd>
<socks5_remote_dns>0</socks5_remote_dns>
<http_user_name></http_user_name>
<http_user_passwd></http_user_passwd>
<no_proxy></no_proxy>
<no_autodetect>0</no_autodetect>
</proxy_info>
<rec_half_life_days>10.000000</rec_half_life_days>
<report_results_immediately>0</report_results_immediately>
<run_apps_manually>0</run_apps_manually>
<save_stats_days>30</save_stats_days>
<skip_cpu_benchmarks>0</skip_cpu_benchmarks>
<simple_gui_only>0</simple_gui_only>
<start_delay>0.000000</start_delay>
<stderr_head>0</stderr_head>
<suppress_net_info>0</suppress_net_info>
<unsigned_apps_ok>0</unsigned_apps_ok>
<use_all_gpus>0</use_all_gpus>
<use_certs>0</use_certs>
<use_certs_only>0</use_certs_only>
<vbox_window>0</vbox_window>
</options>
</cc_config>
Do you see anything that could be causing and issue?
Thanks! |
|
|
|
Yes - you have
use_all_gpus
set to 1 at line 72, among the 'n' sequence in alphabetical order.
But you have another copy of use_all_gpus at line 103, in its proper alphabetical place. But that's set to zero, and will countermand the earlier setting. That's what's disabling the NVidia card. |
|
|
Keith Myers Send message
Joined: 13 Dec 17 Posts: 1340 Credit: 7,652,966,070 RAC: 13,485,092 Level
Scientific publications
|
Yes, thanks Richard for pointing the problem out. |
|
|
|
I changed line 103 from 0 to 1 and the 2060 Super was still not recognized...
I then updated the gpu to the most recent non-gaming driver (I think NVidia calls them 'Studio' drivers) and BONIC then recognized the gpu.
The 2060 Super has finished two ATMML tasks and has started a third, so everything seems fine.
Based on the errors in my cc_config file, I don't understand how my NVidia+Radeon setup ever worked. I also don't understand the significance of changing from a gaming to a non-gaming driver.
Thank you, Keith and Richard, for helping me troubleshoot this.
[/quote] |
|
|
Keith Myers Send message
Joined: 13 Dec 17 Posts: 1340 Credit: 7,652,966,070 RAC: 13,485,092 Level
Scientific publications
|
Just a guess, that gaming drivers don't include the CUDA compute component while the standard drivers do. I've never gamed so just a guess. |
|
|
|
windows drivers are fairly easily corruptible. the gaming drivers work fine for CUDA/OpenCL/compute, and installing the Studio drivers really isnt significant in a gaming vs studio sense, its the fact that you reinstalled drivers at all. I bet reinstalling the gaming drivers would have solved your problem just the same.
the use_all_gpus flag only comes into play when you have multiple of the same brand of GPU. BOINC is designed to only use the most capable GPU, per GPU category, it wont pick a top one between AMD/Nvidia/Intel if you have one of each.
____________
|
|
|
PascalSend message
Joined: 15 Jul 20 Posts: 77 Credit: 1,560,272,434 RAC: 11,356,070 Level
Scientific publications
|
<cc_config>
<log_flags>
<task>1</task>
<file_xfer>1</file_xfer>
<sched_ops>1</sched_ops>
</log_flags>
<options>
<use_all_gpus>1</use_all_gpus>
<ncpus>-1</ncpus>
<exclude_gpu>
<url>www.gpugrid.net</url>
<device_num>2</device_num>
<type>NVIDIA</type>
</exclude_gpu>
<exclude_gpu>
<url>https://sech.me/boinc/Amicable/</url>
<device_num>0</device_num>
<type>NVIDIA</type>
</exclude_gpu>
<exclude_gpu>
<url>https://sech.me/boinc/Amicable/</url>
<device_num>1</device_num>
<type>NVIDIA</type>
</exclude_gpu>
<max_file_xfers>12</max_file_xfers>
<max_file_xfers_per_project>8</max_file_xfers_per_project>
</options>
</cc_config>
use all gpu sur 1 pas sur 0
J ai 2 rtx 4000sff ada et une nvidia t400 dans mon pc mais linux mint pas windows
useallgpu on 1 step on 0
I have 2 rtx 4000sff ada and an nvidia t400 in my pc but linux mint not windows
____________
|
|
|
PascalSend message
Joined: 15 Jul 20 Posts: 77 Credit: 1,560,272,434 RAC: 11,356,070 Level
Scientific publications
|
est ce que boinc voit vos gpu dans le journal des evenements dans le menu outils?
What does boinc see your GPU in the event log in the tools menu?
____________
|
|
|
PascalSend message
Joined: 15 Jul 20 Posts: 77 Credit: 1,560,272,434 RAC: 11,356,070 Level
Scientific publications
|
verifier que les préférences des projets ont bien gpu nvidia de selectionner.
verify that project preferences have gpu nvidia to select.
____________
|
|
|