Message boards : Graphics cards (GPUs) : Problem with Boinc device vs Nvidia X Server gpu allocation
Author | Message |
---|---|
Hi everybody !
<options> <report_results_immediately>1</report_results_immediately> <use_all_gpus>1</use_all_gpus> <ignore_nvidia_dev>0</ignore_nvidia_dev> </options> <log_flags> <coproc_debug>1</coproc_debug> <task>1</task> <file_xfer>1</file_xfer> <sched_ops>1</sched_ops> </log_flags> </cc_config>
lun. 27 juil. 2015 20:45:31 CEST | | log flags: file_xfer, sched_ops, task, coproc_debug lun. 27 juil. 2015 20:45:31 CEST | | Libraries: libcurl/7.38.0 OpenSSL/1.0.1f zlib/1.2.8 libidn/1.28 librtmp/2.3 lun. 27 juil. 2015 20:45:31 CEST | | Data directory: /var/lib/boinc-client lun. 27 juil. 2015 20:45:31 CEST | | [coproc] launching child process at /usr/bin/boinc lun. 27 juil. 2015 20:45:31 CEST | | [coproc] relative to directory / lun. 27 juil. 2015 20:45:31 CEST | | [coproc] with data directory /var/lib/boinc-client lun. 27 juil. 2015 20:45:31 CEST | | CUDA: NVIDIA GPU 0 (ignored by config): GeForce GTX TITAN Black (driver version 352.21, CUDA version 7.5, compute capability 3.5, 4096MB, 4009MB available, 6396 GFLOPS peak) lun. 27 juil. 2015 20:45:31 CEST | | CUDA: NVIDIA GPU 1: GeForce GTX TITAN Black (driver version 352.21, CUDA version 7.5, compute capability 3.5, 4096MB, 4009MB available, 6396 GFLOPS peak) lun. 27 juil. 2015 20:45:31 CEST | | OpenCL: NVIDIA GPU 0 (ignored by config): GeForce GTX TITAN Black (driver version 352.21, device version OpenCL 1.2 CUDA, 6144MB, 4009MB available, 6396 GFLOPS peak) lun. 27 juil. 2015 20:45:31 CEST | | OpenCL: NVIDIA GPU 1: GeForce GTX TITAN Black (driver version 352.21, device version OpenCL 1.2 CUDA, 6143MB, 4009MB available, 6396 GFLOPS peak) lun. 27 juil. 2015 20:45:31 CEST | | NVIDIA library reports 2 GPUs lun. 27 juil. 2015 20:45:31 CEST | | No ATI library found lun. 27 juil. 2015 20:45:31 CEST | | Host name: odysseusV lun. 27 juil. 2015 20:45:31 CEST | | Processor: 12 GenuineIntel Intel(R) Core(TM) i7-5820K CPU @ 3.30GHz [Family 6 Model 63 Stepping 2] lun. 27 juil. 2015 20:45:31 CEST | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 fma cx16 xtpr pdcm pcid dca sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm ida arat epb pln pts dtherm tpr_shadow vnmi flexpriority ept vpid fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid xsaveopt lun. 27 juil. 2015 20:45:31 CEST | | OS: Linux: 3.19.0-23-generic lun. 27 juil. 2015 20:45:31 CEST | | Memory: 15.58 GB physical, 31.98 GB virtual lun. 27 juil. 2015 20:45:31 CEST | | Disk: 203.13 GB total, 166.69 GB free lun. 27 juil. 2015 20:45:31 CEST | | Local time is UTC +2 hours lun. 27 juil. 2015 20:45:31 CEST | Milkyway@Home | Found app_config.xml lun. 27 juil. 2015 20:45:31 CEST | | Config: report completed tasks immediately lun. 27 juil. 2015 20:45:31 CEST | | Config: ignoring NVIDIA GPU 0 lun. 27 juil. 2015 20:45:31 CEST | | Config: GUI RPCs allowed from: lun. 27 juil. 2015 20:45:31 CEST | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 624246; resource share 100 lun. 27 juil. 2015 20:45:31 CEST | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 3343766; resource share 100 lun. 27 juil. 2015 20:45:31 CEST | GPUGRID | URL http://www.gpugrid.net/; Computer ID 226017; resource share 100 lun. 27 juil. 2015 20:45:31 CEST | World Community Grid | General prefs: from World Community Grid (last modified 24-Feb-2015 22:06:56) lun. 27 juil. 2015 20:45:31 CEST | World Community Grid | Computer location: home lun. 27 juil. 2015 20:45:31 CEST | | General prefs: using separate prefs for home lun. 27 juil. 2015 20:45:31 CEST | | Reading preferences override file lun. 27 juil. 2015 20:45:31 CEST | | Preferences: lun. 27 juil. 2015 20:45:31 CEST | | max memory usage when active: 11962.05MB lun. 27 juil. 2015 20:45:31 CEST | | max memory usage when idle: 11962.05MB lun. 27 juil. 2015 20:45:31 CEST | | max disk usage: 162.50GB lun. 27 juil. 2015 20:45:31 CEST | | (to change preferences, visit a project web site or select Preferences in the Manager) lun. 27 juil. 2015 20:45:31 CEST | | gui_rpc_auth.cfg is empty - no GUI RPC password protection lun. 27 juil. 2015 20:45:31 CEST | | Not using a proxy lun. 27 juil. 2015 20:45:32 CEST | GPUGRID | [coproc] Assigning NVIDIA instance 0 to e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:46:32 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:46:32 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:47:33 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:47:33 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:48:33 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:48:33 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:49:33 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:49:33 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:50:34 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:50:34 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:51:34 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:51:34 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:51:53 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:51:53 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:52:54 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:52:54 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:53:54 CEST | GPUGRID | [coproc] NVIDIA instance 0; 1.000000 pending for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0 lun. 27 juil. 2015 20:53:54 CEST | GPUGRID | [coproc] NVIDIA instance 0: confirming 1.000000 instance for e1s2_1-GERARD_FXCXCL12_LIG_501831-0-1-RND4749_0
boinc 4626 4590 13 20:45 ? 00:01:32 ../../projects/www.gpugrid.net/acemd.846-65.bin --device 1
| |
ID: 41565 | Rating: 0 | rate: / Reply Quote | |
Try using this line, because you have 2 gpus. | |
ID: 41566 | Rating: 0 | rate: / Reply Quote | |
Try using this line, because you have 2 gpus. That's wrong. The "use_all_gpus" variable is a boolean, so its value could be 0 or 1. See BOINC manager's client configuration wiki. | |
ID: 41568 | Rating: 0 | rate: / Reply Quote | |
Your BOINC log shows that it's ignoring GPU 0 according to the cc_config: lun. 27 juil. 2015 20:45:31 CEST | | CUDA: NVIDIA GPU 0 (ignored by config): GeForce GTX TITAN Black (driver version 352.21, CUDA version 7.5, compute capability 3.5, 4096MB, 4009MB available, 6396 GFLOPS peak) This line also confirms that this task is started on GPU 1:
So perhaps the NVidia X server have different ideas about the GPU numbering than the BOINC manager. I'm not a Linux expert, so I'm just guessing, but you should try to disable the other GPU in cc_config (e.g. "<ignore_nvidia_dev>1</ignore_nvidia_dev>", and then check NVidia X server again. | |
ID: 41569 | Rating: 0 | rate: / Reply Quote | |
You're right on that. It can be any number, and it works the same. My mistake! | |
ID: 41571 | Rating: 0 | rate: / Reply Quote | |
So perhaps the NVidia X server have different ideas about the GPU numbering than the BOINC manager. I'm not a Linux expert, so I'm just guessing, but you should try to disable the other GPU in cc_config (e.g. "<ignore_nvidia_dev>1</ignore_nvidia_dev>", and then check NVidia X server again. Thanks for your answers ;-) Nvidia driver enumerates GPUs in the order found on PCI bus that is : PCIE16_1 -> GPU-0 : Boinc device 0 ,the one which CRUNCHING and should be ignored according to config PCI16_2 -> not used PCIE16_3 -> GPU-1 : Boinc device 1, the one that should be crunching and does NOTHING ! If I ignore device 1 , Boinc says using device 0, that's ok, as in the first case Boinc is in phase with itself BUT, NVidia X server shows that it is GPU-1 which is CRUNCHING ! So to REALLY ignore first(0) GPU/device I must ignore number 1 and vice-versa !! ____________ Lubuntu 16.04.1 LTS x64 | |
ID: 41572 | Rating: 0 | rate: / Reply Quote | |
jihal | |
ID: 41583 | Rating: 0 | rate: / Reply Quote | |
jihal Hi captainjack ! Very interesting if some other crunchers could confirm this . I can't see which of your computers is concerned Please can you give us the model of your MB and Linux OS and Boinc version Regards ____________ Lubuntu 16.04.1 LTS x64 | |
ID: 41584 | Rating: 0 | rate: / Reply Quote | |
jihal, | |
ID: 41585 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Problem with Boinc device vs Nvidia X Server gpu allocation