Advanced search

Message boards : Graphics cards (GPUs) : First check of the 6.3.14

Author Message
Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2829 - Posted: 6 Oct 2008 | 21:40:45 UTC

Looks like the simplest calculation of the number of cores in a Quad CPU is a hard work. After starting the 6.3.14 only 3 tasks are running. So I had to create a new cc_config.xml to tell the program the real number of cores in my CPU. After the Entry ncpus=4 the program is running fine.
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2830 - Posted: 6 Oct 2008 | 21:53:27 UTC
Last modified: 6 Oct 2008 | 21:54:40 UTC

Second comment to the 6.3.14:

Finally the 6.3.14 don't eat memory when the BOINC-Manager is running under Vista 64 bit :)

Please change the entry

Aktiv (1.00 CPUs, 1 CUDA) to Aktiv (1.00 Cores, 1 CUDA)

I think, you mean to control the feeding of the GPU is one core needed and not the whole CPU wirh all 4 cores ;)

Question:
Is the number of 2 cores hard coded in this version? I'm wondering, while 2 CPU tasks und one GPU tasks are running on a Quad if no ncpus-entry is available with the real number of cores in Windows.

A team mate told me, on Linux the right number of tasks is started. I will ask back for a cc_config.xml.

EDIT: He has running a cc_config.xml with ncpus=5, so I think, the Linux-Version has the same problem.
____________

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2831 - Posted: 6 Oct 2008 | 22:10:02 UTC

I've been running x2 hosts for about 3-4 hours. I just installed on 3rd host a few minutes ago. All are Windows XP x86

I did not want to post immediately until some testing had been done. However I have not had any problem so far and been in contact with the developer (Anderson) all day about it. I observed proper task switching when a CPU task ended another was started, so I was always left with max running tasks.

The problem with reading the XML preferences file has been fixed and the proper venue is now used. XML is parsed correctly again.

It seems to be good so far, It starts the proper number of tasks on initial start up, in my case without using ncpus setting I should have 2 and thats what I got. I did not have to do any suspending or other tinkering to get it to work. 1 CPU only and 1 CPU/CUDA task. You will note immediately CUDA tasks show as "Running (1.00 CPUs, 1 CUDA)". CPU only tasks just show as "Running". This amount of CPU used can be set by the project, currently 1.00.

The behavior is suppose to be:

Anderson wrote:
The intended behavior (which I think we've finally achieved)
is that the client runs tasks until all the GPUs and all the CPUs
are utilized.

E.g. if it has a (1 GPU + .75 CPU) job and several 1 CPU jobs,
and your machine has 1 GPU and 2 CPUs,
it will run the GPU job and 2 of the 1-CPU jobs
(so that CPU usage is 2.75, i.e. there will be contention
for the CPUs, but that's better than leaving one of them .25 idle).

BTW, I changed things so that if a job uses a GPU and < 1 CPU,
it gets run at normal process priority.
Hence in this case the GPU job will get .75 CPU
(so that the GPU runs at full speed)
and the two CPU jobs will share the remaining 1.25 CPU.

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2832 - Posted: 6 Oct 2008 | 22:20:43 UTC - in response to Message 2830.

Second comment to the 6.3.14:

Finally the 6.3.14 don't eat memory when the BOINC-Manager is running under Vista 64 bit :)

Please change the entry

Aktiv (1.00 CPUs, 1 CUDA) to Aktiv (1.00 Cores, 1 CUDA)

I think, you mean to control the feeding of the GPU is one core needed and not the whole CPU wirh all 4 cores ;)

Question:
Is the number of 2 cores hard coded in this version? I'm wondering, while 2 CPU tasks und one GPU tasks are running on a Quad if no ncpus-entry is available with the real number of cores in Windows.

A team mate told me, on Linux the right number of tasks is started. I will ask back for a cc_config.xml.

EDIT: He has running a cc_config.xml with ncpus=5, so I think, the Linux-Version has the same problem.

No linux version of 6.3.14 has been made yet. There hasn't been a linux version made since 6.3.10.

The number of cores is not hard coded in any version. You can limit it with your preferences, cc_config.xml or global_prefs_override.xml files.

Check your preferences and any files you might have modified and make sure they are set to use 100% of CPU's and correct values for such things as ncpus.

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2833 - Posted: 6 Oct 2008 | 22:27:38 UTC

I've installed the 6.3.14 on a second PC, also on Vista 64 bit. There are 4 tasks running. This PC has a 8800GT. Don't know, why the other only 3 tasks has running without a cc_config.xml at the first start.

I've deleted the entry in the cc_config of the other PC with the GTX280 and after the restart now also 4 tasks are running. The CPU is set to 100% on both PCs.
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2834 - Posted: 6 Oct 2008 | 22:30:45 UTC - in response to Message 2832.
Last modified: 6 Oct 2008 | 22:30:58 UTC


No linux version of 6.3.14 has been made yet. There hasn't been a linux version made since 6.3.10.


If you have the sources and the changes and a compiler ... ;)
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2835 - Posted: 6 Oct 2008 | 23:07:01 UTC

Another remark to the 6.3.14:

The temperature of my GTX280 is dropped from 79°C to 72°C. In the past this only happens with ncpus=5 and a longer running time per WU. I'm really agog for the new time per step, the last with the 6.3.10 was 34.438 ms.

____________

Sherman H.
Send message
Joined: 28 Sep 08
Posts: 27
Credit: 6,201,632,872
RAC: 2
Level
Tyr
Scientific publications
watwatwatwat
Message 2837 - Posted: 7 Oct 2008 | 4:12:43 UTC - in response to Message 2831.

You will note immediately CUDA tasks show as "Running (1.00 CPUs, 1 CUDA)". CPU only tasks just show as "Running". This amount of CPU used can be set by the project, currently 1.00.


Do you know when/if the CPU amount will be changed to lower than 1, for somebody, such as myself, who prefers to maximize the number of CPU tasks instead of having one tied up in each GPU task?

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 2840 - Posted: 7 Oct 2008 | 8:04:13 UTC - in response to Message 2837.

Once a new BOINC release proves to work we will reduce the cpu usage to less than one. We will also need to upload the new Windows and Linux application. These things are better separated by an interval to detect problems.
It will be up to you to decide if use the CPU it or not (It depends on the speed of your GPU).

GDF

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2841 - Posted: 7 Oct 2008 | 9:42:36 UTC

The first WU only running under 6.3.14 is ready:

http://www.gpugrid.net/result.php?resultid=75290

# Time per step: 35.483 ms
# Approximate elapsed time for entire WU: 30160.337 s

Looks good!
____________

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Scientific publications
watwatwatwatwat
Message 2844 - Posted: 7 Oct 2008 | 13:42:31 UTC - in response to Message 2841.

This is as before, isn't it?

gdf

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2847 - Posted: 7 Oct 2008 | 14:26:30 UTC

...... where can I get 6.3.14?

Thanks!

Profile DoctorNow
Avatar
Send message
Joined: 18 Aug 07
Posts: 83
Credit: 122,995,082
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2848 - Posted: 7 Oct 2008 | 14:41:19 UTC - in response to Message 2847.

...... where can I get 6.3.14?

Try a look here:
http://boinc.berkeley.edu/dl/?C=M;O=D
____________
Member of BOINC@Heidelberg and ATA!

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwatwatwat
Message 2850 - Posted: 7 Oct 2008 | 15:31:32 UTC - in response to Message 2844.
Last modified: 7 Oct 2008 | 15:55:42 UTC

This is as before, isn't it?

gdf


Nearly, a little bit slower ...

6.3.14 - # Time per step: 35.483 ms

... and the last 3 WUs without a ncpu-entry with the 6.3.10
6.3.10 - # Time per step: 34.438 ms
6.3.10 - # Time per step: 34.608 ms
6.3.10 - # Time per step: 34.547 ms
____________

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2852 - Posted: 7 Oct 2008 | 16:03:10 UTC - in response to Message 2848.

.....Got it - thanks.

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 237,700,010
RAC: 4,775,275
Level
Leu
Scientific publications
watwatwatwatwatwatwatwat
Message 2855 - Posted: 7 Oct 2008 | 19:23:13 UTC
Last modified: 7 Oct 2008 | 19:25:31 UTC

My first result crunched with 6.3.14 is uploaded - http://www.ps3grid.net/result.php?resultid=75292

It took about 43.6ms/step which is about the same speed than with previous BOINC versions (was always between 43 -45 ms/step).
____________

pixelicious.at - my little photoblog

Thamir Ghaslan
Send message
Joined: 26 Aug 08
Posts: 55
Credit: 1,475,857
RAC: 0
Level
Ala
Scientific publications
watwatwat
Message 2857 - Posted: 7 Oct 2008 | 20:55:40 UTC

Worked like a charm, 4 tasks, 3 cpu and 1 gpu.

Did the cc_config.xml to 5 cpus and now its 4 cpu, 1 gpu.

Will have to wait a day or two to average the performance.

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,428,277,383
RAC: 53,544,867
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2870 - Posted: 8 Oct 2008 | 0:00:24 UTC
Last modified: 8 Oct 2008 | 0:16:58 UTC

I installed the new 6.3.14 this morning and it immediately Trashed the 4 PS3 Wu on the Computer and has Trashed every one since then. Going back to 6.3.10 hasn't helped either as it now Trashes any PS3 Wu I download ... All I get is Computation Errors with the following message:

<core_client_version>6.3.10</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
Cuda error in file 'deviceQuery.cu' in line 59 : out of memory.

<core_client_version>6.3.14</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# Using CUDA device 0
Cuda error in file 'deviceQuery.cu' in line 59 : out of memory.

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2873 - Posted: 8 Oct 2008 | 6:32:28 UTC - in response to Message 2870.

out of memory


Did you reboot your machine?

MrS
____________
Scanning for our furry friends since Jan 2002

Profile ayQue
Send message
Joined: 6 Sep 08
Posts: 18
Credit: 806,771
RAC: 0
Level
Gly
Scientific publications
watwatwatwat
Message 2876 - Posted: 8 Oct 2008 | 7:30:40 UTC
Last modified: 8 Oct 2008 | 7:53:24 UTC

...did install v6.3.14 yesterday - everything is running fine ... :-) speed is the same (as I supposed). On a 32 bit Windows XP prof. SP 3

nice job!

Profile [SETI.USA]Tank_Master
Avatar
Send message
Joined: 8 Jul 07
Posts: 85
Credit: 67,463,387
RAC: 0
Level
Thr
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2877 - Posted: 8 Oct 2008 | 7:44:31 UTC

so far I haven't seen any issues with 64bit 6.3.14 on server 2008 x64

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,428,277,383
RAC: 53,544,867
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2879 - Posted: 8 Oct 2008 | 8:54:58 UTC - in response to Message 2873.
Last modified: 8 Oct 2008 | 9:22:45 UTC

out of memory


Did you reboot your machine?

MrS


No , but I did think of trying that & will this morning to see what happens. The Box was running a PS3 Wu & some other Projects just fine until I installed the v6.3.14 ...

PS: Rebooted but I have used up my Quota of 4 today so the Server won't send me any more Wu's until later ...

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,428,277,383
RAC: 53,544,867
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2892 - Posted: 8 Oct 2008 | 22:46:05 UTC

I managed to get 2 more Wu's again & 1 of them seems to be running ok now after re-booting this morning. I also increased the Page file on the System to see if that helps.

I'd like to try 6.3.14 again, should I un-install the 6.3.10 first or doesn't it matter ... ??? Yesterday I just installed the 6.3.14 over the 6.3.10 & maybe thats where it got screwed up, don't know for sure though ...

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2893 - Posted: 8 Oct 2008 | 23:51:06 UTC - in response to Message 2892.

I managed to get 2 more Wu's again & 1 of them seems to be running ok now after re-booting this morning. I also increased the Page file on the System to see if that helps.

I'd like to try 6.3.14 again, should I un-install the 6.3.10 first or doesn't it matter ... ??? Yesterday I just installed the 6.3.14 over the 6.3.10 & maybe thats where it got screwed up, don't know for sure though ...

It should not matter. I install every version over the old. I installed on 3 computer 6.3.14 over 6.3.10 with no problem. Usually you would only need to uninstall if you go backwards a major version like from 6 to 5. I also make a quick backup first by shutting down boinc, and using windows to compress a copy of the folders, for both data and program, I then rename them with the version. I got quite a few now and they also serve as a log of update dates. Then if there is a problem I can restore quite easily. I haven't needed any copies (knock on wood), but with all the testing I do it helps to know they are there jsut in case, and actually I needed to track back some info in client_state for the last 3 installs and those backups were a big help in answering a question, otherwise I would have had to guess.

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,428,277,383
RAC: 53,544,867
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2896 - Posted: 9 Oct 2008 | 1:30:37 UTC

Okay, thanks Keith, I'll try it tommorrow, want to let it run over night to make sure there are no other problems before Up-Grading to 6.3.14 again.

naja002
Avatar
Send message
Joined: 25 Sep 08
Posts: 111
Credit: 10,352,599
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 2902 - Posted: 9 Oct 2008 | 5:47:00 UTC
Last modified: 9 Oct 2008 | 5:56:31 UTC

Anybody tried multi-gpu on 6.3.14 yet? I'm having issues on my 2x GPU rig. It runs F@H GPU2 fine. My other 2 rigs are single GPU and I'm using the cc_config file and not having any issues.

On the multi-GPU rig:

Vista x64
2x XFX 8800GS

Same effect whether I use the cc_config file or not. I start Boinc (I run WCG and GpuGrid) and it will start running 4 of the WCG WUs and 2 of the GpuGrid WUs, but after a few minutes....sometimes 10-15mins....it will kick both of the GpuGrid WUs into "Waiting to run" mode. The 4 WCG WUs will continue on...and it will not start any other WUs to replace the GpuGrid WUs. For some reason its kicking them out. This doesn't happen on either of my single GPU rigs. And as state before: this is with or w/o the cc_config file, and it doesn't seem to matter what the ncpus is set to when the file is used. Yes, I rebooted. Desktop is extended via vga-dvi dummy. I tried them OCed and at stock.

Driver should be 177.92

All 3 rigs are running Vista x64.

Kinda lost atm...just wondering if anyone else has tried multi-gpu with this version.....?

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2903 - Posted: 9 Oct 2008 | 6:06:39 UTC - in response to Message 2902.

Is your ressource share >33%? I'm not sure if it matters for GPUs, but it may confuse BOINc if it's too low. And something else: the BOINC scheduler sometimes gets confused by the debt. I suggest the following: stop BOINC, open the client_state.xml in the editor, search for "debt" and you'll find <long_term_debt> xxxxx </long_term_debt> and <short_term_debt> yyyyy </short_term_debt> for every project. Set them all to 0 and see if it helps.

MrS
____________
Scanning for our furry friends since Jan 2002

naja002
Avatar
Send message
Joined: 25 Sep 08
Posts: 111
Credit: 10,352,599
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 2904 - Posted: 9 Oct 2008 | 6:13:24 UTC - in response to Message 2903.

Ok, I am going to tinker with it tomorrow...headed to bed in a few minutes. But first a question:

How do I set/change the resource share? Is that the "On multiprocessor systems, use at most" which is defaulted to 100%? If not, then I'm lost.

On the projects tab it shows 50% resource share for each project.

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,428,277,383
RAC: 53,544,867
Level
Arg
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwat
Message 2907 - Posted: 9 Oct 2008 | 10:33:56 UTC

The "Resource share and graphics -- PS3GRID preferences" settings in your Account is where you set your Resource Share for each Project.

naja002
Avatar
Send message
Joined: 25 Sep 08
Posts: 111
Credit: 10,352,599
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 2908 - Posted: 9 Oct 2008 | 14:08:29 UTC - in response to Message 2907.
Last modified: 9 Oct 2008 | 14:13:08 UTC

The "Resource share and graphics -- PS3GRID preferences" settings in your Account is where you set your Resource Share for each Project.



Ok, found it--Thanx! I don't want to derail this thread too much...if necessary I'll start a new thread:

All 3 of my rigs are set under the "default" profile in Boinc/WCG/and, I guess, GpuGrid. The 2 1xGPU rigs are running fine, so I really don't want to mess with them. Don't really want to put the PCs in different profiles and not sure what headaches might arise from that. Is there a way to change the resource share on just the 1 2xGPU rig?


Just wanted to add that, other than this single issue with the multi-gpu, this 6.3.14 seems like a much better version than .11, .12, .13......

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2910 - Posted: 9 Oct 2008 | 16:46:18 UTC - in response to Message 2896.

Okay, thanks Keith, I'll try it tommorrow, want to let it run over night to make sure there are no other problems before Up-Grading to 6.3.14 again.

I forgot to mention, there is no warranty with my statement. Good luck.

Profile Krunchin-Keith [USA]
Avatar
Send message
Joined: 17 May 07
Posts: 512
Credit: 111,288,061
RAC: 0
Level
Cys
Scientific publications
watwatwatwatwatwatwatwatwatwatwatwatwatwatwat
Message 2912 - Posted: 9 Oct 2008 | 16:51:17 UTC - in response to Message 2908.

The "Resource share and graphics -- PS3GRID preferences" settings in your Account is where you set your Resource Share for each Project.



Ok, found it--Thanx! I don't want to derail this thread too much...if necessary I'll start a new thread:

All 3 of my rigs are set under the "default" profile in Boinc/WCG/and, I guess, GpuGrid. The 2 1xGPU rigs are running fine, so I really don't want to mess with them. Don't really want to put the PCs in different profiles and not sure what headaches might arise from that. Is there a way to change the resource share on just the 1 2xGPU rig?


Just wanted to add that, other than this single issue with the multi-gpu, this 6.3.14 seems like a much better version than .11, .12, .13......

Yes let's start a new thread for this as it pertains to all boinc versions and projects, not just this one.

naja002
Avatar
Send message
Joined: 25 Sep 08
Posts: 111
Credit: 10,352,599
RAC: 0
Level
Pro
Scientific publications
watwatwatwatwatwatwatwat
Message 2913 - Posted: 9 Oct 2008 | 16:55:39 UTC - in response to Message 2912.


Yes let's start a new thread for this as it pertains to all boinc versions and projects, not just this one.




Already done.....its here:


Need 6.3.14 Multi-GPU Help.....



Thanx!




.

Post to thread

Message boards : Graphics cards (GPUs) : First check of the 6.3.14

//