Message boards : Graphics cards (GPUs) : What's going on with the credits?
Author | Message |
---|---|
Really strange results. The credits aren't even close to proportional to the time spent on the wus. | |
ID: 4525 | Rating: 0 | rate: / Reply Quote | |
I have seen the same at my WU's. Maybe new credit regulations?? | |
ID: 4529 | Rating: 0 | rate: / Reply Quote | |
They have ever been proportional to time, it's fixed credit for every WU. | |
ID: 4530 | Rating: 0 | rate: / Reply Quote | |
How are they granted?? Before for me it looked always as you get the same credit (3,232.06) for each WU. | |
ID: 4533 | Rating: 0 | rate: / Reply Quote | |
There are at minimum 3 different type of WUs. Look at the run time, it's also different. Your PCs are hidden, so I can' look at your run time for the CPU. Otherwise I would calculate the runtime from the value of | |
ID: 4534 | Rating: 0 | rate: / Reply Quote | |
I have also been discussing this in another thread (Curious Performance Difference) under Number crunching. Don't know if PS3's have the same problem or if it's just Nvidia cards but my lower granted credits started on the 10th of Dec and about the same time all these DCF and no cell processor/no work problems started. | |
ID: 4539 | Rating: 0 | rate: / Reply Quote | |
I think you take the wrong value. Can't believe, that this WU was worn out in 5822.16 seconds, thats only 1:37h. The normal runtime for this type of WU is on a GTX280 5:09h. ____________ | |
ID: 4544 | Rating: 0 | rate: / Reply Quote | |
The credit is fixed on the workunit, but it depends of the workunit. So, there are wu which are longer and shorter and give more or less credits. All the number here are fine. A 280 gets 4 times more credits than a 8800 in average. | |
ID: 4548 | Rating: 0 | rate: / Reply Quote | |
Hey GDF, | |
ID: 4567 | Rating: 0 | rate: / Reply Quote | |
Is there a limitation on the amount of WU issued to one system per day? Yes, 15 per day. It's to protect the project server if a client is bad and call one WU after the other and crash them. You can control this for your box here. In the line Maximum daily WU quota per CPU 15/day you find this value for your X9650. ____________ | |
ID: 4571 | Rating: 0 | rate: / Reply Quote | |
Sweet! | |
ID: 4581 | Rating: 0 | rate: / Reply Quote | |
From the different times per step it seems like the systems differ in complexity and number of steps. And the credits seem granted according to number of steps.
Few long steps.
About twice as many steps, system as complex as in the previous runs. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 4584 | Rating: 0 | rate: / Reply Quote | |
Can't say that I'm fond of this new credit method. It only started on the 11th of December. and in the last 70-80 WU's under half where full credit and my rigs still complete the same number of WU/day with less credit. | |
ID: 4593 | Rating: 0 | rate: / Reply Quote | |
This should not be the case. If it is there is an mistake. | |
ID: 4604 | Rating: 0 | rate: / Reply Quote | |
Could it be, that the entry | |
ID: 4607 | Rating: 0 | rate: / Reply Quote | |
is no longer the real time for elapsed WU but rather the CPU time since the new application is running? CPU time is still different than reported elapsed time. GDF, could it be that the complexitiy of the model is omitted in the credit calculation and only number of steps is taken into account? MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 4608 | Rating: 0 | rate: / Reply Quote | |
For the same elapsed time on the same card they should give similar credits irrespective of the WU. Provide that the estimates are accurate. | |
ID: 4610 | Rating: 0 | rate: / Reply Quote | |
Task 170388 - SM21632-SH2_USPME-1-40-SH2_USPME2020000_0 has a approximated time of 39646.065 s under the 6.55, task 170472 - hg19408-SH2_USPME-1-40-SH2_USPME1400000_0 18391.686 s under the 6.54. Both are running on the same box, no games while running and the same credits. | |
ID: 4614 | Rating: 0 | rate: / Reply Quote | |
This should be do the client version. | |
ID: 4615 | Rating: 0 | rate: / Reply Quote | |
All WUs with 6.55 and running 4+1, no gaming or heavy use | |
ID: 4620 | Rating: 0 | rate: / Reply Quote | |
That's what I've been saying. Nothing lines up for a definitive answer. After checking my GPUgrid project stats my RAC has dropped 4000-5000 points. This started around 11 Dec and that was about the time all these DCF and no work issues started. | |
ID: 4630 | Rating: 0 | rate: / Reply Quote | |
Nothing lines up for a definitive answer. "Nothing" is a bit too harsh. In my case and in Xaaks example the 3232 credit WUs and the 2436 ones do line up. It just the 1888-ones that are off in both cases. I suppose it's a simple error or maybe something doesn't scale as expected with model complexity. After checking my GPUgrid project stats my RAC has dropped 4000-5000 points. This started around 11 Dec and that was about the time all these DCF and no work issues started. The credit change was intended and the change to flop-based scheduling / work distribution was necessary. What was not intended and what is causing the current issues is apparently some bug in the new work distribution. It's unrelated to the credit / model change and they're working on it. It seems that even though WU's need 0.03 CPU's they tend to finish faster with one CPU for both of them. I don't think there have been any dramatic code changes, which reduce CPU usage under windows, so a dedicated CPU core is still the way to guarantee maximum GPU performance. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 4632 | Rating: 0 | rate: / Reply Quote | |
"Nothing" is a bit too harsh. In my case and in Xaaks example the 3232 credit WUs and the 2436 ones do line up. It just the 1888-ones that are off in both cases. Ok, ok. Nothing was a bit too harsh, but here is what I have found. There are two kinds of WU's, 500 step ones and 850 step ones. Any WU that has "GPUTEST" in the name will get full credit 3232.06. (850 steps) Any WU that has "SH2_US_..." in the name will get 2435.94 credit. (500 steps) Whilst any WU with "SH2_USPME_..." gets 1887.96 credit. (500 steps as well) When I total my elapsed time for my 8800GT rig for one days worth of return, I find that my 2 GPU's were working 47.18549 hours, so just under 24hrs each. When I total my 2 280's, I get 27.95813 hours. Just over thirteen hours a piece, which would explain the 4000-5000 RAC drop. My 8800GT's with client 6.4.5 seems to be working ok (knock on wood) and more efficiently that my 280 rig which is running client 6.5.0 and need consistent updating. I will post anythink new if it jumps out at me. Pat | |
ID: 4642 | Rating: 0 | rate: / Reply Quote | |
Yes, the drop in RAC should mainly be due to idle GPUs and to a lesser extent due to WUs with less credit per time. The out-of-work issue has hopefully been fixed by now (see new thread by GDF) ... so let's see how things work out :) | |
ID: 4644 | Rating: 0 | rate: / Reply Quote | |
Lets see in 24hrs or so! | |
ID: 4646 | Rating: 0 | rate: / Reply Quote | |
"Nothing" is a bit too harsh. In my case and in Xaaks example the 3232 credit WUs and the 2436 ones do line up. It just the 1888-ones that are off in both cases. Ok, something else I have noted. Even though 500 step WU should be faster they are not. On average it took 14-15hrs to process a 850 step WU, for 3232 credit, on each of my 8800GT's before Dec 11. Now I get 500 step WU for 2435 credit and they are still taking 11-12hrs a WU. And finally the 1887 credit WU's take a little longer, 12-13hrs. My 280's are the same. 5hrs to do a 500 step 1887 credit WU, 4.5-6hrs to do a 500 step 2435 credit WU and any where from 6.5-11hrs (average about 7.5hrs) to complete a 850 step 3232 credit WU. Observation: In most cases the variance is about 1/5th the time but losing 1/3rd the credit, except in the 1887/500 step WU where the computer processes the same amount as a 2435 credit WU, in time but gets a 1/3rd less credit, again 1887. Again just observations. I don't believe that there is such a difference in elapse time that there should be three sizes of WU's. Two, yes. A full size WU and a short WU, with credits 3232 and 2435 respecively. My two cents. Pat | |
ID: 4681 | Rating: 0 | rate: / Reply Quote | |
Ok, something else I have noted. Actually, it's the same.. just using hours instead of seconds ;) Even though 500 step WU should be faster they are not. On average it took 14-15hrs to process a 850 step WU, for 3232 credit, on each of my 8800GT's before Dec 11. Now I get 500 step WU for 2435 credit and they are still taking 11-12hrs a WU. And finally the 1887 credit WU's take a little longer, 12-13hrs. 11-12 h is not faster than 14-15 h? 14/11*2436 = 3100, which is almost 3232. Depending on the precise values (in seconds) which you choose you can get an even better fit. That's what I meant by "In my case and in Xaaks example the 3232 credit WUs and the 2436 ones do line up." And, yes, the 1888 credit WUs are still off. I don't believe that there is such a difference in elapse time that there should be three sizes of WU's I suspect the 1888-WUs are not supposed to take that long. Maybe under Linux they're faster? MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 4700 | Rating: 0 | rate: / Reply Quote | |
Just a correction in my math: millesecond verses second. | |
ID: 4715 | Rating: 0 | rate: / Reply Quote | |
Correct! | |
ID: 4739 | Rating: 0 | rate: / Reply Quote | |
This should not be the case. If it is there is an mistake. The credit/day is definitely much lower for the 1,887 credit WUs. I've looked at the elapsed times from my machine and many others on the stats pages. The 1887 credit WUs take on average the same amount of time to complete as the 2435 credit WUs. The 2435 credit WUs and the 3232 credit WUs seem to line up pretty well credit/day wise on the machines I've observed. | |
ID: 4779 | Rating: 0 | rate: / Reply Quote | |
Same here. Please discontinue the work units that send out 1888 credits, unless there is a fix for it. These work units take as long as the 24xx units or longer and give less credit! All (3) machines are Windows, 1 64 bit Vista &3 32 bit XP. All 3 boxes reflect the same reults. 1 machine has a 260 GTX 192 (Win XP 32 bit), 1 machine (2) 260 GTX 216 (Win XP 32 bit), & 1 machine (2) 280 GTX Vista 64 bit ( User ID 9485). Thanks for all of your hard work to make this happen!!! | |
ID: 4786 | Rating: 0 | rate: / Reply Quote | |
Hi, | |
ID: 4787 | Rating: 0 | rate: / Reply Quote | |
I have also some with same runtime but credits are lower now?! | |
ID: 4886 | Rating: 0 | rate: / Reply Quote | |
I have also some with same runtime but credits are lower now?! It's probably a 1888 credit WU? If so I'd kindly redirect you to my previous posts in this thread. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 4899 | Rating: 0 | rate: / Reply Quote | |
Change in credits? New acemd WUs taking longer than before and receiving 2932 credits instead of 3232. New credit policy? | |
ID: 5066 | Rating: 0 | rate: / Reply Quote | |
Change in credits? New acemd WUs taking longer than before and receiving 2932 credits instead of 3232. New credit policy? It's the old Bait & Switch Tactic, Suck um in with High Credit & then Cut the leg's out from under them & see how many hang around ... ;) The Credits are still good but nothing like they were, I even added more Box's & my RAC is still dropping so there's been a serious reduction in Credit. When you get 3232 for a 20,000 Sec Wu & then only get 1888 for a 17,000+ Sec Wu something just don't Compute. I hardly ever see a 3232 Wu anymore, their mostly 1888 with a few 2435 Wu's thrown in. Pretty soon you'll have to run a Dozen 280's to even hit 50,000 a day RAC ... 0_o | |
ID: 5070 | Rating: 0 | rate: / Reply Quote | |
The new workunits use a more advanced algorithm for handling the electrostatic called PME. This new algorithm is very important for us. | |
ID: 5071 | Rating: 0 | rate: / Reply Quote | |
The new workunits use a more advanced algorithm for handling the electrostatic called PME. This new algorithm is very important for us. Way cool! So I got my GTX280 just in time .... :) While I have your attention, is there a way without destroying your schemes to tag the tasks with a hint of the number of iterations (not what you call them I know, and LHC is the one with the turns, but, a hint of the time length would be friendly). The reason for the request is so that we can tell if the task is "rogue" and running too long. Then the participant can call for help and make an informed decision if they want to shoot the task or not ... | |
ID: 5073 | Rating: 0 | rate: / Reply Quote | |
Our WUs are all supposed to require approximately half day on a 8800GT (114 shaders), around 6 hours for a 280GTX. For a bigger system, we reduce the number of iterations to meet this target or we increase it for bigger systems. This is not exact of course, allow for up to 50% error. | |
ID: 5074 | Rating: 0 | rate: / Reply Quote | |
One kind of WU, one level of granted credit. No matter what the WU is supposed to do. If it is a more advanced algorithm, then why are the credits lower? "It's based on FLOPS!" That great. I have seen no proof of that. It seem to be based on the number of steps in a WU. All WU's were 850k step WU's before Dec 10th and all got 3232.06 credit. Now we have 4 different WU's with all different credit, and don't reflect the lenght of time it actually took to process a WU. There is no reason that there cannot be 1 type of WU based on the 8800GT efficiency and leave it at that. | |
ID: 5076 | Rating: 0 | rate: / Reply Quote | |
Our WUs are all supposed to require approximately half day on a 8800GT (114 shaders), around 6 hours for a 280GTX. For a bigger system, we reduce the number of iterations to meet this target or we increase it for bigger systems. This is not exact of course, allow for up to 50% error. Hmm, on my 9800 GT the "typical" run time was about 17 hours. I am not complaining understand, but the uncertainty can cause angst for participants. IN that we don't have anything other than the progress bar ( and thank you for that tip) this can be troubling. Understand that projects are not ivory tower isolated. So, when you have a project like Sztaki that changes the task so that the run time jumps to hundreds of hours, with intermittant progress and task hangs, well, it can reflect on how *GPU Grid* is perceived by participants in task uncertainty. And lest you think I am singling out one project, I just stopped Rosetta as a focus project because I had nearly 15 tasks in the space of 24 hours have serious problems. To me, for a project that is classed as "production" this is seriously troubling. I concede that this project with regards to Nvidia processing is in early days and that grants some leeway, all I can say is that *MY* experience is that task execution time is fairly variable from just a few hours to over 90 ... I have covered this in other posts in other threads so will not revisit that here, but, only use that to illustrate the point ... Predictability is highly desirable ... :) | |
ID: 5079 | Rating: 0 | rate: / Reply Quote | |
Well, until there is some kind of perportionality to the credit granting scheme, I'm only going to run this project when I can't get work from others. | |
ID: 5085 | Rating: 0 | rate: / Reply Quote | |
Guys, relax. We all know the 1888 credit WUs are off and GDF just told us the reason and that a possible long-term solution should arrive within the next days. Nognlite wrote: If it is a more advanced algorithm, then why are the credits lower? "It's based on FLOPS!" That great. I have seen no proof of that. It seem to be based on the number of steps in a WU. Advanced means they implemented advanced functionality by means of a new library. It's advanced science, not necessarily more efficient code. Could be that the estimate for the flops done by the library is off or that the new code is less efficient (less flops per time) or that calling functions from that library leads to more overhead. Of course the credits are based on the number of steps in a WU. But there's also the basic complexity of the structure (e.g. how many atoms), which determines the credit value of each step. And there's the estimate of the number of flops per step, depending on the scientific model. This is where the new library gets into the game. Nognlite wrote: There is no reason that there cannot be 1 type of WU based on the 8800GT efficiency and leave it at that. Well, we don't want to solve the same system over and over again, don't we? The application needs to be flexible and thus has to be able to solve systems with varying complexity, which inherently leads to different WUs. And by the way, people with slower cards want smaller WUs.. ;) Paul wrote: all I can say is that *MY* experience is that task execution time is fairly variable from just a few hours to over 90 I've seen fairly constant run times on my box. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 5142 | Rating: 0 | rate: / Reply Quote | |
I've seen fairly constant run times on my box. *MOST* of them now seem to be stable ... but I did get lucky with a few ... which you can see if you rummage my account ... I am NOT complaining, I was just commenting ... :) | |
ID: 5146 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : What's going on with the credits?