Message boards : Graphics cards (GPUs) : Credit where Credit is due ...
Author | Message |
---|---|
Here goes Paul, tilting at windmills again ... | |
ID: 16619 | Rating: 0 | rate: / Reply Quote | |
Sorry Paul - is the "other" project which is paying for failures BOINC-based? | |
ID: 16620 | Rating: 0 | rate: / Reply Quote | |
Sorry Paul - is the "other" project which is paying for failures BOINC-based? Well CPDN does and its BOINC based, but it depends on trickles to get there. Given GPUgrid doesn't use trickles that might present an issue (or an opportunity - you could have larger wu using trickles). ____________ BOINC blog | |
ID: 16624 | Rating: 0 | rate: / Reply Quote | |
Sorry Paul - is the "other" project which is paying for failures BOINC-based? As MarkJ noted one of the projects is CPDN... I was also thinking of WCG where the new sub-project DDDT2 has some molecules that when modeled "blow-up" and the application reports a failure. The just went from a 5 to 3 failure test (3 replications vice 5) and they pay because the failures of the application to solve essentially demonstrates that that line of research is a dead end ... in this case they do a post award after all the failed results are in ... When they have a suite of "successful" tasks they create the next generation of tasks which is larger and builds on the ones that "worked" Back to CPDN, their tasks run for hundreds of hours of course which is far more than GPU Grid and WCG, but the principle is the same ... if the task has a likelihood of failure, do not punish the willing for a failure of the project ... | |
ID: 16626 | Rating: 0 | rate: / Reply Quote | |
That would mean something along the lines of | |
ID: 16635 | Rating: 0 | rate: / Reply Quote | |
That would mean something along the lines of I will grant that there is that small class of users that are crass enough that they would invest the effort... the problem is that instead of looking for those people we choose instead to punish (in effect) those that are sincerely trying to help, and through no fault of their own, can't ... Now, on a more practical matter, few tasks fail like this on any of the projects, on CPDN you have to have part of the work done and return the trickles, on WCG I am not sure, but I think they have other sanity checks as well ... not sure what about here if the failed task returns partly completed files or not ... but the point is that the person would have to get one of the rares, check to find it is failing and then forge a satisfactory result... quite a feat ... easier just to run the damn thing I would think ... Besides, he/she would have to be the last of the loop to know that it was worthwhile to attempt to forge the failure ... Again, quite a feat of arms as it were ... {edit-add} BTW, I will point out that the 5-12 hour run times on a GPU is equivalent to 300-1,200 hours on the CPU, well above CPDN's run time equivalency at the current stage of their models ... so, we are in the same general vicinity of total calculations done in a model ... the GPUs just do is so much faster that we lose sight of the massive amount of work that is actually being accomplished... as I have noted elsewhere and elsewhen, an MW task takes hours to run on a CPU core, but I am running them off in less than 2 minutes on my GPUs ... | |
ID: 16638 | Rating: 0 | rate: / Reply Quote | |
Cheating is a concern, in fact, but also random errors due to overclocking/driver issues are IMHO. Up to now, the vast majority of the errors that we see are due to misconfigured hosts rather than WU mistakes. Incidentally, that's the reason why we can't reliably (ie automatically) figure out "erroneous" wu right away. | |
ID: 16664 | Rating: 0 | rate: / Reply Quote | |
How about awarding credits for failures only after a WU reaches the "too many failures" status? | |
ID: 16674 | Rating: 0 | rate: / Reply Quote | |
Cheating is a concern, in fact, but also random errors due to overclocking/driver issues are IMHO. Up to now, the vast majority of the errors that we see are due to misconfigured hosts rather than WU mistakes. Incidentally, that's the reason why we can't reliably (ie automatically) figure out "erroneous" wu right away. I would think you would be able to coorelate the reliablity index of the participating computers as part of the process ... at any rate, - this is a suggestion to consider... - it does not have to be automatic ... - it should as I originally suggest (I thought), and as Snow Crash suggested, be after "too many failures" is reached... - Cheating and mis-configuration are legitimate concerns of the project ... - tasks that are impossible to process are equally legitimate concerns of the participant ... - we do best when cooperation is the word of the day, and everyone's concerns are considered ... Thanks for taking the time to think about this ... :) | |
ID: 16675 | Rating: 0 | rate: / Reply Quote | |
Perhaps it could be done for Betas; the limited Beta numbers would deter sneaky programmers from building apps to point steal. | |
ID: 16678 | Rating: 0 | rate: / Reply Quote | |
You posit this as an either or to the exclusion of both ... | |
ID: 16687 | Rating: 0 | rate: / Reply Quote | |
I think that is a slightly limited take on what I said. | |
ID: 16692 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Credit where Credit is due ...