Message boards : Graphics cards (GPUs) : All of a sudden not getting work

Author Message
Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 4940 - Posted: 27 Dec 2008 | 14:05:28 UTC

Never had a problem getting work until now. Now this is what I get when requesting work:

Project Date Message
GPUGRID 12/27/2008 7:49:26 AM Sending scheduler request: Requested by user. Requesting 157711 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:49:31 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:49:31 AM Message from server: No work sent
GPUGRID 12/27/2008 7:49:31 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:49:31 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:50:32 AM Sending scheduler request: Requested by user. Requesting 157525 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:50:37 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:50:37 AM Message from server: No work sent
GPUGRID 12/27/2008 7:50:37 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:50:37 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:51:18 AM Sending scheduler request: Requested by user. Requesting 157401 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:51:23 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:51:23 AM Message from server: No work sent
GPUGRID 12/27/2008 7:51:23 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:51:23 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:54:06 AM Sending scheduler request: Requested by user. Requesting 156949 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:54:11 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:54:11 AM Message from server: No work sent
GPUGRID 12/27/2008 7:54:11 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:54:11 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:55:44 AM Sending scheduler request: Requested by user. Requesting 955589 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:55:49 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:55:49 AM Message from server: No work sent
GPUGRID 12/27/2008 7:55:49 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:55:49 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:56:14 AM Sending scheduler request: Requested by user. Requesting 955612 seconds of work, reporting 0 completed tasks
GPUGRID 12/27/2008 7:56:20 AM Scheduler request completed: got 0 new tasks
GPUGRID 12/27/2008 7:56:20 AM Message from server: No work sent
GPUGRID 12/27/2008 7:56:20 AM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
GPUGRID 12/27/2008 7:56:20 AM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

No changes here in config. Using BOINC v5.4.1. You can see that it's asking for work but the server isn't sending any.

The server status page says:
Results ready to send 932

What's going on?
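
The log excerpts above follow a regular pattern, so they can be tallied mechanically rather than by eye. A minimal Python sketch, assuming the message text matches the lines quoted in this post; `summarize` and the three regular expressions are illustrative names, not part of BOINC:

```python
import re
from collections import Counter

# Patterns for the message text shown in the excerpts above.
REQ_RE = re.compile(r"Requesting (\d+) seconds of work")
GOT_RE = re.compile(r"Scheduler request completed: got (\d+) new tasks")
MSG_RE = re.compile(r"Message from server: (.+)$")


def summarize(log_text):
    """Return (number of requests, tasks received, Counter of server messages)."""
    requests = 0
    tasks = 0
    messages = Counter()
    for line in log_text.splitlines():
        if REQ_RE.search(line):
            requests += 1
        got = GOT_RE.search(line)
        if got:
            tasks += int(got.group(1))
        msg = MSG_RE.search(line)
        if msg:
            messages[msg.group(1).strip()] += 1
    return requests, tasks, messages
```

Pasting a saved copy of the message log into `summarize` gives a quick count of how many requests came back empty and which server messages accompanied them, which is easier to attach to a bug report than a raw dump.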

Profile Venturini Dario[VENETO]
Send message
Joined: 26 Jul 08
Posts: 44
Credit: 4,832,360
RAC: 0
Level
Ala
Message 4941 - Posted: 27 Dec 2008 | 14:26:15 UTC - in response to Message 4940.

Same here

Donnie
Send message
Joined: 13 Nov 08
Posts: 11
Credit: 11,185,470
RAC: 0
Level
Pro
Message 4942 - Posted: 27 Dec 2008 | 14:44:18 UTC - in response to Message 4941.

My boxes are running dry too. In 7 hours they'll be sitting idle. I've tried everything from all the other threads and no joy!!!

Copycat-Digital for WCG*
Send message
Joined: 12 Nov 07
Posts: 8
Credit: 1,000,641
RAC: 0
Level
Ala
Message 4943 - Posted: 27 Dec 2008 | 14:46:18 UTC - in response to Message 4940.

I'm also dry!
____________

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Message 4945 - Posted: 27 Dec 2008 | 15:03:05 UTC - in response to Message 4943.

Scheduler logs sent to BOINC dev again.

gdf

Copycat-Digital for WCG*
Send message
Joined: 12 Nov 07
Posts: 8
Credit: 1,000,641
RAC: 0
Level
Ala
Message 4947 - Posted: 27 Dec 2008 | 15:32:26 UTC - in response to Message 4945.

Thanks GDF
Got one
____________

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 4964 - Posted: 27 Dec 2008 | 20:16:39 UTC
Last modified: 27 Dec 2008 | 20:21:37 UTC

I've tried to get work manually hundreds of times today. Nothing. Tried resetting the project several times, tried several different versions of BOINC. Tried detaching from the project and reattaching. Nothing but the same message:

12/27/2008 2:09:52 PM Sending scheduler request: Requested by user. Requesting 958003 seconds of work, reporting 0 completed tasks
12/27/2008 2:09:57 PM Scheduler request completed: got 0 new tasks
12/27/2008 2:09:57 PM Message from server: No work sent
12/27/2008 2:09:57 PM Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/27/2008 2:09:57 PM Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

This is getting extremely frustrating. Everything seemed to go rotten after v6.56 got pulled back.
Now it says there's 1150 WUs ready to send. I sure can't get any.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 4969 - Posted: 27 Dec 2008 | 21:20:48 UTC - in response to Message 4964.
Last modified: 27 Dec 2008 | 21:21:48 UTC

More info: Project priority is set very high (999). Set to DL 3 days of work. Tried suspending the CPU project for a while so GPUGRID was the only project. Nothing. Rebooted. Nothing. Said 50 hail marys and 50 our fathers. Nothing. Painted a pentagram on the floor and danced around it 7 times blowing a horn. Nothing.

Now there's:
Results ready to send 1,192
Results in progress 1,896

Profile Jack Shaftoe
Send message
Joined: 26 Nov 08
Posts: 27
Credit: 1,813,606
RAC: 0
Level
Ala
Message 4970 - Posted: 27 Dec 2008 | 21:39:16 UTC - in response to Message 4969.
Last modified: 27 Dec 2008 | 21:43:48 UTC

"Beyond" wrote:
More info: Project priority is set very high (999). Set to DL 3 days of work. Tried suspending the CPU project for a while so GPUGRID was the only project. Nothing. Rebooted. Nothing. Said 50 hail marys and 50 our fathers. Nothing. Painted a pentagram on the floor and danced around it 7 times blowing a horn. Nothing.

Now there's:
Results ready to send 1,192
Results in progress 1,896


Similar issues without the pentagram on the floor...

http://www.gpugrid.net/show_host_detail.php?hostid=20480

Finished its last GPU WU about 20 minutes ago. I updated the project while suspending CPDN (the only other project running on it) and... (drumroll please...)

12/27/2008 4:35:11 PM|GPUGRID|Sending scheduler request: Requested by user. Requesting 1728000 seconds of work, reporting 1 completed tasks
12/27/2008 4:35:16 PM|GPUGRID|Scheduler request completed: got 0 new tasks
12/27/2008 4:35:16 PM|GPUGRID|Message from server: No work sent
12/27/2008 4:35:16 PM|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer.
12/27/2008 4:35:16 PM|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer.

This issue continues to prevail. I believe it's been about 4 to 6 weeks now.

samsausage
Send message
Joined: 18 Nov 08
Posts: 12
Credit: 70,480,919
RAC: 0
Level
Thr
Message 4971 - Posted: 27 Dec 2008 | 21:52:14 UTC

Same here.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 4972 - Posted: 27 Dec 2008 | 21:53:50 UTC - in response to Message 4969.
Last modified: 27 Dec 2008 | 21:55:30 UTC

Now all of a sudden on my 871st manual update I got 1 WU. I think it was the 42nd reiteration of the Namo Uvajjhâyanam that finally did it.

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 4973 - Posted: 27 Dec 2008 | 22:17:44 UTC
Last modified: 27 Dec 2008 | 22:19:09 UTC

For the first time today the available WUs are actually decreasing, so maybe somebody fixed something:

Results ready to send 1,138
Results in progress 1,950

Profile Nognlite
Send message
Joined: 9 Nov 08
Posts: 69
Credit: 25,106,923
RAC: 0
Level
Val
Message 4974 - Posted: 27 Dec 2008 | 22:43:24 UTC
Last modified: 27 Dec 2008 | 22:47:18 UTC

Same here. This problem started back on the 10th or 11th of December, right about the same time the WUs got split into three categories: 850k steps and 2x500k steps.

Maybe it's time to go back to single WU of 850k steps and 6.3.21 with ACEMD 6.55.

Could also have something to do with the DCF. My 8800GTs are around 10.5 while my GTX280s are around 9.9 after a reset and about 1 week of work to balance them out. I believe the 8800GT was the 1x factor for the project, so there could be a problem in the code (like someone shifted the decimal point) and we now have 10x. This could be affecting all the other GPUs and throwing the whole project off.

My 2 cents!!

Pat

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 4976 - Posted: 28 Dec 2008 | 1:44:38 UTC

Got 2 new WUs :)

MrS
____________
Scanning for our furry friends since Jan 2002

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,439,886,109
RAC: 53,496,822
Level
Arg
Message 4978 - Posted: 28 Dec 2008 | 2:02:24 UTC

I just filled all my Caches again, probably will have to go thru the same thing tomorrow sometime but hopefully not.

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 4986 - Posted: 28 Dec 2008 | 5:35:36 UTC

Not to discount people's problems, but remember, we are in early days here with this new technology. We know there are issues with the various BOINC Manager versions and there are likely issues with the server software (feeder and scheduler) ...

As I have stated before, though I don't know if the project folks have picked up on it, I think we are getting the one single message instead of the variety that we should get for the various client-server interaction failures. In other words, we are having several different problems but we all get the one message. Kinda like the early days of Windows with the ubiquitous General Protection Fault message. Because almost all crashes puked up the one message, it was difficult to figure out what was happening ...

Anyway, be of good cheer ...

We are getting work even if we are missing at times ...

And, just think, not THAT long ago we were wasting all those GPU cycles completely 24/7 ... now we are wasting them only SOME of the time ... in other words, the cup is really half full ...

localizer
Send message
Joined: 17 Apr 08
Posts: 113
Credit: 1,656,514,857
RAC: 0
Level
His
Message 4987 - Posted: 28 Dec 2008 | 8:32:06 UTC - in response to Message 4986.

.... managed to fill all my caches manually - used the suspend all other projects workaround. I think I'm over my 'excitement' at the state the project is in and accept that at this time it will not run unattended/automatically for me.

It'll come good.


P.

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Message 4988 - Posted: 28 Dec 2008 | 8:33:08 UTC - in response to Message 4969.

"Beyond" wrote:
More info: Project priority is set very high (999). Set to DL 3 days of work. Tried suspending the CPU project for a while so GPUGRID was the only project. Nothing. Rebooted. Nothing. Said 50 hail marys and 50 our fathers. Nothing. Painted a pentagram on the floor and danced around it 7 times blowing a horn. Nothing.

Now there's:
Results ready to send 1,192
Results in progress 1,896



The pentagram goes on the ceiling, a ring of salt on the floor:D

Completed work on one box and received a work unit[9:30pm EST]in a normal boinc fashion...must have been the salt:)

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 4989 - Posted: 28 Dec 2008 | 8:45:32 UTC - in response to Message 4988.

"mike047" wrote:
Completed work on one box and received a work unit[9:30pm EST]in a normal boinc fashion...must have been the salt:)


So I should stop chanting for you?

All this time I thought it was going to be my chanting incantations for you to get work ...

That does it, I will only chant for my work from now on ...

MarkJ
Volunteer moderator
Volunteer tester
Send message
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Message 4990 - Posted: 28 Dec 2008 | 9:06:17 UTC - in response to Message 4969.

"Beyond" wrote:
More info: Project priority is set very high (999). Set to DL 3 days of work. Tried suspending the CPU project for a while so GPUGRID was the only project. Nothing. Rebooted. Nothing. Said 50 hail marys and 50 our fathers. Nothing. Painted a pentagram on the floor and danced around it 7 times blowing a horn. Nothing.

Now there's:
Results ready to send 1,192
Results in progress 1,896


You say "set to DL 3 days of work". Do you mean the cache is set for 3 days? GPUGRID has short turn around times and typically will want work back within 3-4 days so that might be the reason why you can't get any (as well as BOINC 6.4.5 not working properly). Try setting the cache a bit lower to say 2 days, that way BOINC should think it can complete the work in time.
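
The cache-versus-deadline reasoning in this post can be put as simple arithmetic. A minimal Python sketch of a toy model, not BOINC's actual work-fetch logic: it assumes a newly fetched task waits behind roughly one cache's worth of queued work and then runs for its estimated time. The function name, the 3-day deadline and the 8-hour runtime are illustrative assumptions.

```python
def fits_deadline(cache_days, est_runtime_days, deadline_days):
    """Toy check: can a task fetched now finish before its deadline?

    Assumes the task waits behind about `cache_days` of queued work
    and then runs for `est_runtime_days`. This is a simplification,
    not the rule the BOINC client or scheduler really applies.
    """
    return cache_days + est_runtime_days <= deadline_days


# Hypothetical numbers: 3-day deadline, 8-hour task.
print(fits_deadline(3.0, 8 / 24, 3.0))  # 3-day cache
print(fits_deadline(2.0, 8 / 24, 3.0))  # 2-day cache
```

Under these assumed numbers the 3-day cache fails the check and the 2-day cache passes, which is the effect the suggestion to lower the cache relies on.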

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,439,886,109
RAC: 53,496,822
Level
Arg
Message 4992 - Posted: 28 Dec 2008 | 10:39:42 UTC

"Paul D. Buck" wrote:
And, just think, not THAT long ago we were wasting all those GPU cycles completely 24/7 ... now we are wasting them only SOME of the time ... in other words, the cup is really half full ...


Look @ it this way Paul, for some people the Cup is never full or even half full, if they had full Caches they would just find something else to Whine about ... :)

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 5005 - Posted: 28 Dec 2008 | 14:35:28 UTC - in response to Message 4988.

"mike047" wrote:
The pentagram goes on the ceiling, a ring of salt on the floor:D

Oh man, no wonder that one didn't work!

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 5007 - Posted: 28 Dec 2008 | 14:45:19 UTC - in response to Message 4990.

"MarkJ" wrote:
You say "set to DL 3 days of work". Do you mean the cache is set for 3 days? GPUGRID has short turn around times and typically will want work back within 3-4 days so that might be the reason why you can't get any (as well as BOINC 6.4.5 not working properly). Try setting the cache a bit lower to say 2 days, that way BOINC should think it can complete the work in time.

I tried both longer and shorter caching, neither worked at the time. It was a server problem yesterday AM. Those who had work in their caches probably didn't notice it but many had the no work problem. I don't use v6.4.5, it's highly flawed (they need to take the message saying to use v6.4.5 off the front page). At the time I tried v6.3.21, v6.4.1 and v6.4.2. None of them worked during the server problem. Things seem to be working again (at least here) and I've received 3 WUs since the server started working more or less properly again.

Profile mike047
Send message
Joined: 21 Dec 08
Posts: 47
Credit: 7,330,049
RAC: 0
Level
Ser
Message 5011 - Posted: 28 Dec 2008 | 15:01:48 UTC - in response to Message 5007.

I use v6.4.5 on Ubuntu and a 1.5 day cache. The system has worked as it is supposed to for the second day in a row. When work is completed, it gets one more unit. No intervention on my part.

Of course, now that I have said that....everything will fall apart.:D

Need more salt.

mike

ExtraTerrestrial Apes
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 17 Aug 08
Posts: 2705
Credit: 1,311,122,549
RAC: 0
Level
Met
Message 5012 - Posted: 28 Dec 2008 | 15:05:10 UTC

- I'm sure the devs know the message "no cell blabla" doesn't fit.. but since the real reason is not known yet they didn't make it more specific. Might be good to at least include one for "really no gpu work available", though!

- yesterday was a pure server problem, the server logs have been sent to the BOINC devs.. let's keep our fingers crossed they find the bug

- for everyone who accidentally painted the pentagram on the floor: you can still use it, no need to rip off your parquet just yet! Do the dance upside down and mount the salt to the ceiling, just be careful not to choke yourself while blowing the horn.

MrS
____________
Scanning for our furry friends since Jan 2002

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 5018 - Posted: 28 Dec 2008 | 16:17:26 UTC - in response to Message 4992.

"STE\/E" wrote:
"Paul D. Buck" wrote:
And, just think, not THAT long ago we were wasting all those GPU cycles completely 24/7 ... now we are wasting them only SOME of the time ... in other words, the cup is really half full ...


Look @ it this way Paul, for some people the Cup is never full or even half full, if they had full Caches they would just find something else to Whine about ... :)

Like me complaining at Rosetta that 4 tasks died with a File lock problem ... I have since, now, run one task to completion and have another with an hour to go ...

Ah well, if not one thing it is another ...

Alain Maes
Send message
Joined: 8 Sep 08
Posts: 63
Credit: 1,652,401,823
RAC: 1,730,167
Level
His
Message 5032 - Posted: 29 Dec 2008 | 9:32:13 UTC

My last manual update now dates back to two days ago when I managed to get 4 WUs. Since then work fetch works again automatically, my cache being kept at 3 WUs almost continuously with one running. Great! And just in time for me since I am leaving in an hour for a week to visit my parents.

Just one remark though. I noticed in BOINCVIEW that my waiting WUs have a completion time of 11 days and almost 16 hours. As soon as a WU starts it immediately jumps to a completion time of 7 hr 50 min, which is BTW pretty accurate for my GTX260.

Kind regards and happy crunching.

Alain
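
The gap between the waiting estimate and the real runtime in this post can be expressed as a single ratio. A small Python sketch using only the two figures quoted above (11 days 16 hours estimated, 7 hours 50 minutes observed); the helper name `hours` is illustrative, and the ratio is just an overestimate factor, not a claim about how BOINC computes its duration correction factor:

```python
def hours(days=0, hrs=0, minutes=0):
    """Convert a days/hours/minutes duration to hours."""
    return days * 24 + hrs + minutes / 60


estimated = hours(days=11, hrs=16)   # estimate shown for waiting WUs
actual = hours(hrs=7, minutes=50)    # estimate once a WU starts running
ratio = estimated / actual

print(f"estimated {estimated:.0f} h, actual {actual:.2f} h, ratio {ratio:.1f}x")
```

An overestimate of this size matters for work fetch: a client that believes each queued task takes more than eleven days would see its cache as full long before it really is.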

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 5059 - Posted: 30 Dec 2008 | 1:28:51 UTC

I got the no work message from a call at 12/29/2008 4:59:50 (California time), and I think the situation was caused THIS TIME by having one task in my queue that was rated at 30+ hours. Now, that rating comes from having had that 2-plus-day task the other day, and from the system still thinking that the computer has a 9800 GT rather than the new GTX 280 ... I am 36 minutes in, 9.8% done, and the time is dropping rapidly from 30 hours; it is currently at 26:06 and falling.

*SO*, what that tells me is that the "generic" message we are getting for all sorts of cases is probably related to the scheduler thinking that I have plenty to do and there is no need to give me more work ... my request asked for 56,124 seconds of work, my queue is set to 0.50 days extra work, I have 6 projects on the computer with two out of work (Pirates and LHC) with work from Cosmology and WCG, I don't know why Malaria is dry ...

So, I have 9 tasks in progress, one GPU Grid, 4 WCG, and 4 Cosmology... I have 7 cosmology queued and 4 additional WCG ... runtimes span 2 hours remaining to 20 hours to complete from cold start ...

Thumbnail look seems like I only have about half a days work in hand ... though I may be just on the other side at 0.6 days in hand...

I need a nap ... it is likely I will be back up in a couple hours and I will report if anything changes ...

Oh, when I looked there were 700 plus tasks in the feeder queue ...

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Message 5061 - Posted: 30 Dec 2008 | 2:21:01 UTC

That's similar to the situation here. To get new work I have to stop the other projects. I think it's a problem with how the scheduler handles the high-priority mode in which a core feeds the GPU in the background, hidden from the user. Only if all other tasks are stopped and the system makes a call for more than 250,000 do I get new work. Running here: CPDN (over 800 hours), PrimeGrid and MilkyWay with work for the work cache of 0.5 days, LHC and Pirates calling for work without success.
____________

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 5069 - Posted: 30 Dec 2008 | 8:34:10 UTC

OK, up from my nap. With one task nearing completion, I asked for more work and got two tasks.

Yes there are issues with the work fetch policy in that it has not been generalized. Worse ... well, I posted another thread where you can see the discussion as it stands at the moment...

But, yes, there are issues and the current policy as coded in the BOINC Manager may let the CUDA resource go idle ... oops ... :)

Profile [AF>Libristes>Jip] Elgran...
Avatar
Send message
Joined: 16 Jul 08
Posts: 45
Credit: 78,618,001
RAC: 0
Level
Thr
Message 5075 - Posted: 30 Dec 2008 | 15:02:55 UTC

I always had these messages :
Full-atom molecular dynamics for Cell processor is not available for your type of computer.
Full-atom molecular dynamics on Cell processor is not available for your type of computer.

They irritate me a lot now. I tried reattaching the project and all of the available options, but nothing solves the problem.
These messages happened on hosts 6362, 5716 and 15576.
I am really tired of this situation.
Fix it rapidly please.

Profile GDF
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Send message
Joined: 14 Mar 07
Posts: 1957
Credit: 629,356
RAC: 0
Level
Gly
Message 5077 - Posted: 30 Dec 2008 | 15:49:19 UTC - in response to Message 5075.

This is really a problem for us as well. There was a change of scheduler policy a couple of weeks ago in the BOINC server which is buggy. Please post on the BOINC dev forums as well. The holiday period is slowing them down in fixing it.


gdf

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 5080 - Posted: 30 Dec 2008 | 16:13:57 UTC

I had the problem for a couple hours this morning but just got two more (whew!) ...

Are you feeding PS3 and Nvidia systems off of the same feeder?

I wonder if the pull rate is such that the feeder gets "clogged" with tasks for the other platform ...

Other projects have had issues like this ... in the dark past ...
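
The "clogged feeder" idea raised in this post can be illustrated with a toy model. This is a sketch of the hypothesis only, not BOINC's feeder code: it assumes a small fixed set of slots filled in order from a pending queue, and a request that can only be served from those slots. The slot count, the platform labels `"cell"` and `"cuda"`, and the function names are all made up for illustration.

```python
from collections import deque


def refill(slots, queue, size):
    """Top up the slot list from the pending queue, in queue order."""
    while len(slots) < size and queue:
        slots.append(queue.popleft())


def serve(slots, platform):
    """Hand out the first slotted task for `platform`, or None if there is none."""
    for i, task in enumerate(slots):
        if task == platform:
            return slots.pop(i)
    return None


# Pending work: four tasks for one platform queued ahead of three for another.
queue = deque(["cell"] * 4 + ["cuda"] * 3)
slots = []
refill(slots, queue, size=4)

# Every slot holds a "cell" task, so a "cuda" host gets nothing
# even though "cuda" work is still waiting in the queue.
print(serve(slots, "cuda"))
print(list(queue))
```

In this model the "cuda" host is only served once some other host drains a "cell" slot and the refill pulls in a "cuda" task, which would match the pattern of work being listed as ready to send while requests come back empty.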

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 240,561,326
RAC: 4,779,440
Level
Leu
Message 5083 - Posted: 30 Dec 2008 | 16:44:32 UTC - in response to Message 5080.

Looking at the server status it seems there's only one feeder for all platforms...
____________

pixelicious.at - my little photoblog

Profile Nognlite
Send message
Joined: 9 Nov 08
Posts: 69
Credit: 25,106,923
RAC: 0
Level
Val
Message 5086 - Posted: 30 Dec 2008 | 16:50:39 UTC

No work again. Second day in a row. (As my 280s sit idly by contemplating their existence!!)

Pat

lllvette
Avatar
Send message
Joined: 1 Nov 08
Posts: 6
Credit: 16,022,037
RAC: 0
Level
Pro
Message 5089 - Posted: 30 Dec 2008 | 17:58:26 UTC

When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.
____________

Profile Stefan Ledwina
Avatar
Send message
Joined: 16 Jul 07
Posts: 464
Credit: 240,561,326
RAC: 4,779,440
Level
Leu
Message 5094 - Posted: 30 Dec 2008 | 18:59:43 UTC - in response to Message 5089.

"lllvette" wrote:
When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.


Since you are posting in the right thread, could you please look at the last posts? ;) Especially the one 4 posts before yours...

The project admin is aware of the problem but he cannot fix it, because there's something wrong with the server software and they are also waiting for a fix from Berkeley...


"GDF" wrote:
This is really a problem for us as well. There was a change of scheduler policy a couple of weeks ago in the BOINC server which is buggy. Please post on the BOINC dev forums as well. The holiday period is slowing them down in fixing it.


gdf



____________

pixelicious.at - my little photoblog

JAMC
Send message
Joined: 16 Nov 08
Posts: 28
Credit: 12,688,454
RAC: 0
Level
Pro
Message 5099 - Posted: 30 Dec 2008 | 21:43:04 UTC

Not able to get any new WU's even with manual priming...5 machines... old "Atom Not Available" message...server problems???

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Message 5100 - Posted: 30 Dec 2008 | 22:18:19 UTC

Same here, got the last WU at 4:44 UTC, now 3 machines will be running dry in the next 4 to 8 hours.
____________

STE\/E
Send message
Joined: 18 Sep 08
Posts: 368
Credit: 3,439,886,109
RAC: 53,496,822
Level
Arg
Message 5101 - Posted: 30 Dec 2008 | 22:19:42 UTC - in response to Message 5089.

"lllvette" wrote:
When is this "Atom Not available " Problem Going to be fixed? It Has been going on for about 2 weeks now.... Is Any one addressing this problem? I have tried 6.4.2, 6.5.0 versions, Reset project several times , Reloaded drivers Etc..and nothing works... Please help... Thanks.


Nothing you can do about it lllvette, some of my Box's are out too and I've switched them to 4&1 from 3&1 so 1 Core doesn't sit Idle waiting for another GPU Wu.

It wouldn't be so bad though if the Project would change that 24 Hour Wait Period before it tries to contact the Server again so we didn't have to Manually Contact every so often to try & pick up a Wu ...

Jayargh
Send message
Joined: 21 Dec 07
Posts: 47
Credit: 5,252,135
RAC: 0
Level
Ser
Message 5102 - Posted: 30 Dec 2008 | 22:48:09 UTC
Last modified: 30 Dec 2008 | 22:53:08 UTC

At this point, why is the project continuing to keep short deadlines, 1 task per core, and 24 hour back-offs???????

If they could flood us with work even once while all these problems are going on they would be getting work back while now they get none or very little and we could be happily crunching????

I don't understand the logic of letting us starve while the scheduler/feeder is broken!!!!!

Profile Nightlord
Avatar
Send message
Joined: 22 Jul 08
Posts: 61
Credit: 5,461,041
RAC: 0
Level
Ser
Message 5103 - Posted: 31 Dec 2008 | 0:03:49 UTC
Last modified: 31 Dec 2008 | 0:15:58 UTC

Did somebody just kick a server?

I manually updated my boxes using Boincview and half of my dry machines downloaded new WU's! Unfortunately, the others are stuck again in the normal cycle....it seems the kick was not hard enough ;-)

/edit

that did it! The second kick must have pushed the server network cable back in properly....all machines back running again :-)
____________

Profile Kokomiko
Avatar
Send message
Joined: 18 Jul 08
Posts: 190
Credit: 24,093,690
RAC: 0
Level
Pro
Message 5104 - Posted: 31 Dec 2008 | 1:29:18 UTC
Last modified: 31 Dec 2008 | 1:30:02 UTC

Stop the other projects and do a manual update, then you should also get work for the other boxes.
____________

Profile Paul D. Buck
Send message
Joined: 9 Jun 08
Posts: 1050
Credit: 37,321,185
RAC: 0
Level
Val
Message 5106 - Posted: 31 Dec 2008 | 3:35:37 UTC

It is because I just got back up and started chanting ....

Profile DoctorNow
Avatar
Send message
Joined: 18 Aug 07
Posts: 83
Credit: 122,995,082
RAC: 0
Level
Cys
Message 5107 - Posted: 31 Dec 2008 | 6:42:32 UTC - in response to Message 5104.
Last modified: 31 Dec 2008 | 6:43:45 UTC

"Kokomiko" wrote:
Stop the other projects and do a manual update, then you should also get work for the other boxes.

Geez, it just took me a dozen tries even with this trick to get a new WU, and that with over 800 WUs waiting in line... :-\
Seems it's not even a bit better.
____________
Member of BOINC@Heidelberg and ATA!

JAMC
Send message
Joined: 16 Nov 08
Posts: 28
Credit: 12,688,454
RAC: 0
Level
Pro
Message 5119 - Posted: 31 Dec 2008 | 15:34:07 UTC

I am managing to get new WU's via manual priming again...

Profile Beyond
Avatar
Send message
Joined: 23 Nov 08
Posts: 1112
Credit: 6,162,416,256
RAC: 0
Level
Tyr
Message 5121 - Posted: 31 Dec 2008 | 15:44:00 UTC - in response to Message 5106.

"Paul D. Buck" wrote:
It is because I just got back up and started chanting ....

Paul, you're going to have to chant louder :-)
