Message boards : Graphics cards (GPUs) : Server won't give me work
Author | Message |
---|---|
So, after the problems with the KASHIF work units, and coming in this mornign to find that three work units had all failed with a computation error, I decided to go ahead and upgrade to 6.6.28. Now the server won't give me any work saying I have no CUDA device. It's been running for a while now no problem, why this issue now? Any suggestions? | |
ID: 10074 | Rating: 0 | rate: / Reply Quote | |
So, after the problems with the KASHIF work units, and coming in this mornign to find that three work units had all failed with a computation error, I decided to go ahead and upgrade to 6.6.28. Now the server won't give me any work saying I have no CUDA device. It's been running for a while now no problem, why this issue now? Any suggestions? Did you change the opt-in / opt-out setting in the preferences? If you changed from the right versino of BOINC it was opt-in, then they suddenly changed it to opt out so that CUDA cards are disabled by default. Change the preference on the web site here, then update the machine... | |
ID: 10076 | Rating: 0 | rate: / Reply Quote | |
Thanks Paul. I'm gonna need a bit more help here (which is sad as I'm a professional scientist) but where is this opt/in opt/out setting. There's three preferences tabs (computing/gpugrid/computing) and in none of them do I see anything labeled opt-in or opt-out. Searching the pages for 'opt' isn't turning up anything, and when I look at my preferences it looks like the GPU should be available for computing. I'd like to get back to crunching since I've done nothing for the last almost 2 days due to a series of compute failures. Thanks. | |
ID: 10083 | Rating: 0 | rate: / Reply Quote | |
It is in the computing preferences - http://www.gpugrid.net/prefs.php?subset=global. Paul is talking about the setting "Suspend GPU work while computer is in use?". | |
ID: 10084 | Rating: 0 | rate: / Reply Quote | |
The error says you're in "device emulation" mode, meaning no CUDA device is found. Did you change anything else? Changed the GPU and installed drivers from the CD? | |
ID: 10091 | Rating: 0 | rate: / Reply Quote | |
Ok, Time Bandit helped me out as I was not clear ... sorry about that ... but that is only one possibility that leapt off the top of my pate... | |
ID: 10100 | Rating: 0 | rate: / Reply Quote | |
No, I haven't changed anything else, just upgraded the BOINC client. I did notice however that after 4 tasks had failed due to a 'computation error' that new work hadn't been requested or given and it had been sitting idle. I have the same card and the same drivers. I even reinstalled the drivers from NVidia's website (no CD for this computer) but no change. Per the requested I'm pasting the error messages below. | |
ID: 10104 | Rating: 0 | rate: / Reply Quote | |
Which driver are you using? I cae you don't know and have already deleted the installation file GPU-Z should tell you. | |
ID: 10106 | Rating: 0 | rate: / Reply Quote | |
Also, should say the the driver version is 6.14.11.8265. Just a month old. | |
ID: 10108 | Rating: 0 | rate: / Reply Quote | |
Have you already tried to reinstall the drivers? 5/23/2009 12:59:42 PM No CUDA devices found ____________ pixelicious.at - my little photoblog | |
ID: 10109 | Rating: 0 | rate: / Reply Quote | |
Um, yes, I can tell from the messages that it can't find the device. When I first joined the project in early April Boinc was unable to find a CUDA device. I upgraded the drivers from the ones that came with the machine when I got it last September. With that update, 182.46, everything worked fine and I've been crunching away. | |
ID: 10110 | Rating: 0 | rate: / Reply Quote | |
I don't know why, but it sure does not see the CUDA device. And I am stumped as to what to try next. | |
ID: 10116 | Rating: 0 | rate: / Reply Quote | |
I was told to upgrade BOINC. Who was that? Must have been someone who doesn't like you? ;) No, seriously.. if your answer to Pauls question does not include any "yes" I'd revert to 6.6.20 and see if that gets you going again. If not - we'll all scratch our heads. You could try 6.5.0 and the drivers you list, 182.46 and 182.65 are both beta, as far as I know. 182.50 is the last WHQL driver of the non-185 series. I'd try 182.50 or 185.6x or 185.8x, though the latter 2 seem not to be trouble free yet. MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 10124 | Rating: 0 | rate: / Reply Quote | |
Hmm, today it started working again on its own. Argh. Ok, well, will watch and see if I get errors of if ti runs smoothly again. Thanks everyone for your input. | |
ID: 10147 | Rating: 0 | rate: / Reply Quote | |
Well, it's getting work, but ALL the work units seem to be failing after only a few seconds (<10-20 sec). Are we having more workunit problems? | |
ID: 10176 | Rating: 0 | rate: / Reply Quote | |
You are running in device emulation. | |
ID: 10181 | Rating: 0 | rate: / Reply Quote | |
Interesting about the emulation mode and that it wasn't a problem before. Can you say where in the advanced tab this setting is? I don't see it under any of the subheadings in the advanced menus. Are you suggesting I find the config file and change it at the level of the text? Is unproected mode going to affect the stability of the system? | |
ID: 10196 | Rating: 0 | rate: / Reply Quote | |
Interesting about the emulation mode and that it wasn't a problem before. Can you say where in the advanced tab this setting is? I don't see it under any of the subheadings in the advanced menus. Are you suggesting I find the config file and change it at the level of the text? Is unproected mode going to affect the stability of the system? It is the third or fourth screen in during the install. I cannot recall if the repair install allows you to change this setting or not. Uninstall only removes BOINC is should not change your data or project settings unless you whack the directories yourself. You can also do a downlevel and up level of the version ... but you have to catch the screen as it goes by and make sure that the setting is unchecked. | |
ID: 10200 | Rating: 0 | rate: / Reply Quote | |
The repair utility doesn't give you access there. I uninstalled and reinstalled. The option to run in protected mode (an unpriveldged account) was already unchecked, so it seems like that's not the issue. | |
ID: 10201 | Rating: 0 | rate: / Reply Quote | |
The repair utility doesn't give you access there. I uninstalled and reinstalled. The option to run in protected mode (an unpriveldged account) was already unchecked, so it seems like that's not the issue. Ok, there is still the issue where the device emulation comes up. Are you running remote monitoring software, a remote desktop, VM emulation, anything like that? If the video device is "virtualized" it cannot be used for work. Many of these software systems make the video card a "virtual" device that can be shared. Sadly, that means that it cannot be used by BOINC. Once you have errored out a certain number of tasks you cannot get any more until 24 hours has passed. This is to prevent you from "trashing" all the available tasks with a bad system set-up. When you get another task and return it safely, then you will be able to get another and another ... until you are back in good graces and can get the maximum. But, if the device is "broken" we have to fix that first... | |
ID: 10203 | Rating: 0 | rate: / Reply Quote | |
Ok, the server just let me have a workunit. They take 17-22 hours each (despite the large number of shaders on my card) so it'll be a while before I'm 'redeemed' but so far it's running smoothly. | |
ID: 10216 | Rating: 0 | rate: / Reply Quote | |
Cool ... | |
ID: 10219 | Rating: 0 | rate: / Reply Quote | |
thanks for the note about VNC. I'd have to install it across too many different computers just to be able to support this one card on this one project. Unforutnately the time and resource required to continue to participate is getting a little high. Remote Desktop didn't cause problems for the first few months I was on the project so it's weird to see it happening now. Maybe earlier BOINC versions just paused it or something, I don't know. At this point I think it's best to detach from the project. | |
ID: 10308 | Rating: 0 | rate: / Reply Quote | |
thanks for the note about VNC. I'd have to install it across too many different computers just to be able to support this one card on this one project. Unforutnately the time and resource required to continue to participate is getting a little high. Remote Desktop didn't cause problems for the first few months I was on the project so it's weird to see it happening now. Maybe earlier BOINC versions just paused it or something, I don't know. At this point I think it's best to detach from the project. ugh, well, we will miss you ... we always do ... :) Come back when it is easier ... Oh, and GDF indicated that they might be doing CPU work later ... (will that eventually trigger a name change to "CPU and GPU CUDA and OpenCL with ATI cards and Larabie Grid.net" when they finish adding capabilities ???) :) Sorry guys, only sense of humor I got ... Anyway, come back when you can ... | |
ID: 10337 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Server won't give me work