Message boards : Number crunching : SANTI WU Killed My GPU
Author | Message |
---|---|
Win7 Home, ASUS GTX 660, GPUGrid 24/7: | |
ID: 33832 | Rating: 0 | rate: / Reply Quote | |
I don't know what happened, but I have seen that many many times at my 660 and Santi SR's. So I do only LR's now on the 660. However today I had also a Fatal cuda driver error. But in my case it only resulted in the GPU-clock to down clock. I did a reboot and it is okay again. | |
ID: 33843 | Rating: 0 | rate: / Reply Quote | |
Here is the offending WU. What the heck happened?? I see you are still on the 327.23 drivers. Why not try 331.65? I think they implement a later version of CUDA, and work fine on my GTX 660s on the Longs, including many Santis. | |
ID: 33844 | Rating: 0 | rate: / Reply Quote | |
Sounds familiar to this problem, doesn't it? | |
ID: 33855 | Rating: 0 | rate: / Reply Quote | |
Sounds familiar to this problem, doesn't it? I don't think so, as there was no power outage in my case, I can not speak for Tomba. And even when one of my PSU burnt down and the main fuse went off, and all PC's where abruptly shut down, the GPUGRID WU did start nicely without problems after I reboot the systems. So I think this is caused by something in the Santi WU's as I have seen it only with these. And I watch my systems closely. ____________ Greetings from TJ | |
ID: 33864 | Rating: 0 | rate: / Reply Quote | |
In the last 24 hours I have had a very similar issue where the nVidia kernel keeps crashing when doing GPU units. | |
ID: 33869 | Rating: 0 | rate: / Reply Quote | |
Storm wrote: Today I've removed the nVidia drivers and cleaned by system of any profile or configs left from them. Then reloaded the latest 331.65 drivers for my dual 670 GTX in SLI mode and suspended all GPU jobs. FYI: SLI is not recommended while crunching on the GPUs. | |
ID: 33870 | Rating: 0 | rate: / Reply Quote | |
New Beta app coming later today that should help with this. | |
ID: 33873 | Rating: 0 | rate: / Reply Quote | |
Yesterday another Santi LR resulted in this error: SWAN : FATAL : Cuda driver error 715 in file 'swanlibnv2.cpp' in line 1969. | |
ID: 33979 | Rating: 0 | rate: / Reply Quote | |
It is something with Santi's WU and a GTX660, as I have not yet seen this error on my GTX770, running since August. Maybe I have just been lucky, but ever since implementing my under-clocking and over-volting fix http://www.gpugrid.net/forum_thread.php?id=3466&nowrap=true#33677, I have not had an error, or even an instance of slow-running. The GTX 660s may be susceptible to problems if they are factory overclocked more for example, or don't have as large heatsinks as the others relative to their heat output. But it appears that they can be fixed, and there is nothing inherently wrong with the chip itself for running the current work units. By the way, I now suspect that the slow-running is triggered by hitting against the GPU power limit, which probably causes the self-protective circuity in the chip to reduce its clock rate. Then, it never resumes the higher rate until you reboot it. Increasing the power limit helps avoid this problem, as long as you monitor the resulting temperature (the limit is there for a reason). | |
ID: 33980 | Rating: 0 | rate: / Reply Quote | |
My 660 is not factory overclocked and I have set the clock speed little lower than what it would be without intervention. | |
ID: 33981 | Rating: 0 | rate: / Reply Quote | |
Yes, you look good. | |
ID: 33982 | Rating: 0 | rate: / Reply Quote | |
Just chipping in with a "me too" post: | |
ID: 33984 | Rating: 0 | rate: / Reply Quote | |
I have a box that has two GT 430's that are having this problem. | |
ID: 33985 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : SANTI WU Killed My GPU