Message boards : Graphics cards (GPUs) : Encounter 10-12 H-bond term == Client error 0x1 ?
Author | Message |
---|---|
Virtually all my currently failing jobs have this "Found zero 10-12 H-bond term" warning. I have examined other people's results, and more than once the 'buddies' errored out as well. Is it known what side effects this "Found zero 10-12" warning has? Is it being investigated? One job (519558) had an out-of-memory error and was terminated by XP. I had disabled my 'faulty' GPU3, so this is NOT the 'faulty' one. My errors have occurred over several GPUs today. Do we have a GPU testing program? | |
ID: 10439 | Rating: 0 | rate: / Reply Quote | |
I'm running on CUDA device: GeForce 9800 GTX/9800 GTX+ (driver version 18608, compute capability 1.1, 1024MB, est. 85GFLOPS) | |
ID: 10440 | Rating: 0 | rate: / Reply Quote | |
I have been experiencing this type of symptom for quite a while on one of my computers, which has 2x GTX 260 Core 216 SC. Backing off the clocks doesn't seem to have helped, nor has reloading, downgrading, or upgrading the driver. | |
ID: 10445 | Rating: 0 | rate: / Reply Quote | |
I have a number of machines: 4 quaddies with GTS250's and an i7 with dual GTX260+. I run WinXP on them. Nothing is overclocked. | |
ID: 10447 | Rating: 0 | rate: / Reply Quote | |
Please check your driver version. It is possible that very new drivers have problems. You should use the one suggested by Nvidia for CUDA 2.1 unless you have a reason to use another one (for instance a game requiring a new driver). | |
ID: 10448 | Rating: 0 | rate: / Reply Quote | |
I tried every driver version and still get the same errors, with only a few WUs finishing correctly. | |
ID: 10457 | Rating: 0 | rate: / Reply Quote | |
A bit less than 2 weeks ago I had this kind of error almost constantly. Backing down the GPU clock (including the GPU memory clock) resolved the issues. With one exception, everything GPUGRID has thrown at the system concerned ever since has run without a glitch, although a bit slower. | |
ID: 10459 | Rating: 0 | rate: / Reply Quote | |
Please check your driver version. It is possible that very new drivers have problems. You should use the one suggested by Nvidia for CUDA 2.1 unless you have a reason to use another one (for instance a game requiring a new driver). I thought that we were going to be moving up to CUDA 2.2 or did the error on Nvidia's part put a stop to that? I always receive this message on my results, but they don't error out. Rob | |
ID: 10460 | Rating: 0 | rate: / Reply Quote | |
I returned all cards to stock speeds but have not crunched since. I did RMA my GPU3 (later GPU5 after swapping slots) card. It showed a hardware failure on one test with OCCT. | |
ID: 10468 | Rating: 0 | rate: / Reply Quote | |
I tried slowing down my GPU, but there is still the same problem with WUs from GPUGRID. | |
ID: 10470 | Rating: 0 | rate: / Reply Quote | |
@Hydro: seems like you're back up'n running. Was it "just" the downclocking? jrobbio wrote: I thought that we were going to be moving up to CUDA 2.2 The next client is going to be 2.2, but no reason to hurry. Hydro wrote: Is it known what side effects this "Found zero 10-12" warning has ? Is it being investigated ? I think Ignasi said this warning is nothing to worry about (for us). Sounds like "no side effects are known". MrS ____________ Scanning for our furry friends since Jan 2002 | |
ID: 10513 | Rating: 0 | rate: / Reply Quote | |
Hi, not sure, the absence of GPU3 may have something to do with it too. I currently have the remaining 6 mildly overclocked to 633 (as that is an EVGA advertised speed for G200-based cards). So far so good. GPU3(5) has been RMA'd and its slot is currently empty. Fans are at 89% with temperatures not over 65 °C. It was not a power issue, as there is plenty. @Hydro: seems like you're back up'n running. Was it "just" the downclocking? ____________ Join team Bletchley Park, the innovators. | |
ID: 10526 | Rating: 0 | rate: / Reply Quote | |
Ubuntu 8 failed installation because of: | |
ID: 10527 | Rating: 0 | rate: / Reply Quote | |
Same error. Gpu isn't oc'ed and driver version is 182.08. | |
ID: 10529 | Rating: 0 | rate: / Reply Quote | |
Your error "Incorrect function. (0x1) - exit code 1 (0x1)" is a very general one which, roughly speaking, can happen due to anything going wrong during the calculation. | |
ID: 10531 | Rating: 0 | rate: / Reply Quote | |
The error is caused by "Cuda error: Kernel [frc_sum_nb_forces] failed in file 'f | |
ID: 10532 | Rating: 0 | rate: / Reply Quote | |
Your error "Incorrect function. (0x1) - exit code 1 (0x1)" is a very general one which, roughly speaking, can happen due to anything going wrong during the calculation. OK, I thought this was the same error we were talking about. My bad. Hydropower, this is an XFX version of the card, so it is guaranteed to work with these clocks. | |
ID: 10535 | Rating: 0 | rate: / Reply Quote | |
That's what I mean: only 3%. | |
ID: 10537 | Rating: 0 | rate: / Reply Quote | |
The error is caused by "Cuda error: Kernel [frc_sum_nb_forces] failed in file 'f" I'm quite convinced that it's a transient error, which means that at some point some calculation threw out a bad result. That means it wouldn't matter in which file and in which code line it happened, unless we discovered some regularity. I'm not disagreeing with you, but IMO saying "The error is caused by.." probably misses the point. Again, I think a good shader testing program would be useful, even for overclocking tests. We don't have the perfect tool yet, but I think if a card survives FurMark for an hour without artefacts it should be fine for GPU-Grid. Yes, it doesn't run exactly the same code (but nothing except GPU-Grid itself could do that), so there might be problems where only certain combinations of instructions trigger errors. But FurMark stresses the cards so hard that it could almost be called a thermal virus, and it should easily generate 20 - 30°C more than GPU-Grid (at constant fan speed). This reduces the maximum stable frequency by quite a bit, so errors are much more likely to show up. Good old 3D Mark also has error detection built in. It's far from perfect, but if you can't finish it you know you're in trouble (it doesn't work the other way around, though). MrS ____________ Scanning for our furry friends since Jan 2002 | |
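To illustrate the "transient error" idea above: a deterministic calculation should give bit-identical results every time, so one way to spot flaky hardware is to repeat the same workload and compare result digests. The following is only a rough sketch of that principle, not the GPU-Grid client's method and not a real GPU test. `reference_workload` and `count_mismatches` are hypothetical names, and the workload runs on the CPU as a stand-in for a GPU kernel.

```python
import hashlib
import struct


def reference_workload(n=20000):
    """Deterministic floating-point sum (hypothetical stand-in for a GPU kernel)."""
    acc = 0.0
    for i in range(1, n + 1):
        acc += (i % 7) / i
    return acc


def digest(value):
    """Hash the exact bit pattern of a float so that any deviation is visible."""
    return hashlib.sha256(struct.pack("<d", value)).hexdigest()


def count_mismatches(workload, runs=5):
    """Run the workload once as a reference, then `runs` more times.

    Returns how many repeat runs disagreed with the reference run.
    On stable hardware a deterministic workload should give 0.
    """
    expected = digest(workload())
    return sum(1 for _ in range(runs) if digest(workload()) != expected)


if __name__ == "__main__":
    bad = count_mismatches(reference_workload, runs=5)
    print("mismatching runs:", bad)
```

A non-zero count would point at instability in general (clocks, heat, memory) rather than at a particular source file, which is the point made above about the file and line number not mattering. As with 3D Mark, a clean pass proves little; only a failure is informative.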
ID: 10539 | Rating: 0 | rate: / Reply Quote | |