Message boards : Number crunching : Too many errors!!
Author | Message |
---|---|
I suspect this is happening to everyone, something needs to be done right away. | |
ID: 52173 | Rating: 0 | rate: / Reply Quote | |
Maybe so but nothing happens at this project 'right away'. | |
ID: 52178 | Rating: 0 | rate: / Reply Quote | |
Getting many frozen units, usually around 6% or so the time runs, but after 2-3 hours nothing is happening. | |
ID: 52321 | Rating: 0 | rate: / Reply Quote | |
The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze. | |
ID: 52322 | Rating: 0 | rate: / Reply Quote | |
More errors and time wasted. I will wait until I have an answer/correction is made. Shame since I would like to crunch for medical science. | |
ID: 52324 | Rating: 0 | rate: / Reply Quote | |
Retvari Zoltan gave you the answer. No one else responded as it is probably correct and the 1st thing that should be fixed. | |
ID: 52325 | Rating: 0 | rate: / Reply Quote | |
Though temperature may have something to do with it, it may be other things as well. | |
ID: 52326 | Rating: 0 | rate: / Reply Quote | |
The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze. My personal experience is that this temperature doesn't cause problems. I have been crunching GPUGrid for over a year now (24/7) with a 1070 in a case with airflow problems that crunches GPUGrid WUs at between 82 and 85 degrees. Feel free to check my recent tasks to confirm. I might be decreasing the life of the GPU (time will tell), but at least in my case everything is working OK. (I am now looking at cooling options for this one. Because of my experience from my first build (Linux only), I have gained some confidence in trying modifications for the Dell machine with the 1070.) | |
ID: 52327 | Rating: 0 | rate: / Reply Quote | |
GPUGrid long work units especially may 'freeze' as you describe if interrupted frequently.That's correct. But his tasks (except for the last ones) haven't been suspended (interrupted) at all. So this left the high GPU temperature as the most probable cause. The cause could be also: - overcomitted CPU - inadequate PSU (maybe the PSU got too hot too) - overclocked GPU memory - overclocked GPU - overclocked CPU memory - overclocked CPU - SLI cable - burnt PCIe power connector(s) - burnt M/B power connector (the 12V pins) - interfering application which use GPU acceleration (for example: browsers, video playback software, games) So all the usual stuff. Of course it could be something we can't think of, as we don't know his system. | |
ID: 52328 | Rating: 0 | rate: / Reply Quote | |
Recent GPUs tolerate high temperatures much better than the GPUs tolerated it 5-6 years ago, but the physics haven't changed since then. Thermal expansion and contraction stayed the same over the years. The damage it causes is the same. The capacitors are more advanced now than they were 5-6 years ago, but they still prefer lower temperatures.The GPU 0 in your PC is quite hot (85°C=185°F), that may cause workunits to freeze.My personal experience is that this temperature doesn't cause problems. I have been crunching GPUGrid for over a year now (24/7) with a 1070 in a case with airflow problems that crunches GPUGrid WUs at between 82 and 85 degrees. Feel free to check my recent tasks to confirm.I did. Some of your tasks hit 90°C. I might be decreasing the life of the GPU (time will tell),You do. but at least in my case everything is working OK.For now. | |
ID: 52329 | Rating: 0 | rate: / Reply Quote | |
85° is just too hot no matter which way you look at it, there's no margin for error. I would feel very uncomfortable running my GPU's that hot, I'm in the mid 70's now and keep a very close watch on them. | |
ID: 52336 | Rating: 0 | rate: / Reply Quote | |
Not only prone to errors when very hot, but they also run less efficiently. Heated metal has a higher resistivity than colder metal. The GPU will use more power at 85-90C than 60-70C | |
ID: 52338 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : Too many errors!!