Message boards : Graphics cards (GPUs) : Memory leak in the 6.54_x86_64 for Linux?
Author | Message |
---|---|
My Linux box has a problem with the 6.54_x86_64. 4 GB RAM is not enough, other WUs are waiting for memory. The 6.54_x86_64 use all of my RAM and I found only 35 MB free. Never seen this problem with the 6.53. After a update from 6.4.2 to 6.4.5 (I still miss the 6.5.0 for Linux 64 bit) the failure is yet not present, I'm still waiting and have a eye on it. On Vista 64 the application 6.55 is only using 35 MB. | |
ID: 4592 | Rating: 0 | rate: / Reply Quote | |
Same things for me on GTX280 and 8800GTS512 graphic cards. | |
ID: 4606 | Rating: 0 | rate: / Reply Quote | |
My WU with 6.54 has grown, in approx 1 hour, from 50MB to 180MB and it keeps growing | |
ID: 4657 | Rating: 0 | rate: / Reply Quote | |
Try to use 6.5.0. | |
ID: 4659 | Rating: 0 | rate: / Reply Quote | |
They released the 6.5.0 for Linux as 32bit, unfortunately there is no 64bit build... root@frickelbude:~# while true; do ps aux | grep acemd |grep -v grep; sleep 60; done boinc 23292 8.2 2.8 81544 58488 ? RNLl 12:04 0:19 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23292 8.2 3.1 87168 64180 ? RNLl 12:04 0:24 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23292 8.2 3.4 93400 70300 ? SNLl 12:04 0:29 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23292 8.2 3.6 98600 75576 ? SNLl 12:04 0:34 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 ^C root@frickelbude:~# invoke-rc.d boinc-client restart * Stopping BOINC core client: boinc ...done. * Starting BOINC core client: boinc ...done. * Setting up scheduling for BOINC core client and children: ...done. root@frickelbude:~# while true; do ps aux | grep acemd |grep -v grep; sleep 60; done boinc 23802 42.0 1.8 61272 38172 ? SNLl 12:11 0:00 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.6 2.0 64480 41464 ? SNLl 12:11 0:05 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.4 2.2 70224 47152 ? SNLl 12:11 0:10 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.3 2.5 75828 52840 ? SNLl 12:11 0:15 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.3 2.8 81564 58536 ? SNLl 12:11 0:20 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 3.1 87208 64228 ? SNLl 12:11 0:24 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 3.4 93404 70304 ? SNLl 12:11 0:29 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 3.6 98616 75608 ? RNLl 12:11 0:34 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 3.9 104368 81304 ? RNLl 12:11 0:39 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 4.2 110000 87004 ? RNLl 12:11 0:44 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 23802 8.2 4.5 115744 92696 ? RNLl 12:11 0:49 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 System is the following: http://www.sysprofile.de/id84658 | |
ID: 4661 | Rating: 0 | rate: / Reply Quote | |
Which WU is that for? | |
ID: 4666 | Rating: 0 | rate: / Reply Quote | |
mC16040-SH2_US-5-40-SH2_US1720000_0 | |
ID: 4671 | Rating: 0 | rate: / Reply Quote | |
Look at this type of WU, host concerned and the other. | |
ID: 4682 | Rating: 0 | rate: / Reply Quote | |
I've got the same problem :( | |
ID: 4684 | Rating: 0 | rate: / Reply Quote | |
I have two units on my dualcore, which are mC16040-SH2_US-5-40-SH2_US1720000 and ME12403-SH2_US-4-40-SH2_US950000. | |
ID: 4686 | Rating: 0 | rate: / Reply Quote | |
Please move to 6.4.5 and see if the problem disapper as reported by the first post. | |
ID: 4687 | Rating: 0 | rate: / Reply Quote | |
I have the memory leak too. | |
ID: 4688 | Rating: 0 | rate: / Reply Quote | |
I upgrade to 6.4.5 now, the remaining task stays in "waiting for memory" mode and again I got: So 21 Dez 2008 20:48:57 CET|GPUGRID|Message from server: No work sent So 21 Dez 2008 20:48:57 CET|GPUGRID|Message from server: Full-atom molecular dynamics for Cell processor is not available for your type of computer. So 21 Dez 2008 20:48:57 CET|GPUGRID|Message from server: Full-atom molecular dynamics on Cell processor is not available for your type of computer. My slot 14 which holds the stalled 77% WU is just 19MB... edit: I'm going nuts, without any help from my side the work unit continued crunching now. I wonder how, as there can hardly be more memory available than 30 minutes ago, when I restarted my system and the WU wouldn't start. But again the app eats up my memory: root@frickelbude:/var/lib/boinc-client/slots# while true; do ps aux | grep acemd | grep -v grep; sleep 60;done boinc 12414 9.1 2.9 83496 60488 ? RNLl 21:10 0:24 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 12414 8.8 3.2 89096 66128 ? RNLl 21:10 0:29 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 boinc 12414 8.7 3.4 94748 71764 ? RNLl 21:10 0:34 acemd_6.54_x86_64-pc-linux-gnu__cuda --device 0 root@frickelbude:/var/lib/boinc-client/slots# ps aux | head -1 USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND | |
ID: 4689 | Rating: 0 | rate: / Reply Quote | |
...after a few minutes the files in the slot are growing very slow, but the application is growing with the same speed from the beginning. | |
ID: 4690 | Rating: 0 | rate: / Reply Quote | |
This morning my Quad was crunching a SH2_USPME-5 workunit (pN16075-SH2_USPME-5-40-SH2_USPME470000) and the memory usage was again increasing. I stopped it and started a GPUTEST6 unit (lY10341-GPUTEST6-1-20-acemd_0), the memory usage is stable, not a single Kbyte more after some minutes... Now I stopped the GPUTEST6 and started one SH2_US (to20339-SH2_US_1-5-40-SH2_US_1240000_0), immediately the memory usage starts growing... | |
ID: 4724 | Rating: 0 | rate: / Reply Quote | |
So we can say for certain that the linux GPU client or driver has a problem with certain WUs, probably independent of the BOINC client? | |
ID: 4737 | Rating: 0 | rate: / Reply Quote | |
I had already upgraded to 6.4.5 when I posted in this thread about the memory leak. | |
ID: 4740 | Rating: 0 | rate: / Reply Quote | |
Another WU with memory exceeded error : | |
ID: 4754 | Rating: 0 | rate: / Reply Quote | |
I found now an acemd_6.57_x86_64-pc-linux-gnu__cuda process running, unfortunately memory usage is still growing over the time... CPU usage of that app is ~40% of one core, on a C2D 3,4GHz... Now Windows and Linux change roles? ;-) | |
ID: 4777 | Rating: 0 | rate: / Reply Quote | |
I'm running the 6.57 on a GPUTEST6 right now, no problems. Load is at ~8% (normal on my system) and memory is stable. | |
ID: 4794 | Rating: 0 | rate: / Reply Quote | |
I'm crunching a WU that uses up to 15% of my CPU and has already eaten over 1GB RAM... | |
ID: 4819 | Rating: 0 | rate: / Reply Quote | |
We reproduced the problem and implementing a fix. | |
ID: 4823 | Rating: 0 | rate: / Reply Quote | |
We reproduced the problem and implementing a fix. I changed to 6.4.5 this morning and it "leaks" the same as 6.4.2. It take about 4 hours to devours all the memory and takes up all the swap. When it starts to write to the hard drive constantly, I re-run benchmarks and it releases the memory and most of the swap. The wu's are 6.57. It makes no difference if I run another project at the same time. I am running current drivers[177.82] for my card, XFX 9600GSO. Ubuntu 8.04 on a bone stock X2 3800. Is there something else that I can do or is this a problem beyond my control?? mike | |
ID: 4843 | Rating: 0 | rate: / Reply Quote | |
We reproduced the problem and implementing a fix. You just have to wait until the fix is issued. So, yes, beyond your control for the moment. | |
ID: 4844 | Rating: 0 | rate: / Reply Quote | |
Is there something else that I can do or is this a problem beyond my control?? While waiting for a fix could you schedule a task to run boinccmd, say, every three hours? I don't know Linux but in Windows the .bat file would be: e: cd e:\boinc boinccmd --run_benchmarks You'd need to change the first two lines to point to where boinccmd lives on your system. Runing this every three hours means you'd lose 8 minutes processing GPU and your other project(s) every day but you could lose a lot more if it crashes overnight. Phoneman1 | |
ID: 4853 | Rating: 0 | rate: / Reply Quote | |
I have a 6.58 WU in queue, so I guess the new version adresses this specific problem. I'll keep you updated about the results. | |
ID: 4862 | Rating: 0 | rate: / Reply Quote | |
I have a 6.58 WU in queue, so I guess the new version adresses this specific problem. I'll keep you updated about the results. Well, now I am jealous ... I only have the old 6.55 ... sniff, sniff ... :) | |
ID: 4876 | Rating: 0 | rate: / Reply Quote | |
6.58 for Linux fixed the memory leak. | |
ID: 4880 | Rating: 0 | rate: / Reply Quote | |
So far it seems to be working. Good work ;) | |
ID: 4882 | Rating: 0 | rate: / Reply Quote | |
Message boards : Graphics cards (GPUs) : Memory leak in the 6.54_x86_64 for Linux?