Message boards : Number crunching : This computer has finished a daily quota of 31 tasks
Author | Message |
---|---|
I seem to be one of the few people who can run the QC jobs without problems. But I have 32 GB memory, and 180 GB free on my SSD. | |
ID: 50345 | Rating: 0 | rate: / Reply Quote | |
There's something a bit strange about that statement. The current state of play is Quantum Chemistry 3.30 x86_64-pc-linux-gnu (mt) (from Application details for host 483848) Ah - three of your tasks errored out this morning. Look at task 18604263. Apart from ==> WARNING: A newer version of conda exists. <== the task actually failed because of <message> upload failure: <file_xfer_error> <file_name>6955_1_15_16_18_dd130713_n00001-SDOERR_SELE2-0-1-RND2528_0_1</file_name> <error_code>-131 (file size too big)</error_code> </file_xfer_error> </message> That's a job creation error by the project, outside your control. But it will have reset the daily quota to 31, and by then you were already way off into the distance. This is a good safety measure in the BOINC code. When you hit the bad batch of WUs, you were prevented from wasting bandwidth by downloading further tasks which were doomed to fail. That gives the project team 24 hours to fix the problem: if the tasks available tomorrow morning have been fixed, or otherwise come from a batch without this error, your daily quota will start rising with every task you return, and you won't hit any limit until things go wrong again. | |
ID: 50346 | Rating: 0 | rate: / Reply Quote | |
That's a job creation error by the project, outside your control. But it will have reset the daily quota to 31, and by then you were already way off into the distance. That makes sense. Thanks for looking into it. I will wait it out, though maybe with a backup project. | |
ID: 50350 | Rating: 0 | rate: / Reply Quote | |
Two issues at play here: the limit on tasks, being hit because too fast completions. This we should probably raise for CPU jobs. | |
ID: 50401 | Rating: 0 | rate: / Reply Quote | |
Then there is file size too big, which is somewhat surprising. Another cruncher completed it, so does not seem to be a WU issue. Possibly restart-related (a large leftover temporary file maybe). I will detach and re-attach. That should clean it out. | |
ID: 50402 | Rating: 0 | rate: / Reply Quote | |
Or just reset. I don't think it would occur in more than one wus though (or did it?). | |
ID: 50403 | Rating: 0 | rate: / Reply Quote | |
Or just reset. I don't think it would occur in more than one wus though (or did it?). There were three visible at the time I first responded to the issue, I think with very similar reporting times: I did say "three of your tasks errored out this morning", though I didn't look for or comment on any other similarities. One thing I did notice was that the errored tasks had run for approximately double the time of all Jim's other (successful) tasks. They're still visible on page 2 of error tasks for host 483848, sent around 09:30 on 30 August and reported a couple of hours later. | |
ID: 50407 | Rating: 0 | rate: / Reply Quote | |
Message boards : Number crunching : This computer has finished a daily quota of 31 tasks