SiDock@home September Sailing
Author | Message |
---|---|
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
About 15 or 30 minutes ago one participant (or maybe several participants) flushed ~25,000 results or even more. But this happened 14 hours after the challenge started. The project server is now processing these results successfully, and after a short break (about 15 minutes?) new tasks are already available in the queue. "Have you considered raising the limit a bit as long as the workunits are this small?" We can try to do this if it is really necessary, but not on the first day. When we change the setting from 2 to 3 (for example), the number of tasks received will increase several times over for the first few hours. |
Send message Joined: 24 Oct 20 Posts: 7 Credit: 532,824 RAC: 318 |
"Michael H.W. Weber wrote: 'Please take a look at these guidelines which my team colleague Yoyo has written down.' This guide is about keeping the server responsive, not so much about keeping the hosts utilized." This is right. But without a server, the hosts get nothing! |
Send message Joined: 24 Oct 20 Posts: 7 Credit: 532,824 RAC: 318 |
xii5ku wrote: "Michael H.W. Weber wrote: 'Please take a look at these guidelines which my team colleague Yoyo has written down.' This guide is about keeping the server responsive, not so much about keeping the hosts utilized." The guide addresses this as well: how to reduce the client request frequency. |
Send message Joined: 1 Jan 21 Posts: 9 Credit: 2,775,045 RAC: 5,102 |
xii5ku wrote: "Michael H.W. Weber wrote: 'Please take a look at these guidelines which my team colleague Yoyo has written down.' This guide is about keeping the server responsive, not so much about keeping the hosts utilized." The underlying problem is that the tasks are not sent out efficiently enough. If a client did not have to ask for tasks several times before it receives any, there would be far fewer requests. Of course, as soon as the work supply is shaky, the more enthusiastic participants will take measures to avoid running dry (i.e. forcing work requests as frequently as possible, setting higher buffers, etc.), thereby creating (most of) the server problems your guide is trying to mitigate. |
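The point about repeated work requests can be illustrated with a toy model. This is only a sketch under a loud assumption that does not come from the thread: every scheduler request is treated as an independent trial that returns work with a fixed probability `p_success`. Under that assumption the number of requests per successful fetch follows a geometric distribution with mean `1 / p_success`. The function name and the probabilities are hypothetical.

```python
def expected_requests(p_success: float) -> float:
    """Mean number of scheduler requests until one returns work.

    Assumption (not from the thread): requests are independent and each
    one returns work with the same probability p_success.
    """
    if not 0.0 < p_success <= 1.0:
        raise ValueError("p_success must be in (0, 1]")
    # Mean of a geometric distribution counting trials until first success.
    return 1.0 / p_success


# Illustrative probabilities only; they are not measurements from the project.
for p in (1.0, 0.5, 0.25):
    print(f"p_success={p:.2f}: about {expected_requests(p):.0f} request(s) per fetch")
```

In this simplified picture, a server that hands out work on only one request in four sees roughly four times the request load of one that answers every request with work, before any extra manual update clicks are counted.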
Send message Joined: 23 Oct 20 Posts: 5 Credit: 10,737,832 RAC: 28 |
|
Send message Joined: 24 Oct 20 Posts: 7 Credit: 532,824 RAC: 318 |
I'm sure that this is the reason. The underlying problem is the disk I/O on the server, and this underlying problem is addressed. |
Send message Joined: 22 May 21 Posts: 11 Credit: 3,283,899 RAC: 0 |
There are some more issues. This forum and the whole website are very unresponsive... it takes like 13,21 seconds to load the server status page. Another issue is BoincStats not updating the challenge as it should. We all have 0 credits. Is that error related to the poor server performance? Or is there another problem with the project or BoincStats? EDIT: wow, over 30 seconds to submit this post ^__^ |
Send message Joined: 5 Sep 21 Posts: 4 Credit: 113,431 RAC: 0 |
I get no tasks anymore :( |
Send message Joined: 24 Oct 20 Posts: 7 Credit: 532,824 RAC: 318 |
Yes, this is all related to the poor server performance. The forum most probably runs on the same server as the BOINC services and the DB, and the stats which BoincStats fetches also come from this server. Therefore, again, the most important thing is to stabilize the server. The second point is to make it pleasant for the user regarding WUs in progress, connect interval and so on. So the question is why the server is so slow. This has to be evaluated and improved. My analysis and mitigations are here: https://www.rechenkraft.net/wiki/Benutzer_Diskussion:Yoyo/Boincserver_Tuning I run yoyo@home, which is in the meantime mostly stable and fast also in big races. The server has only 2 cores and 8 GB ram and hard disks. yoyo |
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
It is more interesting than "simple I/O". :) For example, right now: Average: DEV tps rkB/s wkB/s areq-sz aqu-sz await svctm %util Average: dev8-0 192.08 80.64 4349.12 23.06 0.35 1.84 1.34 25.71 Initially we use 1 feeder. After some time (usually after a big flush of tasks) the internal task cache stays empty for a significant time (because computers that report many tasks also request many tasks). This is a good time to switch to 2 feeders, for example. Two feeders run excellently and provide tasks for the computers. But after several hours (about 3 or 5) the sending of tasks stops. The feeders are running and there are no errors, but tasks from the queue are not sent to computers. A simple restart does not change the situation. But if at this moment we switch to a single feeder (and restart the project server processes), the problem is resolved. Tasks are successfully put into the internal task cache and sent to computers. Not as fast as with two feeders, but the "Tasks in progress" metric grows. Maybe we have an interesting interaction between latency and the logic of the BOINC server processes. In any case, next bunches of will be mixed with bunches of Eprot_v1_run_2 tasks. |
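The disk statistics quoted in the post lost their column layout. As a reading aid, the sketch below pairs the header with the values of the single row; the two strings are copied from the post, while the helper name `parse_sar_row` is hypothetical. The output looks like a device report from the sysstat `sar` tool, but that is an assumption, since the post does not name the command.

```python
# Header and averaged row, copied from the post above.
HEADER = "DEV tps rkB/s wkB/s areq-sz aqu-sz await svctm %util"
ROW = "dev8-0 192.08 80.64 4349.12 23.06 0.35 1.84 1.34 25.71"


def parse_sar_row(header: str, row: str) -> tuple[str, dict[str, float]]:
    """Pair whitespace-separated column names with one row of values."""
    names = header.split()
    values = row.split()
    if len(names) != len(values):
        raise ValueError("header and row have different column counts")
    device = values[0]  # first column is the device name, not a number
    metrics = {name: float(value) for name, value in zip(names[1:], values[1:])}
    return device, metrics


device, metrics = parse_sar_row(HEADER, ROW)
print(device)
for name, value in metrics.items():
    print(f"{name:>8}: {value}")
```

Read this way, the row shows writes (`wkB/s`) far above reads (`rkB/s`) and a `%util` of about 26%, which fits the post's remark that the situation is more interesting than plain I/O saturation.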
Send message Joined: 5 Jan 21 Posts: 7 Credit: 21,108,200 RAC: 48,544 |
I can't speak to the intricacies of server configuration, but I can report that the main problem I am having is stuck downloads. These require user intervention with short WUs because further downloads are halted by the timed-out downloads. I propose shortening the timeout interval somewhat. |
Send message Joined: 3 Jan 21 Posts: 24 Credit: 30,966,368 RAC: 105 |
hoarfrost wrote: "In any case, next bunches of will be mixed with bunches of Eprot_v1_run_2 tasks." Thanks! As far as I can tell, everything works smoothly now. The client's estimation of task durations is thrown off now of course, therefore it's good that you have the 2-tasks-in-progress limit, preventing the clients from putting more on their plate than they can chew. :-) A bit off topic: yoyo_rkn wrote: "I run yoyo@home, which is in the meantime mostly stable and fast also in big races. The server has only 2 cores and 8 GB ram and hard disks." That's nice that you can get by with a severely (and in these days, unnecessarily) under-powered server. But the price is drastically reduced functionality.
Furthermore, the boinc server version at yoyo@home seems curiously outdated, but I have no idea if this too is in place because of performance reasons. Given that e.g. results tables in the web interface cannot be filtered, which makes them practically useless, I guess there are performance considerations in play too. |
Send message Joined: 24 Oct 20 Posts: 7 Credit: 532,824 RAC: 318 |
Most of your conclusions and assumptions are wrong. But I will not run a discussion battle here, so I leave this discussion. |
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
corona_Eprot_v1_run_2* workunits (which take about 15 times longer in processor time and write even less data) are now in the queue. We can safely add more and more power! |
Send message Joined: 7 Nov 20 Posts: 8 Credit: 11,148,833 RAC: 0 |
Client request frequency allowance is key. My machines still do not get tasks, so the issue is not fixed - although at least the forum works again (so something has been tweaked or people are already leaving the competition - it is sometimes hard to keep team colleagues contributing when virtually no work is delivered and even the project website does not respond properly anymore). Practical suggestions for corrections have been posted. If you listen to (external?) people who like to divert the discussion into rants about outdated servers of other projects instead of just trying to implement what has been kindly suggested by people who have worked hands-on with BOINC on multiple projects for about 15 years, you will most likely remain stuck with the problems you have at present. It's simply your choice. But remember that this project operates way beyond its capabilities with the current hardware and settings. Michael. President of Rechenkraft.net - This world's first and largest distributed computing organization. We make those things possible that supercomputers don't. |
Send message Joined: 22 May 21 Posts: 11 Credit: 3,283,899 RAC: 0 |
wow, finally it doesn't take 20 seconds to load the forum. Seems like you fixed something ^_^ |
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
"My machines still do not get tasks, so the issue is not fixed - although at least the forum works again (so something has been tweaked or people are already leaving the competition - it is sometimes hard to keep team colleagues contributing when virtually no work is delivered and even the project website does not respond properly anymore)." This looks very strange. Tasks are present in the queue and in the cache, and they are sent freely to many computers. Can you post messages from the event log of the BOINC client? Maybe the hosts try to report too many results in one request? We have lived with recommendations like these since the RakeSearch days (they are good recommendations, but only part of the whole tuning). We started the challenge with small workunits, but 10 hours ago switched to Eprot_v1, which is 15 times larger. From my computer this thread opens in about 1 second... |
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
TeAm AnandTech and [H]ard|OCP have passed SETI.Germany and SETI.USA at the same time. Hardwarers beat the SETIens? :) The leading six right now: 1. Planet 3DNow! 2227253; 2. TeAm AnandTech 1172867; 3. SETI.Germany 1147797; 4. Rechenkraft.net 873543; 5. [H]ard|OCP 702525; 6. SETI.USA 700473; ... |
Send message Joined: 11 Oct 20 Posts: 333 Credit: 25,501,093 RAC: 6,591 |
1 day and 15 hours into the challenge. Planet 3DNow! crunches workunits on a separate planet. Will anyone be able to reach these green crunchers in the remaining days of crunch week? :) TeAm AnandTech sits in second place, but who knows, maybe the techies are building their own starship to conquer the greens? The SETIens from Germany and the USA have no time now to search for the Great Green Crunchers because of hard pressure from [H]ard|OCP and crafty crunching by the rake masters from .net. In the basement of the top 10, Metals and Crystals are finding out who will perform brighter. TOP 6 right now: 1. Planet 3DNow! 3168535; 2. TeAm AnandTech 1811902; 3. SETI.Germany 1453868; 4. Rechenkraft.net 1345601; 5. [H]ard|OCP 1076506; 6. SETI.USA 1067469. Some technical news: after a solid block of ~60,000 Eprot_v1 workunits we are trying a mixed compound: 20000 of 3CLpro_v4 + 6000 of Eprot_v1 + 20000 of 3CLpro_v5 + 6000 of Eprot_v1 + 20000 of 3CLpro_v6 + 8000 of Eprot_v1. |
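The mixed compound above can be tallied with a few lines of arithmetic. The batch sizes are taken from the post. The relative cost factor is an assumption: earlier posts say Eprot_v1 workunits take about 15 times more processor time than the small workunits the challenge started with, and this sketch assumes the 3CLpro workunits are of that small kind. The constant `EPROT_RELATIVE_COST` is a hypothetical name.

```python
# Batch sizes as listed in the post, in queue order.
BATCHES = [
    ("3CLpro_v4", 20000),
    ("Eprot_v1", 6000),
    ("3CLpro_v5", 20000),
    ("Eprot_v1", 6000),
    ("3CLpro_v6", 20000),
    ("Eprot_v1", 8000),
]

# Assumption: one Eprot_v1 workunit costs about 15 "small" workunits of CPU time,
# and 3CLpro workunits are the small kind.
EPROT_RELATIVE_COST = 15

# Count workunits per target family.
counts: dict[str, int] = {}
for name, size in BATCHES:
    family = "Eprot" if name.startswith("Eprot") else "3CLpro"
    counts[family] = counts.get(family, 0) + size

total_workunits = sum(counts.values())
eprot_count_share = counts["Eprot"] / total_workunits

# Weight each family by its assumed relative CPU cost.
cpu_units = {
    "3CLpro": counts["3CLpro"] * 1,
    "Eprot": counts["Eprot"] * EPROT_RELATIVE_COST,
}
eprot_cpu_share = cpu_units["Eprot"] / sum(cpu_units.values())

print(f"workunits: {counts}, total {total_workunits}")
print(f"Eprot share by count: {eprot_count_share:.0%}")
print(f"Eprot share by CPU time (under the 15x assumption): {eprot_cpu_share:.0%}")
```

Under these assumptions the mix holds 80,000 workunits, of which Eprot_v1 is a quarter by count but the large majority by processor time, so the small 3CLpro tasks dominate the request traffic while the Eprot_v1 tasks dominate the computing.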
Send message Joined: 22 May 21 Posts: 11 Credit: 3,283,899 RAC: 0 |
Since the challenge started, the project performance has increased by roughly 30% ^__^ That's really awesome :) And thank you for dealing with all the issues so fast :) |
©2024 SiDock@home Team