Message boards :
Number crunching :
No checkpoints?
Message board moderation
Previous · 1 · 2 · 3 · Next
Author | Message |
---|---|
Send message Joined: 27 Jan 21 Posts: 2 Credit: 62,625 RAC: 0 |
Me too, I quit until chechpoint issure is resolved. |
Send message Joined: 6 Nov 20 Posts: 2 Credit: 2,519,470 RAC: 0 |
Help, Natalia ! Add some check-points.if you can You are our only hope ! |
Send message Joined: 24 Oct 20 Posts: 23 Credit: 9,020 RAC: 0 |
As stated on the forums, they are working on checkpoints. https://www.sidock.si/sidock/forum_thread.php?id=105&postid=776#776 |
Send message Joined: 2 Mar 21 Posts: 5 Credit: 26,113,755 RAC: 27,511 |
This is great project! Please please add checkpoint support, thanks ;-) |
Send message Joined: 2 Mar 21 Posts: 5 Credit: 26,113,755 RAC: 27,511 |
This is great project! Please please add checkpoint support. Thanks ;-) |
Send message Joined: 11 Nov 20 Posts: 47 Credit: 83,493 RAC: 0 |
Dear Boincers, we are working to solve the checkpoint issue, which is critical in the case of long-running docking problems. E-protein has a relatively large binding site and it takes more time to sample configurations/conformations of the ligand within the binding site. The suggestions to split WU into smaller ones seems a good idea, in this intermediate time. To provide better experience and some changes will be done soon: * we will add new server (s) to the project * we will increase diskspace (this is probably the most urgent) * I hope that we will hardcoded checkpointing very soon (very top on our task list) All the best, Crtomir |
Send message Joined: 13 Jan 21 Posts: 76 Credit: 38,846,214 RAC: 0 |
Does SiDock use adaptive replication (Q2R2 changes to Q1R1) or does it always stay as Q2R2 ??? |
Send message Joined: 5 Jan 21 Posts: 7 Credit: 23,392,338 RAC: 66,801 |
There was a power outage this morning, which in my case means I have totally lost hundreds of hours of computing., approximately the work of 220 logical cores over the span of 12 hours... |
Send message Joined: 9 Oct 20 Posts: 185 Credit: 2,782,517 RAC: 50 |
Even with Q2R2, we still need to recompute certain tasks. Quorum = 2 helps a lot. |
Send message Joined: 31 Oct 20 Posts: 32 Credit: 847,529 RAC: 123 |
Approaching the BOINC Pentalon, I am curious as to where you stand on checkpointing. What is the issue holding you up atm to implement this feature? |
Send message Joined: 9 Oct 20 Posts: 185 Credit: 2,782,517 RAC: 50 |
The team needs to implement them in CmDock for all platforms, build and test. It is not a good idea to hurry just before the competition... |
Send message Joined: 31 Oct 20 Posts: 32 Credit: 847,529 RAC: 123 |
I second that thought. Might be wise to hold back on this plan for the while being. But just a litlle longer or else I fear that you might lose volunteers with powerful systems who understandibly get angry with you about loads of kWh being potentially lost. |
Send message Joined: 7 Nov 20 Posts: 8 Credit: 11,239,887 RAC: 6,266 |
The CmDock GitLab repo just reported the appearance of a new version with the desperately awaited checkpointing feature. So, things are well in the works. ;-) Michael. President of Rechenkraft.net - This world's first and largest distributed computing organization. We make those things possible that supercomputers don't. |
Send message Joined: 22 Nov 20 Posts: 10 Credit: 13,167,232 RAC: 127 |
I concur with KPX as all my Win 10 machines decided to do Windows updates and the obligatory reboot cycle at 3am so I lost all the tasks in progress. Please bring in checkpoints and in the time we wait for the application to be upgraded send out smaller tasks (shorter run times).Many take over 6 hours per core. |
Send message Joined: 12 Jan 21 Posts: 13 Credit: 2,513,888 RAC: 0 |
It is not writing checkpoints correctly:(win8.1, latest boinc Version, checkpoint Intervall 600 sec. Application CurieMarieDock on BOINC + zipped input 2.00 Name corona_Eprot_v1_nb3di_203441_1 StateRunning Received 21/05/2021 10:56:34 Report deadline25/05/2021 10:56:37 Estimated computation size 50,000 GFLOPs CPU time 08:45:21 CPU time since checkpoint 06:24:52 Elapsed time08:47:10 Estimated time remaining 07:34:02 Fraction done 53.727% Virtual memory size 148.43 MB Working set size 149.75 MB Directory slots/1 Process ID 3924 Progress rate 6.120% per hour Executable cmdock-boinc-zip_wrapper_2.0_windows_x86_64.exe |
Send message Joined: 11 Oct 20 Posts: 338 Credit: 25,686,198 RAC: 9,023 |
Hello! Try to check file docking_out.chk in appropriate slot directory. If you see a number (between 1 and 500) and name of ligand inside file, checkpoints - are created. :) |
Send message Joined: 24 Oct 20 Posts: 19 Credit: 11,101,112 RAC: 52,416 |
With windows7 Boinc dont shows the checkpoints correct: Projekt SiDock@home Name corona_Eprot_v1_nb3di_206123_2_1 Anwendung CurieMarieDock on BOINC + zipped input 2.00 Arbeitspaketname corona_Eprot_v1_nb3di_206123_2 Status Aktiv Erhalten 23.05.2021 00:10:17 Deadline 28.05.2021 00:10:00 Geschätzte Anwendungsgeschwindigkeit 1,63 GFLOPs/sec Geschätzte Arbeitspaketgröße 50.000 GFLOPs CPU-Zeit beim letzten Checkpoint 00:09:39 CPU-Zeit 09:00:41 Vergangene Zeit 08:31:45 Geschätzte verbleibende Zeit 04:58:35 Fortschritt 63,153% Größe des virtuellen Speichers 147,75 MB Größe des Arbeitspakets im Speicher 151,31 MB Verzeichnis slots/2 Prozess-ID 2765056 The docking _log and the docking_out.chk shows 462 and the same ligand name. On my ARMs with Linux BoincTasks shows the checkpoints correct. |
Send message Joined: 12 Jan 21 Posts: 13 Credit: 2,513,888 RAC: 0 |
Thanks. The first number is counting up.(xxx ZINCyyyyyyyyyyyyyy) However, the checkpoints appear to be written every 2 min, ignoring the boinc setting. Inerestingly the change date of the file you mentioned is not updated, even thoug the contense of it is changing. |
Send message Joined: 6 Feb 21 Posts: 2 Credit: 5,083,832 RAC: 0 |
Congratulations on adding checkpoints! Although not all seems to be perfect yet, this is a great relieve for crunshers like me who don't have their machines up and running 24/7. From now on I will dedicate them more towards your project. |
Send message Joined: 24 Oct 20 Posts: 19 Credit: 458,162 RAC: 0 |
I still don't see any use of checkpoints on Windows PC's. BOINC always shows ZERO and time since checkpoints as run time. |
©2024 SiDock@home Team