
Server issue?

Discussion in 'bit-tech Folding Team' started by Keith_Whi, 1 Aug 2009.

  1. Keith_Whi

    Keith_Whi What's a Dremel?

    Joined:
    6 May 2009
    Posts:
    228
    Likes Received:
    3
    Been having a little trouble over the past 24 hours with my VM SMP clients.

    When they finish a unit they struggle to return the results and receive a new work unit. The server involved is 171.64.65.56 (for receiving).

    I have found that restarting the VM enables it to receive a new WU, and away it goes on its merry way. Not sure that I am going to be credited for them though. :sigh:

    Looking on the FAH forums I'm not alone in this either and some say that the number of units available is low!?!

    Anyone else finding this? Why would a restart allow a WU to be downloaded when the client had previously tried up to 19 times to receive one? Will I be credited, do you think?

    Any ideas? :confused:
     
    Last edited: 1 Aug 2009
  2. Christopher N. Lew

    Christopher N. Lew Folding in memory of my father

    Joined:
    23 Apr 2009
    Posts:
    1,358
    Likes Received:
    46
    From the latest postings in the Stanford forum, it appears to be a strange problem with the server. Are you getting 503 errors - Could not connect to server?

    Restarting the client often gets a new WU in this sort of situation. One of those strange things - it happens, but at first sight it shouldn't. Perhaps a client that is just starting somehow appears different to the server?

    Are your completed workunits actually being received? Do you get the 'Thank you for contributing' message - if so all is well and you should get the credits once everything is running smoothly.

    Your completed workunits may be piling up in your own queue - have a look under Status/Queue Info. If so then your machine will try to upload them all, at intervals. I believe your queue has ten slots, and I think that if they are all full, then the client will not download another WU until some space has been created.

    HTH
     
  3. Keith_Whi

    Keith_Whi What's a Dremel?

    Joined:
    6 May 2009
    Posts:
    228
    Likes Received:
    3
    Yes, I am getting the 503 error.

    But it would appear that my WUs are not being returned.

    Here is a recent example.

    :09] + 757760 bytes downloaded
    [21:20:09] + 768000 bytes downloaded
    [21:20:09] + 778240 bytes downloaded
    [21:20:10] + 788480 bytes downloaded
    [21:20:10] + 798720 bytes downloaded
    [21:20:10] + 808960 bytes downloaded
    [21:20:10] + 819200 bytes downloaded
    [21:20:10] + 829440 bytes downloaded
    [21:20:10] + 839680 bytes downloaded
    [21:20:10] + 849920 bytes downloaded
    [21:20:10] + 860160 bytes downloaded
    [21:20:10] + 870400 bytes downloaded
    [21:20:10] + 880640 bytes downloaded
    [21:20:10] + 890880 bytes downloaded
    [21:20:10] + 901120 bytes downloaded
    [21:20:10] + 911360 bytes downloaded
    [21:20:11] + 921600 bytes downloaded
    [21:20:11] + 931840 bytes downloaded
    [21:20:11] + 942080 bytes downloaded
    [21:20:11] + 952320 bytes downloaded
    [21:20:11] + 962560 bytes downloaded
    [21:20:11] + 972800 bytes downloaded
    [21:20:11] + 983040 bytes downloaded
    [21:20:11] + 993280 bytes downloaded
    [21:20:11] + 1003520 bytes downloaded
    [21:20:11] + 1013760 bytes downloaded
    [21:20:11] + 1024000 bytes downloaded
    [21:20:11] + 1034240 bytes downloaded
    [21:20:11] + 1044480 bytes downloaded
    [21:20:11] + 1054720 bytes downloaded
    [21:20:11] + 1064960 bytes downloaded
    [21:20:11] + 1075200 bytes downloaded
    [21:20:11] + 1085440 bytes downloaded
    [21:20:11] + 1095680 bytes downloaded
    [21:20:11] + 1105920 bytes downloaded
    [21:20:11] + 1116160 bytes downloaded
    [21:20:11] + 1126400 bytes downloaded
    [21:20:11] + 1136640 bytes downloaded
    [21:20:11] + 1146880 bytes downloaded
    [21:20:11] + 1157120 bytes downloaded
    [21:20:11] + 1167360 bytes downloaded
    [21:20:11] + 1177600 bytes downloaded
    [21:20:11] + 1187840 bytes downloaded
    [21:20:11] + 1198080 bytes downloaded
    [21:20:11] + 1208320 bytes downloaded
    [21:20:11] + 1218560 bytes downloaded
    [21:20:11] + 1228800 bytes downloaded
    [21:20:11] + 1239040 bytes downloaded
    [21:20:11] + 1249280 bytes downloaded
    [21:20:11] + 1259520 bytes downloaded
    [21:20:12] + 1269760 bytes downloaded
    [21:20:12] + 1280000 bytes downloaded
    [21:20:12] + 1290240 bytes downloaded
    [21:20:12] + 1300480 bytes downloaded
    [21:20:12] + 1310720 bytes downloaded
    [21:20:12] + 1320960 bytes downloaded
    [21:20:12] + 1331200 bytes downloaded
    [21:20:12] + 1341440 bytes downloaded
    [21:20:12] + 1351680 bytes downloaded
    [21:20:12] + 1361920 bytes downloaded
    [21:20:12] + 1372160 bytes downloaded
    [21:20:12] + 1382400 bytes downloaded
    [21:20:12] + 1392640 bytes downloaded
    [21:20:12] + 1402880 bytes downloaded
    [21:20:12] + 1413120 bytes downloaded
    [21:20:12] + 1423360 bytes downloaded
    [21:20:12] + 1433600 bytes downloaded
    [21:20:12] + 1443840 bytes downloaded
    [21:20:12] + 1454080 bytes downloaded
    [21:20:12] + 1464320 bytes downloaded
    [21:20:12] + 1474560 bytes downloaded
    [21:20:12] + 1484800 bytes downloaded
    [21:20:12] + 1495040 bytes downloaded
    [21:20:12] + 1505280 bytes downloaded
    [21:20:13] + 1515520 bytes downloaded
    [21:20:13] + 1525760 bytes downloaded
    [21:20:13] + 1536000 bytes downloaded
    [21:20:13] + 1546240 bytes downloaded
    [21:20:13] + 1556480 bytes downloaded
    [21:20:13] + 1566720 bytes downloaded
    [21:20:13] + 1576960 bytes downloaded
    [21:20:13] + 1587200 bytes downloaded
    [21:20:13] + 1597440 bytes downloaded
    [21:20:13] + 1607680 bytes downloaded
    [21:20:13] + 1617920 bytes downloaded
    [21:20:13] + 1628160 bytes downloaded
    [21:20:13] + 1638400 bytes downloaded
    [21:20:13] + 1648640 bytes downloaded
    [21:20:13] + 1658880 bytes downloaded
    [21:20:13] + 1669120 bytes downloaded
    [21:20:13] + 1679360 bytes downloaded
    [21:20:13] + 1689600 bytes downloaded
    [21:20:13] + 1699840 bytes downloaded
    [21:20:13] + 1710080 bytes downloaded
    [21:20:13] + 1720320 bytes downloaded
    [21:20:13] + 1730560 bytes downloaded
    [21:20:13] + 1740800 bytes downloaded
    [21:20:14] + 1751040 bytes downloaded
    [21:20:14] + 1761280 bytes downloaded
    [21:20:14] + 1771520 bytes downloaded
    [21:20:14] + 1781760 bytes downloaded
    [21:20:14] + 1785668 bytes downloaded
    [21:20:14] Verifying core Core_a2.fah...
    [21:20:14] Signature is VALID
    [21:20:14]
    [21:20:14] Trying to unzip core FahCore_a2.exe
    [21:20:14] Decompressed FahCore_a2.exe (4382312 bytes) successfully
    [21:20:14] + Core successfully engaged
    [21:20:19]
    [21:20:19] + Processing work unit
    [21:20:19] At least 4 processors must be requested.Core required: FahCore_a2.exe
    [21:20:19] Core found.
    [21:20:19] Working on Unit 04 [July 31 21:20:19]
    [21:20:19] + Working ...
    [21:20:19]
    [21:20:19] *------------------------------*
    [21:20:19] Folding@Home Gromacs SMP Core
    [21:20:19] Version 2.08 (Mon May 18 14:47:42 PDT 2009)
    [21:20:19]
    [21:20:19] Preparing to commence simulation
    [21:20:19] - Ensuring status. Please wait.
    [21:20:29] - Assembly optimizations manually forced on.
    [21:20:29] - Not checking prior termination.
    [21:20:31] - Expanded 4842153 -> 24001453 (decompressed 495.6 percent)
    [21:20:31] Called DecompressByteArray: compressed_data_size=4842153 data_size=24001453, decompressed_data_size=24001453 diff=0
    [21:20:32] - Digital signature verified
    [21:20:32]
    [21:20:32] Project: 2675 (Run 0, Clone 171, Gen 113)
    [21:20:32]
    [21:20:32] Assembly optimizations on if available.
    [21:20:32] Entering M.D.
    [21:20:39] Multi-core optimizations on
    [21:20:42] Completed 0 out of 250000 steps (0%)
    [21:33:05] Completed 2500 out of 250000 steps (1%)
    [21:45:30] Completed 5000 out of 250000 steps (2%)
    [21:57:55] Completed 7500 out of 250000 steps (3%)
    [22:10:19] Completed 10000 out of 250000 steps (4%)
    [22:23:14] Completed 12500 out of 250000 steps (5%)
    [22:35:49] Completed 15000 out of 250000 steps (6%)
    [22:48:14] Completed 17500 out of 250000 steps (7%)
    [23:00:38] Completed 20000 out of 250000 steps (8%)
    [23:13:04] Completed 22500 out of 250000 steps (9%)
    [23:25:28] Completed 25000 out of 250000 steps (10%)
    [23:37:51] Completed 27500 out of 250000 steps (11%)
    [23:50:17] Completed 30000 out of 250000 steps (12%)
    [00:02:42] Completed 32500 out of 250000 steps (13%)
    [00:15:12] Completed 35000 out of 250000 steps (14%)
    [00:27:38] Completed 37500 out of 250000 steps (15%)
    [00:40:05] Completed 40000 out of 250000 steps (16%)
    [00:52:32] Completed 42500 out of 250000 steps (17%)
    [01:04:58] Completed 45000 out of 250000 steps (18%)
    [01:17:25] Completed 47500 out of 250000 steps (19%)
    [01:29:53] Completed 50000 out of 250000 steps (20%)
    [01:42:20] Completed 52500 out of 250000 steps (21%)
    [01:54:47] Completed 55000 out of 250000 steps (22%)
    [02:07:15] Completed 57500 out of 250000 steps (23%)
    [02:19:40] Completed 60000 out of 250000 steps (24%)
    [02:32:08] Completed 62500 out of 250000 steps (25%)
    [02:43:59] Completed 65000 out of 250000 steps (26%)
    [02:55:53] Completed 67500 out of 250000 steps (27%)
    [03:07:49] Completed 70000 out of 250000 steps (28%)
    [03:20:17] Completed 72500 out of 250000 steps (29%)
    [03:32:46] Completed 75000 out of 250000 steps (30%)
    [03:45:14] Completed 77500 out of 250000 steps (31%)
    [03:57:41] Completed 80000 out of 250000 steps (32%)
    [04:10:11] Completed 82500 out of 250000 steps (33%)
    [04:22:39] Completed 85000 out of 250000 steps (34%)
    [04:35:07] Completed 87500 out of 250000 steps (35%)
    [04:47:36] Completed 90000 out of 250000 steps (36%)
    [05:00:07] Completed 92500 out of 250000 steps (37%)
    [05:12:37] Completed 95000 out of 250000 steps (38%)
    [05:25:08] Completed 97500 out of 250000 steps (39%)
    [05:37:38] Completed 100000 out of 250000 steps (40%)
    [05:50:08] Completed 102500 out of 250000 steps (41%)
    [06:02:36] Completed 105000 out of 250000 steps (42%)
    [06:15:05] Completed 107500 out of 250000 steps (43%)
    [06:27:38] Completed 110000 out of 250000 steps (44%)
    [06:40:10] Completed 112500 out of 250000 steps (45%)
    [06:52:39] Completed 115000 out of 250000 steps (46%)
    [07:05:07] Completed 117500 out of 250000 steps (47%)
    [07:17:39] Completed 120000 out of 250000 steps (48%)
    [07:30:09] Completed 122500 out of 250000 steps (49%)
    [07:42:40] Completed 125000 out of 250000 steps (50%)
    [07:55:11] Completed 127500 out of 250000 steps (51%)
    [08:07:38] Completed 130000 out of 250000 steps (52%)
    [08:20:07] Completed 132500 out of 250000 steps (53%)
    [08:32:40] Completed 135000 out of 250000 steps (54%)
    [08:45:12] Completed 137500 out of 250000 steps (55%)
    [08:57:41] Completed 140000 out of 250000 steps (56%)
    [09:10:10] Completed 142500 out of 250000 steps (57%)
    [09:22:39] Completed 145000 out of 250000 steps (58%)
    [09:35:06] Completed 147500 out of 250000 steps (59%)
    [09:47:36] Completed 150000 out of 250000 steps (60%)
    [10:00:05] Completed 152500 out of 250000 steps (61%)
    [10:12:34] Completed 155000 out of 250000 steps (62%)
    [10:25:06] Completed 157500 out of 250000 steps (63%)
    [10:37:36] Completed 160000 out of 250000 steps (64%)
    [10:50:05] Completed 162500 out of 250000 steps (65%)
    [11:02:36] Completed 165000 out of 250000 steps (66%)
    [11:15:06] Completed 167500 out of 250000 steps (67%)
    [11:27:38] Completed 170000 out of 250000 steps (68%)
    [11:40:09] Completed 172500 out of 250000 steps (69%)
    [11:52:41] Completed 175000 out of 250000 steps (70%)
    [12:05:12] Completed 177500 out of 250000 steps (71%)
    [12:17:43] Completed 180000 out of 250000 steps (72%)
    [12:30:12] Completed 182500 out of 250000 steps (73%)
    [12:42:42] Completed 185000 out of 250000 steps (74%)
    [12:55:10] Completed 187500 out of 250000 steps (75%)
    [13:07:37] Completed 190000 out of 250000 steps (76%)
    [13:20:02] Completed 192500 out of 250000 steps (77%)
    [13:32:32] Completed 195000 out of 250000 steps (78%)
    [13:45:04] Completed 197500 out of 250000 steps (79%)
    [13:57:35] Completed 200000 out of 250000 steps (80%)
    [14:10:04] Completed 202500 out of 250000 steps (81%)
    [14:22:34] Completed 205000 out of 250000 steps (82%)
    [14:35:01] Completed 207500 out of 250000 steps (83%)
    [14:47:30] Completed 210000 out of 250000 steps (84%)
    [15:00:01] Completed 212500 out of 250000 steps (85%)
    [15:12:32] Completed 215000 out of 250000 steps (86%)
    [15:25:00] Completed 217500 out of 250000 steps (87%)
    [15:37:33] Completed 220000 out of 250000 steps (88%)
    [15:50:09] Completed 222500 out of 250000 steps (89%)
    [16:02:40] Completed 225000 out of 250000 steps (90%)
    [16:15:11] Completed 227500 out of 250000 steps (91%)
    [16:27:42] Completed 230000 out of 250000 steps (92%)
    [16:40:14] Completed 232500 out of 250000 steps (93%)
    [16:52:44] Completed 235000 out of 250000 steps (94%)
    [17:05:13] Completed 237500 out of 250000 steps (95%)
    [17:17:44] Completed 240000 out of 250000 steps (96%)
    [17:30:16] Completed 242500 out of 250000 steps (97%)
    [17:42:47] Completed 245000 out of 250000 steps (98%)
    [17:55:19] Completed 247500 out of 250000 steps (99%)
    [18:07:50] Completed 250000 out of 250000 steps (100%)
    [18:07:52] DynamicWrapper: Finished Work Unit: sleep=10000
    [18:08:02]
    [18:08:02] Finished Work Unit:
    [18:08:02] - Reading up to 21142368 from "work/wudata_04.trr": Read 21142368
    [18:08:02] trr file hash check passed.
    [18:08:02] - Reading up to 4509204 from "work/wudata_04.xtc": Read 4509204
    [18:08:02] xtc file hash check passed.
    [18:08:02] edr file hash check passed.
    [18:08:02] logfile size: 181238
    [18:08:02] Leaving Run
    [18:08:07] - Writing 25977562 bytes of core data to disk...
    [18:08:07] ... Done.
    [18:08:08] - Shutting down core
    [18:08:08]
    [18:08:08] Folding@home Core Shutdown: FINISHED_UNIT
    [18:11:23] CoreStatus = 64 (100)
    [18:11:23] Sending work to server


    [18:11:23] + Attempting to send results
    [18:11:24] - Couldn't send HTTP request to server
    [18:11:24] + Could not connect to Work Server (results)
    [18:11:24] (171.64.65.56:8080)
    [18:11:24] - Error: Could not transmit unit 04 (completed August 1) to work server.
    [18:11:24] Keeping unit 04 in queue.


    [18:11:24] + Attempting to send results
    [18:11:25] - Couldn't send HTTP request to server
    [18:11:25] + Could not connect to Work Server (results)
    [18:11:25] (171.64.65.56:8080)
    [18:11:25] - Error: Could not transmit unit 04 (completed August 1) to work server.


    [18:11:25] + Attempting to send results
    [18:11:25] - Couldn't send HTTP request to server
    [18:11:25] + Could not connect to Work Server (results)
    [18:11:25] (171.67.108.25:8080)
    [18:11:25] Could not transmit unit 04 to Collection server; keeping in queue.
    [18:11:25] - Preparing to get new work unit...
    [18:11:25] + Attempting to get work packet
    [18:11:25] - Connecting to assignment server
    [18:11:26] - Successful: assigned to (171.64.65.56).
    [18:11:26] + News From Folding@Home: Welcome to Folding@Home
    [18:11:26] Loaded queue successfully.
    [18:11:26] - Couldn't send HTTP request to server
    [18:11:26] (Got status 503)
    [18:11:26] + Could not connect to Work Server
    [18:11:26] - Attempt #1 to get work failed, and no other work to do.
    Waiting before retry.
    [18:11:41] + Attempting to get work packet
    [18:11:41] - Connecting to assignment server
    [18:11:41] - Successful: assigned to (171.64.65.56).
    [18:11:41] + News From Folding@Home: Welcome to Folding@Home
    [18:11:41] Loaded queue successfully.
    [18:11:42] - Couldn't send HTTP request to server
    [18:11:42] (Got status 503)
    [18:11:42] + Could not connect to Work Server
    [18:11:42] - Attempt #2 to get work failed, and no other work to do.
    Waiting before retry.
    [18:11:56] + Attempting to get work packet
    [18:11:56] - Connecting to assignment server
    [18:11:57] - Successful: assigned to (171.64.65.56).
    [18:11:57] + News From Folding@Home: Welcome to Folding@Home
    [18:11:57] Loaded queue successfully.
    [18:11:58] - Couldn't send HTTP request to server
    [18:11:58] (Got status 503)
    [18:11:58] + Could not connect to Work Server
    [18:11:58] - Attempt #3 to get work failed, and no other work to do.
    Waiting before retry.
    [18:12:19] + Attempting to get work packet
    [18:12:19] - Connecting to assignment server
    [18:12:20] - Successful: assigned to (171.64.65.56).
    [18:12:20] + News From Folding@Home: Welcome to Folding@Home
    [18:12:20] Loaded queue successfully.
    [18:12:20] - Couldn't send HTTP request to server
    [18:12:20] (Got status 503)
    [18:12:20] + Could not connect to Work Server
    [18:12:20] - Attempt #4 to get work failed, and no other work to do.
    Waiting before retry.
    [18:13:02] + Attempting to get work packet
    [18:13:02] - Connecting to assignment server
    [18:13:03] - Successful: assigned to (171.64.65.56).
    [18:13:03] + News From Folding@Home: Welcome to Folding@Home
    [18:13:03] Loaded queue successfully.
    [18:13:04] - Couldn't send HTTP request to server
    [18:13:04] (Got status 503)
    [18:13:04] + Could not connect to Work Server
    [18:13:04] - Attempt #5 to get work failed, and no other work to do.
    Waiting before retry.
    [18:14:27] + Attempting to get work packet
    [18:14:27] - Connecting to assignment server
    [18:14:27] - Successful: assigned to (171.64.65.56).
    [18:14:27] + News From Folding@Home: Welcome to Folding@Home
    [18:14:28] Loaded queue successfully.
    [18:14:28] - Couldn't send HTTP request to server
    [18:14:28] (Got status 503)
    [18:14:28] + Could not connect to Work Server
    [18:14:28] - Attempt #6 to get work failed, and no other work to do.
    Waiting before retry.
    [18:17:15] + Attempting to get work packet
    [18:17:15] - Connecting to assignment server
    [18:17:15] - Successful: assigned to (171.64.65.56).
    [18:17:15] + News From Folding@Home: Welcome to Folding@Home
    [18:17:16] Loaded queue successfully.
    [18:17:16] - Couldn't send HTTP request to server
    [18:17:16] (Got status 503)
    [18:17:16] + Could not connect to Work Server
    [18:17:16] - Attempt #7 to get work failed, and no other work to do.
    Waiting before retry.
    [18:22:47] + Attempting to get work packet
    [18:22:47] - Connecting to assignment server
    [18:22:48] - Successful: assigned to (171.64.65.56).
    [18:22:48] + News From Folding@Home: Welcome to Folding@Home
    [18:22:48] Loaded queue successfully.
    [18:22:48] - Couldn't send HTTP request to server
    [18:22:48] (Got status 503)
    [18:22:48] + Could not connect to Work Server
    [18:22:48] - Attempt #8 to get work failed, and no other work to do.
    Waiting before retry.
    [18:33:29] + Attempting to get work packet
    [18:33:29] - Connecting to assignment server
    [18:33:30] - Successful: assigned to (171.64.65.56).
    [18:33:30] + News From Folding@Home: Welcome to Folding@Home
    [18:33:30] Loaded queue successfully.
    [18:33:30] - Couldn't send HTTP request to server
    [18:33:30] (Got status 503)
    [18:33:30] + Could not connect to Work Server
    [18:33:30] - Attempt #9 to get work failed, and no other work to do.
    Waiting before retry.


    So it would appear that I cannot transmit results either. This is the same on all 4 of my VM machines.

    Any ideas as to what I can do?

    Am I alone in this?

    TIA,

    Keith.
     
  4. Christopher N. Lew

    Christopher N. Lew Folding in memory of my father

    Joined:
    23 Apr 2009
    Posts:
    1,358
    Likes Received:
    46
    OK, this bit is quite clear ...
    So if you look at Status/Queue Info you should see that slot 4 has not been returned, and that it has had at least one Upload Failure. So long as there is not a huge disaster with your machine, the client will attempt to send this at intervals.
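
    If the queue view is hard to get at, the same information can be pulled out of the client log. The following is only a rough sketch: `upload_summary` is a made-up helper name, the failure text is copied from the log posted above, and it assumes that the 'Thank you for contributing' message appears somewhere in the log when an upload succeeds.

```python
import re
from collections import Counter

# Failure text as it appears in the log posted earlier in this thread.
FAIL = re.compile(r"Could not transmit unit (\d+)")

def upload_summary(log_text):
    """Count failed uploads per queue slot and any upload confirmations."""
    failures = Counter()
    confirmations = 0
    for line in log_text.splitlines():
        match = FAIL.search(line)
        if match:
            failures[match.group(1)] += 1
        # Assumption: a successful upload logs this phrase somewhere in the line.
        elif "thank you for contributing" in line.lower():
            confirmations += 1
    return failures, confirmations

SAMPLE = """\
[18:11:24] - Error: Could not transmit unit 04 (completed August 1) to work server.
[18:11:24] Keeping unit 04 in queue.
[18:11:25] - Error: Could not transmit unit 04 (completed August 1) to work server.
[18:11:25] Could not transmit unit 04 to Collection server; keeping in queue.
"""

failures, confirmations = upload_summary(SAMPLE)
print(dict(failures), confirmations)  # {'04': 3} 0
```

    Three failures and no confirmation for slot 04 matches what the log shows: the unit is finished but still sitting in the queue.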

    This bit ...
    etc., etc. ... shows that the client waits for longer and longer times between attempts. I think this is why stopping the client and starting again sometimes connects more successfully than just leaving it; the client simply tries more often.
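
    To put numbers on the growing wait, the gaps between failed attempts can be worked out from the timestamps. This is a minimal sketch, not anything from the client itself: `retry_gaps` is a hypothetical helper, and the sample lines are the "Attempt #N" lines from the log posted above.

```python
import re

# Matches the failure lines from the log posted earlier in this thread.
ATTEMPT = re.compile(r"^\[(\d{2}):(\d{2}):(\d{2})\] - Attempt #\d+ to get work failed")

def retry_gaps(log_text):
    """Return the seconds between consecutive failed work requests."""
    times = []
    for line in log_text.splitlines():
        match = ATTEMPT.match(line.strip())
        if match:
            hours, minutes, seconds = (int(part) for part in match.groups())
            times.append(hours * 3600 + minutes * 60 + seconds)
    # The modulo keeps a gap positive if the log rolls past midnight.
    return [(later - earlier) % 86400 for earlier, later in zip(times, times[1:])]

LOG = """\
[18:11:26] - Attempt #1 to get work failed, and no other work to do.
[18:11:42] - Attempt #2 to get work failed, and no other work to do.
[18:11:58] - Attempt #3 to get work failed, and no other work to do.
[18:12:20] - Attempt #4 to get work failed, and no other work to do.
[18:13:04] - Attempt #5 to get work failed, and no other work to do.
[18:14:28] - Attempt #6 to get work failed, and no other work to do.
[18:17:16] - Attempt #7 to get work failed, and no other work to do.
[18:22:48] - Attempt #8 to get work failed, and no other work to do.
[18:33:30] - Attempt #9 to get work failed, and no other work to do.
"""

gaps = retry_gaps(LOG)
print(gaps)  # [16, 16, 22, 44, 84, 168, 332, 642]
```

    From the third attempt onwards the wait roughly doubles each time, so by attempt #9 the client is only asking about once every ten minutes. A freshly restarted client starts again at the short end of that scale.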

    You aren't alone; the posts on the Stanford forums show that. Whatever the problem is, it won't get properly sorted until someone at the other end goes in to fix it. Given the time difference, that'll probably be sometime during our night.

    Stopping and restarting the client at your end is probably the only thing you can do, or turn off completely and try again tomorrow (at least you wouldn't be wasting electricity) :D
     
  5. Keith_Whi

    Keith_Whi What's a Dremel?

    Joined:
    6 May 2009
    Posts:
    228
    Likes Received:
    3
    Thanks Christopher.

    Keith.
     
  6. Christopher N. Lew

    Christopher N. Lew Folding in memory of my father

    Joined:
    23 Apr 2009
    Posts:
    1,358
    Likes Received:
    46
    Just realised you're using VMs. I assume the queue system is the same as with system tray clients, but have no idea how to look at the queue.
    Sorry
     