question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

wandb: Network error (TransientException), entering retry loop.

See original GitHub issue

wandb, version 0.8.22 Python 3.6.8 Ubuntu 18

Description

W&B was working fine until one day I started getting these error messages. I’m using W&B normally as far as I can tell. I’m not using sweeps or anything fancy. I “init” and then “log”:

wandb: Tracking run with wandb version 0.8.22
wandb: Run data is saved locally in XXX/wandb/run-20200128_181440-yz2o7uiw
wandb: Syncing run A002
wandb: ⭐ View project at https://app.wandb.ai/XXX
wandb: 🚀 View run at https://app.wandb.ai/XXX
wandb: Run `wandb off` to turn off syncing.

wandb: Network error (TransientException), entering retry loop. See /home/XXX/wandb/debug.log for full traceback.
wandb: ERROR Error uploading "___batch_archive_1.tgz": CommError, None
[ batch loss: 0.000208 | batch RMSE: 3.7270] :  27%|███████████▋                               | 36/132 [00:28<01:17,  1.24it/sBus error (core dumped) 2.6732 | val-loss: 0.000312| val_rmse: 4.0230:  74%|█████████████▎    | 37/50 [1:07:44<23:51, 110.11s/it]
(sia-env) XXX:~/projects/orofacial$ wandb: Program ended successfully.        | 14/132 [00:10<01:34,  1.25it/s]
wandb: Run summary:
wandb:                       _step 73
wandb:                  _timestamp 1580239346.2581983
wandb:                    _runtime 4074.0023016929626
wandb:                        Loss 0.00013416090676909802
wandb:               learning rate 1.25e-06
wandb: Syncing 8 W&B file(s) and 0 media file(s)
 (%(failed_batches)d failed uploads)wandb: Network error (TransientException), entering retry loop. See /home/siarez/projects/orofacial/artifacts/train.py/2020-01-28-13-14-32_0/wandb/debug.log for full traceback.
 (%(failed_batches)d failed uploads)wandb: ERROR Error uploading "config.yaml": CommError, None
 (%(failed_batches)d failed uploads)wandb: ERROR Error uploading "wandb-summary.json": CommError, None
wandb: ERROR Error uploading "wandb-metadata.json": CommError, None
 (%(failed_batches)d failed uploads)wandb: ERROR Error uploading "output.log": CommError, None
 (%(failed_batches)d failed uploads)wandb:                                                                                
wandb: Synced A002: https://app.wandb.ai/siarez/orofacial/runs/yz2o7uiw

What I Did

Nothing

The debug.log was too large to fit in “pastebin”. But here are the first 140 lines: https://hastebin.com/ixonizoyal.sql

Here are the last 170 lines of debug.log: https://hastebin.com/alevotuduv.sql

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Reactions:1
  • Comments:16 (6 by maintainers)

github_iconTop GitHub Comments

1reaction
syoramicommented, Jul 5, 2020

I’m using wandb localwith python=3.6.8 wandb=0.9.2. The log file is large and here is part of it:

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/user/anaconda3/lib/python3.6/site-packages/requests/adapters.py", line 449, in send
    timeout=timeout
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/connectionpool.py", line 725, in urlopen
    method, url, error=e, _pool=self, _stacktrace=sys.exc_info()[2]
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/util/retry.py", line 403, in increment
    raise six.reraise(type(error), error, _stacktrace)
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/packages/six.py", line 734, in reraise
    raise value.with_traceback(tb)
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/connectionpool.py", line 677, in urlopen
    chunked=chunked,
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/connectionpool.py", line 426, in _make_request
    six.raise_from(e, None)
  File "<string>", line 3, in raise_from
  File "/home/user/anaconda3/lib/python3.6/site-packages/urllib3/connectionpool.py", line 421, in _make_request
    httplib_response = conn.getresponse()
  File "/home/user/anaconda3/lib/python3.6/http/client.py", line 1331, in getresponse
    response.begin()
  File "/home/user/anaconda3/lib/python3.6/http/client.py", line 297, in begin
    version, status, reason = self._read_status()
  File "/home/user/anaconda3/lib/python3.6/http/client.py", line 266, in _read_status
    raise RemoteDisconnected("Remote end closed connection without"
urllib3.exceptions.ProtocolError: ('Connection aborted.', RemoteDisconnected('Remote end closed connection without response',))

wandb sync gives this error:

wandb: Network error (ConnectionError), entering retry loop. See /home/user/Documents/syoya/Competes/Alaska2/wandb/debug.log for full traceback.
wandb: Network error (ConnectionError), entering retry loop. See /home/user/Documents/syoya/Competes/Alaska2/wandb/debug.log for full traceback.

I’ve also tested on my local Mac and same problem occurs.

0reactions
jw3126commented, May 7, 2021

I will try to produce a mwe.

Read more comments on GitHub >

github_iconTop Results From Across the Web

wandb: Network error (TransientException), entering retry loop.
wandb, version 0.8.22 Python 3.6.8 Ubuntu 18 Description W&B was working fine until one day I started getting these error messages.
Read more >
Troubleshooting - Documentation - Weights & Biases - Wandb
If you're seeing SSL or network errors: wandb: Network error (ConnectionError), entering retry loop . You can try a couple of different approaches...
Read more >
Weights and Biases: Login and network errors - Stack Overflow
This error happens when I use the command: wandb login ... refused')) wandb: Network error (ConnectionError), entering retry loop.
Read more >
wandb: Network error (ConnectionError), entering retry loop.
How do I deal with network issues? 1. SSL certificate. 解决SSL certificate,我没有管理员权限,无法更改,而且更改完之后可能依然会报错?
Read more >
Weights and Biases blocked on the ETH Proxy Server
Weights and biases (wandb) is a web service used by many Euler user to have ... "wandb: Network error (TransientError), entering retry loop....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found