question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Web Interface "Stop run" does not work

See original GitHub issue

wandb --version && python --version && uname

  • Weights and Biases version: 0.8.13
  • Python version: 3.7.4
  • Operating System: Linux

Description

Stop an in-progress run

What I Did

I ran the following demo script:

import wandb
from random import random
from time import sleep

wandb.init(project="testing-abort-job")

wandb.config.dropout = 0.2
n = 120
for i in range(1, n):
    sleep(random())
    wandb.log({"epoch": i / 10, "loss": 1/i**0.5})

I saw the project appear in the wandb web app, but when I selected “Stop run” under the context menu (using the three dots to the left of the run name) nothing happened. I did this when it was reporting step 50, and then again around step 80. It continued to run until it completed all 120 steps.

Here are the packages installed in this conda environment:

# Name                    Version                   Build  Channel
_libgcc_mutex             0.1                        main  
argh                      0.26.2                   pypi_0    pypi
ca-certificates           2019.10.16                    0  
certifi                   2019.9.11                py37_0  
chardet                   3.0.4                    pypi_0    pypi
click                     7.0                      pypi_0    pypi
configparser              4.0.2                    pypi_0    pypi
docker-pycreds            0.4.0                    pypi_0    pypi
gitdb2                    2.0.6                    pypi_0    pypi
gitpython                 3.0.4                    pypi_0    pypi
gql                       0.1.0                    pypi_0    pypi
graphql-core              2.2.1                    pypi_0    pypi
idna                      2.8                      pypi_0    pypi
libedit                   3.1.20181209         hc058e9b_0  
libffi                    3.2.1                hd88cf55_4  
libgcc-ng                 9.1.0                hdf63c60_0  
libstdcxx-ng              9.1.0                hdf63c60_0  
ncurses                   6.1                  he6710b0_1  
nvidia-ml-py3             7.352.0                  pypi_0    pypi
openssl                   1.1.1d               h7b6447c_3  
pathtools                 0.1.2                    pypi_0    pypi
pip                       19.3.1                   py37_0  
promise                   2.2.1                    pypi_0    pypi
psutil                    5.6.3            py37h7b6447c_0  
python                    3.7.4                h265db76_1  
python-dateutil           2.8.0                    pypi_0    pypi
pyyaml                    5.1.2                    pypi_0    pypi
readline                  7.0                  h7b6447c_5  
requests                  2.22.0                   pypi_0    pypi
rx                        1.6.1                    pypi_0    pypi
sentry-sdk                0.13.1                   pypi_0    pypi
setuptools                41.4.0                   py37_0  
shortuuid                 0.5.0                    pypi_0    pypi
six                       1.12.0                   pypi_0    pypi
smmap2                    2.0.5                    pypi_0    pypi
sqlite                    3.30.1               h7b6447c_0  
subprocess32              3.5.4                    pypi_0    pypi
tk                        8.6.8                hbc83047_0  
urllib3                   1.25.6                   pypi_0    pypi
wandb                     0.8.13                   pypi_0    pypi
watchdog                  0.9.0                    pypi_0    pypi
wheel                     0.33.6                   py37_0  
xz                        5.2.4                h14c3975_4  
zlib                      1.2.11               h7b6447c_3  

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:10 (3 by maintainers)

github_iconTop GitHub Comments

21reactions
jlehrer1commented, Apr 21, 2022

This is still a problem. When training a model tracked with Wandb on a cluster, the active state persists after the job is shut down and the web button for stop run doesn’t work.

2reactions
DanyAyalacommented, Oct 3, 2022

Giving this a bump as I’ve encountered the exact same issue when attempting to stop a hung run (> 24 hrs) using the “Stop run” button on the web interface. I don’t have physical access to the machine the experiment is running on, but I’ll speak to our internal contact about it.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Web Interface "Stop run" does not work · Issue #648 · wandb ...
I saw the project appear in the wandb web app, but when I selected "Stop run" under the context menu (using the three...
Read more >
[Solved] Web interface stops working - OPNsense Forum
Hi, after reboot, the web interface stops working for me and instead I instead get a 503. This seems to be happening only...
Read more >
ESXi 7 running but web interface not loading
The system is up and running for a few months without issues. All hosts are working as expected but the web interface stopped...
Read more >
How to fix QNAP NAS web GUI interface timing out or never ...
The Problem You're trying to connect to your NAS's web interface but it's just spinning forever and not actually loading.
Read more >
Web Interface to the controller stopped but can get to it via SSH
I am able to ping it and SSH to it. We are running 6.3.1.2 AOS. Is there a service(s) to restart without restarting...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found