question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

requests.exceptions.ConnectionError: HTTPConnectionPool(host='localhost', port=8140): Max retries exceeded with url: /api/v1/nni/check-status (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7ffa090c0d00>: Failed to establish a new connection: [Errno 111] Connection refused'))

See original GitHub issue

when i run ''nnictl create --config config.yml -p 8140", i get the error:

requests.exceptions.ConnectionError: HTTPConnectionPool(host='localhost', port=8140): Max retries exceeded with url: /api/v1/nni/check-status (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7ffa090c0d00>: Failed to establish a new connection: [Errno 111] Connection refused'))

Environment:

  • NNI version:
  • v2.9
  • Training service (local|remote|pai|aml|etc):
  • remote
  • Client OS:
  • Server OS (for remote mode only):
  • Python version:
  • 3.8
  • PyTorch/TensorFlow version:
  • PyTorch 1.7.0
  • Is conda/virtualenv/venv used?:
  • conda
  • Is running in Docker?: no

Configuration:

  • Experiment config (remember to remove secrets!):
trialConcurrency: 2 #trail的并发数,根据GPU数量设置,此值为几就有几个train在同时跑
trainingService:
  platform: local
  gpuIndices: [6,7] # 使用哪几个GPU
  # gpuIndices: [0] # 使用哪几个GPU
  useActiveGpu: True # 默认值false。是否使用已经被其他进程使用的gpu,包括graphical desktop占用的。
  maxTrialNumberPerGpu: 1 #指定1个GPU上最大并发trail的数量,在确保显存达到足以容下任何两个trail时,再设置为2。
trialGpuNumber: 1 # 每个trail所需要的gpu
  • Search space:
{
    "epochs":{"_type":"choice","_value":[400,500]},
    "lr":{"_type":"quniform","_value":[0.0001,0.0025,0.0005]},
}

Log message:

  • nnimanager.log:
[2022-09-13 20:23:23] INFO (main) Start NNI manager
  • dispatcher.log: none
  • nnictl stdout and stderr: none

How to reproduce it?:

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:15 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
szhang963commented, Sep 18, 2022

This error has happened due to a change in the item of experimentWorkingDirectory in the config.yml. One can cancel the change and maintain the default. However, I can not check the real cause, but I never find the error in version of 2.0.

0reactions
xiangtaowongcommented, Sep 15, 2022

@QuanluZhang Would you please have a look about the new issue when u can? #5128 It’s a iuuse about the visualization website, thank u!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Max retries exceeded with URL in requests - Stack Overflow
OP's error doesn't say "Connection refused", it says "Name or service not known". This answer seems to assume that all ConnectionError are due ......
Read more >
Max retries exceeded and connection refused errors ... - GitHub
The part of the error mentioning the port seems to depend on the port that was in my post message python script. The...
Read more >
requests.exceptions.ConnectionError: HTTPSConnectionPool ...
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='acme-v02.api.letsencrypt.org', port=443): Max retries exceeded with url: ...
Read more >
connection refused: 8140 - Google Groups
When I run "puppet agent --test" i am getting the connection refused error at port 8140. When I am installing puppet master I...
Read more >
Port 8140 (tcp/udp) - SpeedGuide
TCP enables two hosts to establish a connection and exchange streams of data. TCP guarantees delivery of data and that packets will be...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found