question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

"Unhealthy" state can't be recovered remotely in a default setup

See original GitHub issue

Describe the issue you are experiencing

I also filed an issue on core repository about this (https://github.com/home-assistant/core/issues/78973). When a power cycle happens at home, Internet usually comes back up later than RPi4+HassOS. That apparently forces a “Unhealthy” state as I understand (because the documentation is very limited and mostly specific to certain components).

When RPi4+HassOS becomes Unhealthy, it’s impossible to fix that state remotely using the UI. Only fix option is to use “Terminal & SSH” add-on, which isn’t available in a default setup. You can’t even install that add-on after the fact because add-on installs are blocked by Unhealthy state.

Restarting HA Core doesn’t fix this issue. Restarting HA Supervisor isn’t available in the UI.

This is such an unacceptable problem in the operating flow of Home Assistant. I say, HA Supervisor should be able to automatically recover from problems created by missing Internet connection during startup. It should never require manual intervention. Unhealthy should explicitly be reserved for truly unrecoverable phases.

What is the used version of the Supervisor?

supervisor-2022.09.1

What type of installation are you running?

Home Assistant OS

Which operating system are you running on?

Home Assistant Operating System

What is the version of your installed operating system?

There’s no supervisor panel, and System tab doesn’t show this information.

What version of Home Assistant Core is installed?

core-2022.9.5

Steps to reproduce the issue

  1. Experience power loss at home.
  2. Supervisor becomes Unhealthy.

Anything in the Supervisor logs that might be useful for us?

Failed to to call /addons/core_ssh/install - 'AddonManager.install' blocked from execution, system is not healthy


### Additional information

_No response_

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:8 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
ssgcommented, Sep 29, 2022

Thanks @mdegat01, yes, I wasn’t able to find this option. Updating the docs would be great. I appreciate your help.

1reaction
mdegat01commented, Sep 29, 2022

or add a “reboot Host OS” option in the menu.

This already exists. Go to Settings -> System -> Hardware, click the 3 dots in the top right and do “Reboot Host”: Screen Shot 2022-09-29 at 2 47 46 PM

Also just an FYI this is in the link. Immediately after it suggests running ha supervisor restart it says this:

If this does not help, you can try to reboot the host. If you are running Home Assistant Operating System, this can be done from the “System” tab in the Supervisor panel. On the card for “Host System”, there is a button to reboot the host.

If you find the wording confusing you’re welcome to propose an update, there’s an edit button right at the bottom of that page.

EDIT: Ok actually that does need an update. It still mentions the host system card which is the old UI. I’ll update it.

EDIT 2: https://github.com/home-assistant/home-assistant.io/pull/24283

Read more comments on GitHub >

github_iconTop Results From Across the Web

Unhealthy state of Exchange servers when monitoring ...
Get-ServerHealth shows health sets in an Unhealthy state because the monitoring mailboxes are deleted. Provides a resolution.
Read more >
Office Web Apps Server 2013 - machines are always reported ...
But all the machines are reporting that they are Unhealthy. ... with an exception: The remote server returned an error: (404) Not Found....
Read more >
Recovering Terraform State - Medium
Here's the scenario, you've documented the steps for creating new infrastructure using Terraform including ensuring that state files are dealt ...
Read more >
Health checks for Auto Scaling instances - AWS Documentation
The health status of an Auto Scaling instance indicates whether it is healthy or unhealthy. All instances in your Auto Scaling group start...
Read more >
Troubleshooting Deployed NSS Servers - Zscaler Help
You can use the following commands within the virtual machine (VM) console for your platform in order to configure and troubleshoot the NSS....
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found