question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

long latency node join 5 nodes cluster failure

See original GitHub issue

I tried to join a node to my existing cluster from a different region which has 100ms latency. The API server of the new node keeps restarting.

But, if I join another empty one node cluster in the same place with the 5 nodes cluster. It could work.

It seems the API server waited too short time before the deadline is reached?

Feb 19 05:01:15 de1 microk8s.daemon-apiserver[13747]: W0219 05:01:15.220327   13747 authentication.go:519] AnonymousAuth is not allowed with the AlwaysAllow authorizer. Resetting AnonymousAuth to false. You should use a different authorizer
Feb 19 05:02:15 de1 microk8s.daemon-apiserver[13747]: Error: context deadline exceeded
Feb 19 05:02:15 de1 systemd[1]: snap.microk8s.daemon-apiserver.service: Main process exited, code=exited, status=1/FAILURE
Feb 19 05:02:15 de1 systemd[1]: snap.microk8s.daemon-apiserver.service: Failed with result 'exit-code'.
Feb 19 05:02:15 de1 systemd[1]: snap.microk8s.daemon-apiserver.service: Scheduled restart job, restart counter is at 10.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:21

github_iconTop GitHub Comments

1reaction
balchuacommented, Mar 12, 2021

Aaa you’re right. It does a kubectl get no.

You can actually forcefully remove it from dqlite with this command.

snap/microk8s/current/bin/dqlite -s file:///var/snap/microk8s/current/var/kubernetes/backend/cluster.yaml -c /var/snap/microk8s/current/var/kubernetes/backend/cluster.crt -k /var/snap/microk8s/current/var/kubernetes/backend/cluster.key -f json k8s ".remove <node-ip-with-port-19001>"
0reactions
stale[bot]commented, Feb 11, 2022

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

Read more comments on GitHub >

github_iconTop Results From Across the Web

long latency node join 5 nodes cluster failure · Issue #2033
I tried to join a node to my existing cluster from a different region which has 100ms latency. The API server of the...
Read more >
Tuning Failover Cluster Network Thresholds
Delay – This defines the frequency at which cluster heartbeats are sent between nodes. The delay is the number of seconds before the...
Read more >
Cluster fault detection | Elasticsearch Guide [8.5]
If a node takes too long to process a cluster state update, it can be harmful to the cluster. The master will remove...
Read more >
Data-Driven Packet Loss Estimation for Node Healthy Sensing ...
If one detects a failure, it will mark this node in its own member state table and try to gossip it to the...
Read more >
Integrated Storage | Vault
A 3 node cluster with a large amount of data that's at a failure tolerance of 1. · Another 3 new nodes then...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found