failure to autoscale unless workers are already present
See original GitHub issueI am testing the PBSCluster along with autoscaling. It seems that I am unable to get the cluster to launch any workers without explicitly starting at least one worker. I would expect that this configuration would scale from 0 to 10 (180 processes) without further interaction/configuration.
cluster = PBSCluster(queue='default',
walltime='01:00:00',
project='MyAccount',
resource_spec='1:ncpus=36:mpiprocs=36:mem=109GB',
interface='ib0',
threads=4,
processes=18)
client = Client(cluster)
cluster.adapt(minimum=0, maximum=10)
@mrocklin - this may actually be a problem with the dask adaptive cluster but I wanted to discuss here to see if I am missing something obvious specific to PBS.
Issue Analytics
- State:
- Created 5 years ago
- Comments:9 (9 by maintainers)
Top Results From Across the Web
Troubleshoot Amazon EC2 Auto Scaling issues
An EC2 instance in an Amazon EC2 Auto Scaling group reboots during a deployment. Your deployment can fail if an EC2 instance is...
Read more >Autoscaling in Kubernetes: Why doesn't the Horizontal Pod ...
I'm sure all of this seems simple enough that you're now wondering what could possibly go wrong for the HPA to not work....
Read more >Troubleshooting Azure Monitor autoscale - Microsoft Learn
Review the autoscale metrics if you are using a metric-based scale rule. It's possible that the Observed metric value or Observed Capacity are ......
Read more >Autoscaling clusters | Dataproc Documentation - Google Cloud
If there are pending containers, autoscaling may add workers to the cluster. You can view these metrics in Cloud Monitoring. As a default,...
Read more >Azure autoscaling is not working - Stack Overflow
If there was a failure auto-scaling your service, you would be able to get ... of 1% you suddenly have 99% for 15...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Can you report the contents of cluster._adaptive.log ?
On Thu, Mar 29, 2018 at 3:35 PM, Joe Hamman notifications@github.com wrote:
closed via #63