[FEA] Dynamic configuration of worker device memory limits
I’d like to be able to dynamically configure worker device memory limits. On a running cluster, a user may have workloads of various sizes. Some workloads benefit from device memory limits, while others are harmed by them. Requiring an explicit cluster bringdown/bringup to change memory limits can pose a real challenge. Allowing a user to configure the device memory limit dynamically would let them run each individual workload with the setting that suits it best.
From discussions with @pentschev, this would likely require changing the LRU parameters and require that the cache be empty. Given that, I wonder if this would be possible in something like a call to Client.restart, though of course not requiring that might be desirable, too.
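To make the constraint above concrete, here is a minimal, self-contained sketch of an LRU cache whose byte limit can only be changed while the cache is empty. It is a toy model, not dask-cuda's implementation: `ToyDeviceCache`, `set_limit`, and `clear` are hypothetical names invented for illustration, and `clear` only stands in for whatever would empty the real cache (for example, something like Client.restart, as suggested above).

```python
from collections import OrderedDict


class ToyDeviceCache:
    """Hypothetical stand-in for a worker's LRU device-memory cache."""

    def __init__(self, limit_bytes):
        self.limit_bytes = limit_bytes
        self.used_bytes = 0
        self._items = OrderedDict()  # key -> size in bytes, oldest first
        self.spilled = []  # keys evicted to (pretend) host memory

    def put(self, key, nbytes):
        # Re-inserting a key refreshes its position and size.
        if key in self._items:
            self.used_bytes -= self._items.pop(key)
        self._items[key] = nbytes
        self.used_bytes += nbytes
        # Evict least recently used entries until usage fits the limit,
        # always keeping the entry that was just inserted.
        while self.used_bytes > self.limit_bytes and len(self._items) > 1:
            old_key, old_size = self._items.popitem(last=False)
            self.used_bytes -= old_size
            self.spilled.append(old_key)

    def clear(self):
        # Assumption: models emptying the cache, e.g. via a worker restart.
        self._items.clear()
        self.used_bytes = 0

    def set_limit(self, limit_bytes):
        # The limit may only change while nothing is cached.
        if self._items:
            raise RuntimeError("cache must be empty before changing the limit")
        self.limit_bytes = limit_bytes


cache = ToyDeviceCache(limit_bytes=100)
cache.put("a", 60)
cache.put("b", 60)  # exceeds the limit, so "a" is evicted

try:
    cache.set_limit(200)  # rejected: the cache still holds "b"
    changed_while_full = True
except RuntimeError:
    changed_while_full = False

cache.clear()
cache.set_limit(200)  # accepted: the cache is empty
cache.put("a", 60)
cache.put("b", 60)  # both entries now fit, nothing new is evicted
```

Refusing the change while entries are cached keeps the sketch simple. Supporting a change on a non-empty cache would mean evicting down to the new limit when it shrinks, which is the "not requiring that" variant mentioned above.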
Issue Analytics
- State:
- Created 3 years ago
- Comments:8 (8 by maintainers)
Top GitHub Comments
Thanks for confirming, @beckernick. In the interest of keeping maintenance costs to a minimum, I’ll go ahead and close the PR and this issue. If this becomes important again in the future, we can pick up from where we left off.
This is no longer relevant for the original motivating use case. I suspect that @madsbk's work in #353, and his ongoing work to understand the details of the underlying device memory while spilling, may further reduce the need for this.
With that said, it might be a nice small addition in the future.