question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

"Baseline" workers combination with submitted worker jobs

See original GitHub issue

I am working on the Princeton Cluster tigercpu, which presents some challenges to an efficient workflow with dask-jobqueue: Small fast starting jobs are somewhat discouraged, e.g. most of the time the fastest starting jobs have 2 nodes (each with 40 cores) if submitted as batch script. Interactive jobs start faster most of the time.

My current workflow looks something like this:

  1. Request a single node as interactive job (fast), start a Jupyter notebook, start a SLURMcluster, and ssh to this node.
  2. Request another interactive job (more resources, usually fastish when I request an interactive session) where I basically manually execute the jobscript created by the SLURMcluster, connecting to the notebook
  3. Dask away.

This is quite cumbersome. If I substitute 2) by using the actual dask-jobqueue functionality this would be much cleaner, but wait times can be longer. Is there a way to start a few workers on the same node as the scheduler (created in 1)) directly from the SLURMcluster? I guess this is somewhat a combination of LocalCluster and SLURMcluster?

That would be ideal, since it would immediately provide a few baseline workers to explore data and make some preliminary analysis, before the real compute power comes on.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:7 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
guillaumeebcommented, Aug 30, 2022

Closing this issue as stale, and should be fixed once we implement #419.

Another solution would be to manually start a worker from inside the Notebook where SLURMCluster has been used.

0reactions
jbuseckecommented, Aug 30, 2022

Awesome. Thanks. And sorry for the radio silence. I am actually not at Princeton anymore, but will keep these tips in mind the next time I work on an HPC!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Baseline jobs Definition | Law Insider
Baseline jobs means Three Hundred and Ten (310) jobs. Grantee shall submit to MTC, within thirty (30) days of each annual anniversary of...
Read more >
The idea of a “baseline” level of staffing is “minimum staff ...
The "minimum staff complement”—the number of employees and their workload. To have enough staff, all you need is the correct number of the ......
Read more >
Glossary : U.S. Bureau of Labor Statistics
Formal relationship between a worker and a sponsor that consists of a combination of on-the-job training and related occupation-specific instruction in which ...
Read more >
REPORT ON BASELINE EMPLOYER SURVEY AND ...
The worker study is designed to follow workers in low-paying jobs for three years during the implementation of the Seattle minimum wage ordinance....
Read more >
fws4749.pdf - OPM
Federal Wage System Job Grading Standard for ... Work in this occupation varies in the combination of trade practices that are performed in ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found