Recommended approach to "resetting" the workers for each new parallel computation?

Question

Mitsu on 1 Jul 2022

1
Link

Direct link to this question

https://se.mathworks.com/matlabcentral/answers/1751640-recommended-approach-to-resetting-the-workers-for-each-new-parallel-computation

Answered: Raymond Norris on 1 Jul 2022

Every time I run a script, I want the parallel computation to be ran smoothly, regardless of any previous tasks the workers were working on the last time they were running or interrupted.

My ugly approach is this:

myCluster = parcluster('local');
delete(myCluster.Jobs); % I believe this "cleans" any unfinished/interrupted jobs and has the workers drop them
delete(gcp('nocreate')); % Delete the current parpool, in case it exists
myPool = parpool(myCluster,"AttachedFiles","example_library_file"); % Create a new(?) pool

I think this is not the proper way to do things.

It works fine for my purpose, although I do get the warning of "Cannot cancel or destroy a job that was not created by this Local cluster" upon the first time I run it for a new MATLAB session.

I don't think I fully understand how the pool and the cluster interact with each other. I am still a bit in the dark even after reading the documentation.

Could you correct my code so that it properly "clears" the current pool of workers and their jobs each time I run it?

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

Raymond Norris on 1 Jul 2022

1
Link

Direct link to this answer

https://se.mathworks.com/matlabcentral/answers/1751640-recommended-approach-to-resetting-the-workers-for-each-new-parallel-computation#answer_997850

Open in MATLAB Online

Hi @Mitsu

You have the right idea. A couple of quick comments

If you have a parallel pool running, then deleting the jobs will also delete the pool (a pool is a type of job). Therefore

delete(myCluster.Jobs)

is all you need.

The "Cannot cancel or destroy a job that was not created by this Local cluster" is a known issue that I believe was resolved in R2022a.

The "cluster" (myCluster in your case) is a MATLAB object that contains properties of the cluster (e.g., number of workers, name of host, location of MATLAB, etc.) and methods that can operate on it (e.g, job launchers). There are two job launchers: parpool and batch. By and large, when using the local profile, you'll only call parpool (but you could in theory run batch jobs as well).

Therefore, there are two reasons you'd want to call parcluster.

To override the default cluster

If you simply call

parpool

MATLAB will startup a pool of workers with a default pool size using your default profile. If you call

parpool(8)

MATLAB will startup a pool of eight workers using your default profile (if using the local profile, setting the size is the most common reason to call parpool explicitly). If you call

parpool(myCluster,10)

MATLAB will startup a pool of 10 workers using the myCluster profile. As listed in your example, there's more arguments you can pass to parpool as well.

[Optional] To set properties the job submitter will pickup to tweak your job submission.

Depending on the scheduler (and in particular 3rd party schedulers like Slurm and PBS), where MATLAB will submit your job, you can interject scheduler flags (e.g., walltime, queue name, etc.). Therefore, you might run something like

myCluster = parcluster('slurm');
myCluster.AdditionalSubmitArgs = '-p short';  % Doing this off the top of my head, might be a bit off
pool = myCluster(8);

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Recommended approach to "resetting" the workers for each new parallel computation?

0 Comments
Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

Recommended approach to "resetting" the workers for each new parallel computation?

0 Comments Show -2 older commentsHide -2 older comments

Answers (1)

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Products

Release

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments