MDCE Causing Blue Screen of Death (Clock Watchdog Timeout)
9 views (last 30 days)
Show older comments
We have a 24 core computer that has a MDCE license on it. Currently I have 23 workers set for the machine. I am able to successfuly submit complex simulink runs to the machine and retrieve the data when they finish. However, once I start submitting enough jobs where some are in the queue the machine crashes given a BSoD error of "Clock Watchdog Timeout". I am not entirely sure how many jobs causes the error. Once I had 24 jobs going and they all finished without a problem.
2 Comments
Ming Yue
on 7 Dec 2018
It would be good to know whether the same crash happens without Simulink. To diagnose, you can submit some non-Simulink jobs (enough to have some jobs in the queue). The jobs could just be some long MATLAB function calls.
Also you could investigate the MDCE log file to look for signs out-of-memory errors. If you are using Windows, the log files are located here:
<TEMP>\MDCE\Log where <TEMP> refers to the system TEMP variable. By default, it is in the directory: C:\TEMP\MDCE\Log
Since you are using one computer with 24 cores, parallel computing could be done with PCT as well. Would these jobs crash if you use PCT instead? For PCT, you need to select "local" profile when you create the cluster:https://www.mathworks.com/help/distcomp/submit.html. Then the steps to submit job would be the same as submitting to MDCS.
Accepted Answer
More Answers (0)
See Also
Categories
Find more on Startup and Shutdown in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!