Skip to content

Commit 93bb9a8

Browse files
authored
fix when the job fails and doesn't kick the node (Azure#3186)
* fix when the job fails and doesn't kick the node * reformat
1 parent ec99008 commit 93bb9a8

File tree

1 file changed

+1
-0
lines changed
  • best-practices/largescale-deep-learning/Debugging/Compute

1 file changed

+1
-0
lines changed

best-practices/largescale-deep-learning/Debugging/Compute/run_nhc.py

+1
Original file line numberDiff line numberDiff line change
@@ -100,6 +100,7 @@ def parse_output(output_file):
100100
output, error = process.communicate()
101101
print(output)
102102
print(error)
103+
if full_errors:
103104
raise Exception(
104105
"Failures were found while running the node health checks. Please see the std_log_process.txt files under the 'outputs and logs' tab of the job for more information."
105106
+ full_errors

0 commit comments

Comments
 (0)