Skip to content

Commit e1c95f7

Browse files
committed
Setting logging.raiseExceptions=True so that DATA_LOSS errors that occur in debug/info/other logs raise exceptions immediately.
1 parent b1e3c0a commit e1c95f7

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

axlearn/common/launch_trainer.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44

55
import contextlib
66
import json
7+
import logging as py_logging
78
import os
89
from typing import Any, Optional
910

@@ -16,6 +17,8 @@
1617
from axlearn.common.utils import MeshShape, get_data_dir, infer_mesh_shape
1718
from axlearn.experiments import TrainerConfigFn, get_named_trainer_config
1819

20+
logging.raiseException = True
21+
1922
# Trainer-specific flags.
2023
flags.DEFINE_string(
2124
"module",

0 commit comments

Comments
 (0)