[feature] add the ability to recover shuffling datasets in checkpoint. #247
[fix] bug fixed, epoch for load checkpoint cannot exceed the config one.
check failed.
89d498a to aa99a07
[feature] add cache deletion function. [fix] Fixed bug and improved feature.
LGTM. A couple of minor comments.
mlora/executor/task/task.py
Outdated
data = preprocess_func[preprocess_type](data)
# If data preprocess_type is shuffle, create a cache folder,
# to store shuffled data and use it for saving checkpoints.
if preprocess_type == "shuffle":
Can we move this logic to line 97, or put it in a new function?
Maybe it's better to avoid the "if" branch here, because each preprocess_type already has its own processing function defined at line 94.
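A minimal sketch of what this suggestion could look like, assuming the shuffle caching is folded into the function registered for "shuffle" so the caller never branches on the type. The `cache_dir` parameter, the `shuffle_data.json` file name, the "default" entry, and the function names are hypothetical and are not taken from the PR.

```python
import json
import random
from pathlib import Path
from typing import Callable, Dict, List

Data = List[dict]


def default_preprocess(data: Data, cache_dir: Path) -> Data:
    # No reordering happens here, so nothing needs to be cached.
    return data


def shuffle_preprocess(data: Data, cache_dir: Path) -> Data:
    # Hypothetical caching scheme: reuse a previously saved order if one
    # exists, so a run resumed from a checkpoint sees the same shuffled data.
    cache_file = cache_dir / "shuffle_data.json"
    if cache_file.exists():
        return json.loads(cache_file.read_text())
    shuffled = list(data)
    random.shuffle(shuffled)
    cache_dir.mkdir(parents=True, exist_ok=True)
    cache_file.write_text(json.dumps(shuffled))
    return shuffled


# Dispatch table: each preprocess type maps to its own function, so the
# shuffle-specific work lives in shuffle_preprocess instead of an "if".
preprocess_func: Dict[str, Callable[[Data, Path], Data]] = {
    "default": default_preprocess,
    "shuffle": shuffle_preprocess,
}


def preprocess(data: Data, preprocess_type: str, cache_dir: Path) -> Data:
    # A single lookup replaces the per-type branch at the call site.
    return preprocess_func[preprocess_type](data, cache_dir)
```

With this shape, adding another preprocess type only means registering one more function in the table; the call site stays unchanged.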
OK, let me have a try.
Good idea, this is indeed better. I have already modified the code.
LGTM