-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error executing job with overrides: ['exp=base_test'] #11
Comments
Yeah, I'm having the same issue. I can see that |
Thanks, so you suppose that I need to write a new training script with datamodule? |
I am facing this issue right now too! Can you please provide some pointers on what needs to be changed? The config yaml file or train.py? |
I updated the |
Okay, running now. Thanks! I'm curious; since I'm not familiar with Hydra or OmegaConf (yet) I'm not really clear what I'm training on when running Any clarification appreciated. |
It downloads a single song from YouTube as a test, you can see the url in the |
Okay, I assumed it must download something, but does it only use that song during this test training? Mostly I'm confused about the step of adding my data path to What would I change to use my own data (i.e., the path I specified in UPDATE: I've been looking at |
Thank you for the attempted fix! Unfortunately the current commit (d5f6870) still experiences this error. I've put the full error below in text form so that it can be more easily searched by other people with the same issue: Expand for full error(venv7) quoththeraver@ip-26-0-139-220:~/workspace/audio-diffusion-pytorch-trainer$ python train.py exp=base_test [2023-01-12 20:35:59,628][main.utils][INFO] - Disabling python warnings! Global seed set to 12345 [2023-01-12 20:35:59,637][__main__][INFO] - Instantiating datamodule . Error executing job with overrides: ['exp=base_test'] Traceback (most recent call last): File "/admin/home-quoththeraver/workspace/audio-diffusion-pytorch-trainer/venv7/lib/python3.8/site-packages/hydra/_internal/utils.py", line 644, in _locate obj = getattr(obj, part) AttributeError: module 'main' has no attribute 'module_base'During handling of the above exception, another exception occurred: Traceback (most recent call last): The above exception was the direct cause of the following exception: Traceback (most recent call last): The above exception was the direct cause of the following exception: Traceback (most recent call last): Specifically, this happens when following the install and run instructions in the readme (with a fresh virtual environment). Interestingly, I was able to successfully run the experiment on my local machine which already had plenty of packages installed, so there is clearly something wrong with the requirements. When I install the packages below (from my local machine's full pip freeze) I am able to run the experiment on the remote machine. I'd submit a PR with this updated requirements.txt, but obviously that would be pretty messy since not all of these packages are required. In case the issue was simply versioning, I did make a version of the repo's requirments.txt with explicit versions based on what I had in my local environment (see below) but unfortunately that also failed which means that there are some missing packages. requirements with versions (fails).txt Because I am now currently unblocked I think I'll end my debugging here, but I wanted to share this info with you guys so that future people are unblocked and maybe someone will go through each package to see where the problem is :P |
When this error manifested to me, it was related to the fact that |
Hello!
I tried to use " python train.py exp=base_test " in Eshell, but some errors were found as shown follows:
The text was updated successfully, but these errors were encountered: