-
Notifications
You must be signed in to change notification settings - Fork 219
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(Training qwen2.5-VL-7B-Instruct) AssertionError: Input and cos/sin must have the same dtype, got torch.float16 and torch.bfloat16 #105
Comments
Hello, when I switched the model from Qwen2.5-VL-3B-Instruct to Qwen2-VL-2B-Instruct, the error was resolved. I suspect it might be due to differences in model precision? |
This issue appears to be due to changes in the transformers library version. A similar issue (huggingface/transformers#36188) references the transformers version (f7a3c62), but after installing that specific version, I encountered a new error: |
+1 Same issue when using this script:
Lib versions:
@TobiasLee Any pointers on this issue? 👀 |
may be a "deepspeed" error. I run this command without "--deepspeed local_scripts/zero3.json", it can work |
I test your method: delete "--deepspeed local_scripts/zero3.json". I only have 4 A100 GPUs, but when I run the code with export CUDA_VISIBLE_DEVICES="0,1,6,7", it outputs the error: ** CUDA out of memory. Tried to allocate 30.00 MiB. GPU 3 has a total capacity of 79.15 GiB**. What should I do? |
decrease "max_prompt_length" \ "num_generations" \ "max_completion_length" \ "max_prompt_length" |
Temporary fix: |
It works! |
Bash file:
data:image/s3,"s3://crabby-images/4dad7/4dad77ff771c0c9a3f9ef354b3e5186d5b13a390" alt="Image"
data:image/s3,"s3://crabby-images/a3dbb/a3dbbb3783baab2c766a1abb18c462c24c983ede" alt="Image"
Log:
data:image/s3,"s3://crabby-images/8bf84/8bf84232b7f4cd886e7aec027ba92bd78db4fa47" alt="Image"
data:image/s3,"s3://crabby-images/b85a4/b85a4a1dd38ae66fcf25b1096d4588c35929af44" alt="Image"
The text was updated successfully, but these errors were encountered: