-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ArrowInvalid: Column 4 named images expected length 360 but got length 352 #5
Comments
Ran the same config through without the I am hoping to train on multimodal datasets with sequence parallelism - would love advice on how we could enable training on image datasets too. |
Thanks for your interest! |
Thanks for the update! What timelines are you tracking internally for releasing multimodal SP? And any way I could help with adding support for multimodal SP? This is relatively high on my priority list at the moment! |
@HaoshengZou following up here - curious to hear how your team is thinking about integrating this! |
@DhruvaBansal00 Hi! Thanks for your interest! We are now at Chinese New Year and expect to get on this in two weeks. |
@DhruvaBansal00 Sorry we haven't had man power recently as focus is shifted to R1 models for now. We'll update later when we have man power. |
Reminder
System Info
llamafactory
version: 0.9.1Reproduction
^config file for training
Stack trace:
Expected behavior
Training should proceed for Qwen 2.5 VL 72b normally
Others
mllm_demo is a dataset with images. Has this repo been tested with multimodal datasets yet?
The text was updated successfully, but these errors were encountered: