May you add IDA-VLM? #186

jiyt17 · 2024-10-17T02:57:48Z

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model: we are the first work to propose visual instruction tuning with ID reference

xjtupanda · 2024-10-29T14:22:19Z

Thanks for sharing! We've incorporated the work into our repo.
Please also consider citing our works:

@article{fu2023mme,
  title={MME: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models},
  author={Fu, Chaoyou and Chen, Peixian and Shen, Yunhang and Qin, Yulei and Zhang, Mengdan and Lin, Xu and Yang, Jinrui and Zheng, Xiawu and Li, Ke and Sun, Xing and others},
  journal={arXiv preprint arXiv:2306.13394},
  year={2023}
}

@article{fu2024vita,
  title={VITA: Towards Open-Source Interactive Omni Multimodal LLM},
  author={Fu, Chaoyou and Lin, Haojia and Long, Zuwei and Shen, Yunhang and Zhao, Meng and Zhang, Yifan and Wang, Xiong and Yin, Di and Ma, Long and Zheng, Xiawu and He, Ran and Ji, Rongrong and Wu, Yunsheng and Shan, Caifeng and Sun, Xing},
  journal={arXiv preprint arXiv:2408.05211},
  year={2024}
}

@article{fu2024video,
  title={Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis},
  author={Fu, Chaoyou and Dai, Yuhan and Luo, Yondong and Li, Lei and Ren, Shuhuai and Zhang, Renrui and Wang, Zihan and Zhou, Chenyu and Shen, Yunhang and Zhang, Mengdan and others},
  journal={arXiv preprint arXiv:2405.21075},
  year={2024}
}

@article{yin2023survey,
  title={A survey on multimodal large language models},
  author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Li, Ke and Sun, Xing and Xu, Tong and Chen, Enhong},
  journal={arXiv preprint arXiv:2306.13549},
  year={2023}
}

@article{yin2023woodpecker,
  title={Woodpecker: Hallucination correction for multimodal large language models},
  author={Yin, Shukang and Fu, Chaoyou and Zhao, Sirui and Xu, Tong and Wang, Hao and Sui, Dianbo and Shen, Yunhang and Li, Ke and Sun, Xing and Chen, Enhong},
  journal={arXiv preprint arXiv:2310.16045},
  year={2023}
}

jiyt17 · 2024-10-29T14:42:21Z

OK, nice works! I used MME for test in the new version of IDA-VLM.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

May you add IDA-VLM? #186

May you add IDA-VLM? #186

jiyt17 commented Oct 17, 2024

xjtupanda commented Oct 29, 2024 •

edited

Loading

jiyt17 commented Oct 29, 2024

May you add IDA-VLM? #186

May you add IDA-VLM? #186

Comments

jiyt17 commented Oct 17, 2024

xjtupanda commented Oct 29, 2024 • edited Loading

jiyt17 commented Oct 29, 2024

xjtupanda commented Oct 29, 2024 •

edited

Loading