为什么多模态关于规范格式的prompt不写在system中 #115

munian08 · 2025-02-18T10:20:34Z

def make_conversation(example):
        return {
            "prompt": [
                {"role": "system", "content": SYSTEM_PROMPT},
                {"role": "user", "content": example["problem"]},
            ],
        }

    QUESTION_TEMPLATE = "{Question}  Output the thinking process in <think> </think> and final answer (number) in <answer> </answer> tags."

    def make_conversation_image(example):
        return {
            "prompt": [
                {
                    "role": "user",
                    "content": [
                        {"type": "image"},
                        {"type": "text", "text": QUESTION_TEMPLATE.format(Question=example["problem"])},
                    ],
                },
            ],
        }

The text was updated successfully, but these errors were encountered:

Yukino256 · 2025-02-19T02:35:34Z

个人猜测，Qwen2vl的system prompt应该是You are Qwen XXXXXXXX XXXX 这个。如果加入system prompt也许对instruct的模型的训练造成不良影响，因为前面那一段的system prompt被替换了？或许base模型可以替换system prompt。

以上仅供猜测🥲🥲

xingenju · 2025-02-19T02:46:44Z

可以实验验证一下, 加入和不加入的效果对比

Jia-py · 2025-02-19T09:07:05Z

这好像是因为qwen在huggingface上给的例子就没有放在system prompt中。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

为什么多模态关于规范格式的prompt不写在system中 #115

为什么多模态关于规范格式的prompt不写在system中 #115

munian08 commented Feb 18, 2025

Yukino256 commented Feb 19, 2025

xingenju commented Feb 19, 2025

Jia-py commented Feb 19, 2025

为什么多模态关于规范格式的prompt不写在system中 #115

为什么多模态关于规范格式的prompt不写在system中 #115

Comments

munian08 commented Feb 18, 2025

Yukino256 commented Feb 19, 2025

xingenju commented Feb 19, 2025

Jia-py commented Feb 19, 2025