Automatically remove the string `"<|image_sentinel|>"` from chat outputs #627

rhennigan · 2024-03-16T17:49:45Z

This string occasionally appears in chat outputs when using gpt-4-vision. It's evidently a special token used by OpenAI: when asked to repeat it, the vision model drops it, while the other models echo it back unchanged:

In[1]:= AssociationMap[
    Function[
        LLMSynthesize[
            "Repeat the following string back to me:\nMy favorite token is <|image_sentinel|>!",
            LLMEvaluator -> <| "Model" -> #1, "MaxTokens" -> 64 |>
        ]
    ],
    { "gpt-3.5-turbo", "gpt-4", "gpt-4-turbo-preview", "gpt-4-vision-preview" }
]

Out[1]= <|
    "gpt-3.5-turbo"        -> "My favorite token is <|image_sentinel|>!",
    "gpt-4"                -> "My favorite token is <|image_sentinel|>!",
    "gpt-4-turbo-preview"  -> "My favorite token is <|image_sentinel|>!",
    "gpt-4-vision-preview" -> "\"My favorite token is !\""
|>

The fact that it occasionally appears in chat outputs is almost certainly a bug on OpenAI's end. However, it's simple enough to just delete it from chat outputs as a workaround.
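
As a rough illustration of the workaround, the deletion can be done with the built-in `StringDelete`. This is only a minimal sketch: the helper name `removeImageSentinel` is hypothetical and is not taken from the actual commit, which may apply the fix at a different point in the chat pipeline.

```wl
(* Hypothetical helper, named here for illustration only *)
removeImageSentinel[text_String] := StringDelete[text, "<|image_sentinel|>"];

(* Strings without the sentinel pass through unchanged *)
removeImageSentinel["My favorite token is <|image_sentinel|>!"]
(* "My favorite token is !" *)
```

Matching the literal string rather than a broader pattern such as `"<|" ~~ ___ ~~ "|>"` keeps the workaround from stripping legitimate Wolfram Language association syntax out of chat outputs.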

Automatically remove the string "<|image_sentinel|>" from chat outputs

620e842

rhennigan linked an issue Mar 16, 2024 that may be closed by this pull request

The string <|image_sentinel|> frequently shows up in gpt-4-vision responses #534

Closed

rhennigan merged commit 44c656a into main Mar 16, 2024
1 check passed

rhennigan deleted the 534-the-string-image-sentinel-frequently-shows-up-in-gpt-4-vision-responses branch March 16, 2024 21:46
