feat: selective render #630
Conversation
Desktop App for this PR: The following build is available for testing. The app is signed and notarized for macOS. After downloading, unzip the file and drag Goose.app to your Applications folder. This link is provided by nightly.link and will work even if you're not logged into GitHub.
How do these messages render in the UI / CLI?
Yeah, nice. In the GUI it would be nice if it could have some goose-like icon or prefix, to make it clear whether a message is a side-channel message or just from goose (optional though, as most people probably won't know or care where it came from).
Looks good, I like this solution! Left a comment below on how we can filter these from model inference.
```rust
            &tools,
        ).await?;
        capabilities.record_usage(usage).await;

        // Yield the assistant's response
        yield response.clone();

        // if ctx limit added goose message yield it
```
Thinking through this, we need to make sure that the goose-role messages never get to an LLM provider.
I think we can do that one of two ways:
- make the agent filter the messages out before any provider calls
- update the providers to filter (which I see you've done for anthropic, but I think is missing from the others?)

Vaguely I think the former is a bit easier; maybe add a helper method on the capabilities struct?
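One possible shape for such a helper, as a minimal sketch. The `Role` enum, `Message` struct, and function name here are stand-ins invented for illustration; the real types in the codebase will differ:

```rust
// Hypothetical sketch: strip goose-role messages before any provider call.
// `Role` and `Message` are illustrative stand-ins, not the repo's real types.
#[derive(Clone, Debug, PartialEq)]
pub enum Role {
    User,
    Assistant,
    Goose, // side-channel messages shown to the user only
}

#[derive(Clone, Debug)]
pub struct Message {
    pub role: Role,
    pub text: String,
}

/// Return only the messages that should be sent to an LLM provider.
pub fn messages_for_provider(messages: &[Message]) -> Vec<Message> {
    messages
        .iter()
        .filter(|m| m.role != Role::Goose)
        .cloned()
        .collect()
}
```

With a helper like this, each provider call site passes the filtered list instead of the raw history, so no per-provider filtering is needed.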
@baxen I'm updating the providers to filter goose-role messages, and it's more involved than would be wise to do for tomorrow, so I'm putting it in its own PR. I've found and updated all the providers to skip goose-role messages, but testing all of it will spill into next week.
```rust
            .count_everything(system_prompt, messages, tools, &resources, model);

        let mut new_messages = messages.to_vec();
        if approx_count > target_limit {
            let user_msg_size = self.text_content_size(new_messages.last(), model);
```
Are we sure we should take this out? Was the intention here to catch the case where the user's message is itself too big to fit into the limit? I think that's a reasonable edge case, but we'd probably have to check the combination of the user message and the tool outputs since the last user message.
Before @baxen's review on this, I thought it was necessary too. Upon reflection, I concluded all paths lead to the same outcome, and that the user-msg-size check was duplicative. I'm now thinking about it as follows (let me know if these don't seem complete):
- sum(entire history) > limit: truncate until under
- last user msg > limit: truncate until under
  - => all messages are deleted until only the user message remains,
  - which doesn't bring us below the limit,
  - so after one final deletion the user message is gone too, and no messages remain
  - => return Err
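The cases above can be sketched as a toy oldest-first truncation loop over per-message token counts. The function name and the use of bare `usize` sizes are illustrative only, not the actual implementation:

```rust
// Toy sketch of oldest-first truncation; `sizes` are per-message token counts.
// Returns the surviving suffix, or Err if even the last message exceeds the limit.
fn truncate_to_limit(sizes: &[usize], limit: usize) -> Result<Vec<usize>, String> {
    let mut msgs = sizes.to_vec();
    while msgs.iter().sum::<usize>() > limit {
        msgs.remove(0); // delete the oldest message
        if msgs.is_empty() {
            // the final (user) message alone was over the limit
            return Err("no messages remain under the context limit".to_string());
        }
    }
    Ok(msgs)
}
```

Because the loop works on a copy (`to_vec`), an `Err` leaves the caller's original history untouched, which matches the non-destructive failure behavior discussed in this thread.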
The second case seems problematic in that you lose your whole session history just because your last message or tool outputs were too big. I would think we'd rather discard those messages and raise an error, so the user can enter a different message and/or the LLM is notified that the tool outputs were too large.
What you said should happen is what happens; I just described it poorly.
More specifically, while truncating, if the error condition of "no more messages" occurs, an error is returned. In practice, as a user I see the error string and can then choose to write a new message. At that point the message list contains everything plus my new message, i.e. the truncation process is non-destructive if it fails, and does not commit the user's too-long message.
Still like this idea a lot, but going to close this out for now and we can open something similar against main when we revisit the use case?
This change allows "Goose" (not user, not assistant) to show information to the user that is outside of the user's conversation with the LLM, and avoids sending those out-of-band messages to the LLM.
Ideally, visually, it would be nice to update the prompt so Goose messages show as a third speaker.
Changelist:
- this keeps the Goose messages in the message history.
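A third-speaker prompt could be as simple as a distinct prefix per role in the CLI renderer. A hedged sketch, where the enum, function name, and prefix strings are all invented for illustration rather than taken from the actual UI code:

```rust
// Hypothetical CLI prompt prefixes; the real renderer and strings may differ.
enum Speaker {
    User,
    Assistant,
    Goose,
}

fn prompt_prefix(speaker: &Speaker) -> &'static str {
    match speaker {
        Speaker::User => "you> ",
        Speaker::Assistant => "assistant> ",
        Speaker::Goose => "[goose] ", // visually marks the side channel
    }
}
```

Each rendered line would then be the prefix followed by the message text, so side-channel Goose messages stand apart from the user/assistant exchange at a glance.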