Skip to content

task_desc can mislead agent. #35

Closed
@HacksonClark

Description

@HacksonClark

Doing mitigation evaluations, I saw the agent get confused solving the AssignNonExistentNodeSocialNetMitigation problem. It was able to get the correct answer, but the task_desc is inaccurate which did confused the agent:

"RemediationJSONReportCustomTool NL prompt received: The final answer to the original input question is that the compose-post-service is experiencing connection refused errors when trying to connect to the user-service. After investigating the logs, metrics, and traces of the user-service pod, I was unable to find any issues. Therefore, I recommend scaling up the user-service deployment to resolve the issue. The remediation plan involves either scaling up using existing resources, adding new resources, or using automated scaling.
RemediationJSONReportCustomTool function arguments identified are: {
"remediation":[
[
{ "action" : "Scale up user-service deployment using existing resources" },
{ "action" : "Add new resources to user-service deployment" },
{ "action" : "Implement automated scaling for user-service deployment" }
]
]"

Here is the task description in the code:

################## Mitigation Problem ##################
class AssignNonExistentNodeSocialNetMitigation(
    AssignNonExistentNodeSocialNetBaseTask, MitigationTask
):
    def __init__(self):
        AssignNonExistentNodeSocialNetBaseTask.__init__(self)
        MitigationTask.__init__(self, self.app)
        self.task_desc += "Start by investigating the `compost-post-service` pod"

I personally think that we should remove these, as they can either mislead the agent or give too easy of a hint. In practice, the agent would never know where the fault is or receive a "hint", so I don't see the purpose of having these included.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions