[agentserver-responses] Harden response model, type safety, and builder API by ankitbko · Pull Request #46302 · Azure/azure-sdk-for-python

ankitbko · 2026-04-14T06:25:25Z

[agentserver-responses] Harden response model, type safety, and builder API

Summary

Comprehensive hardening of the azure-ai-agentserver-responses package to ensure strict type safety, correct model lifecycle, robust builder APIs, and minimal public API surface.

Changes

Response Model Always Present (Bug Fix)

ResponseEventStream now always initialises a ResponseObject envelope at construction
Eliminates None-reference errors when handlers access stream.response before emit_created()
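
As a rough illustration of this fix, the sketch below builds the response envelope in the constructor rather than in emit_created(). SketchEventStream and ResponseEnvelope are hypothetical stand-ins for ResponseEventStream and ResponseObject, and the field names and default status are assumptions rather than the package's actual code.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ResponseEnvelope:
    """Hypothetical stand-in for the package's ResponseObject model."""

    id: str
    status: str = "in_progress"  # assumed default, for illustration only
    output: list[Any] = field(default_factory=list)


class SketchEventStream:
    """Hypothetical stand-in for ResponseEventStream."""

    def __init__(self, response_id: str) -> None:
        # The envelope exists from construction onward, so handlers can read
        # stream.response before any lifecycle event has been emitted.
        self.response = ResponseEnvelope(id=response_id)

    def emit_created(self) -> dict[str, Any]:
        # Emitting only reports the existing envelope; it does not create it.
        return {"type": "response.created", "response_id": self.response.id}


stream = SketchEventStream("resp_123")
early_status = stream.response.status  # safe before emit_created()
```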

Type Safety Audit

All emit_* methods across 16 builder classes return specific event subtypes via typing.cast() instead of the base ResponseStreamEvent
70 new unit tests validate return types for every emitter method
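
A minimal sketch of the return-type narrowing described above, assuming a shared helper that is typed to return only the base event. StreamEvent, CreatedEvent, and _build_event are hypothetical names, not the package's classes. typing.cast() changes nothing at runtime; it only informs the type checker.

```python
from typing import Any, cast


class StreamEvent(dict):
    """Hypothetical base event type (stand-in for ResponseStreamEvent)."""


class CreatedEvent(StreamEvent):
    """Hypothetical subtype (stand-in for ResponseCreatedEvent)."""


def _build_event(event_cls: type, payload: dict[str, Any]) -> StreamEvent:
    # Shared low-level helper that is only typed to return the base class.
    return event_cls(payload)


def emit_created(response_id: str) -> CreatedEvent:
    event = _build_event(CreatedEvent, {"type": "response.created", "id": response_id})
    # cast() is a no-op at runtime; it tells the type checker which subtype
    # the helper produced so callers do not need their own isinstance checks.
    return cast(CreatedEvent, event)


created = emit_created("resp_123")
```

Because cast() performs no runtime check, isinstance assertions like those in the new unit tests are what confirm that the declared subtype matches the object actually returned.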

Contract Type Tests

22 tests codifying structural rules (sequence numbering, status transitions, required fields, etc.) derived from the .NET reference implementation

Deterministic Session ID Derivation

derive_session_id() produces SHA-256 based IDs from conversation context, matching .NET SessionIdDerivation.Derive

OutputItemBuilder Tightening

OutputItemBuilder.emit_added() / emit_done() accept only OutputItem model instances (no raw dicts)

Public API Parameter Tightening (dict → generated models)

Constructor: agent_reference, request, response accept only their respective model types
Terminal methods (emit_completed/emit_failed/emit_incomplete): usage accepts only ResponseUsage
Convenience generators: all action, output, environment, operation params accept only generated model types
emit_annotation_added: accepts only Annotation (no dict)
Custom tool call output: output tightened to str | list[FunctionAndCustomToolCallOutput]
Function call output builder: output tightened to str | list[InputTextContentParam | InputImageContentParamAutoParam | InputFileContentParam]

API Surface Reduction — Internalized Methods

emit_event() → _emit_event(): low-level dict-based emitter
with_output_item_defaults() → _with_output_item_defaults(): item stamping helper
validate_response_event_stream() → _validate_response_event_stream()
normalize_lifecycle_events() → _normalize_lifecycle_events()

API Surface Reduction — Removed Exports & EVENT_TYPE Alias

Removed EVENT_TYPE alias entirely: replaced ~80 usages across 10 files with generated_models.ResponseStreamEventType directly
streaming exports removed: EVENT_TYPE, encode_sse_event, encode_keep_alive_comment
hosting exports removed (14 symbols): all observability types (CreateSpan, CreateSpanHook, InMemoryCreateSpanHook, RecordedSpan, build_create_span_tags, build_platform_server_header, start_create_span) and all validation functions (build_api_error_response, build_invalid_mode_error_response, build_not_found_error_response, parse_and_validate_create_response, parse_create_response, to_api_error_response, validate_create_response)
models exports removed: ResponseExecution, StreamEventRecord, StreamReplayState, get_instruction_items, get_output_item_id
top-level exports removed: to_output_item

Docs & Samples

Handler implementation guide updated: fixed positional args → keyword-only, model= → request= pattern, usage examples use ResponseUsage model
Annotation sample uses model instances
Method reference tables document typed return values

Test Results

786 passed, 1 skipped
ruff: clean
mypy: clean (streaming module)

Default model to empty string when not provided in the request, ensuring the field is always present in the response payload. The OpenAI SDK requires model to be present to deserialize the response object.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Ensures the Responses hosting layer always stamps a model field into response payloads (even when omitted from the request), preventing downstream clients (notably the OpenAI SDK) from failing to deserialize responses when model is missing.

Changes:

Default model to "" when building the per-request execution context so apply_common_defaults() will always include model in lifecycle snapshots.
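
The defaulting rule can be sketched as follows. build_execution_context is a hypothetical helper and the dict-based context is an assumption; the real execution context type is not shown in this PR text.

```python
from typing import Any


def build_execution_context(payload: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical helper; illustrates the defaulting rule only."""
    # A missing or null model becomes "" so the key is always serialized.
    model = payload.get("model") or ""
    return {"model": model, "stream": bool(payload.get("stream", False))}


# "my-model" is a placeholder name, not a real deployment.
with_model = build_execution_context({"model": "my-model", "stream": True})
without_model = build_execution_context({"input": "hello"})
```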

Address PR review feedback: add contract tests verifying the model field is present in the response payload when omitted from the request, for both sync (stream=False) and streaming (stream=True) modes.

…eleted'

The OpenAI spec returns {id, object: 'response', deleted: true} for DELETE /responses/{id}. Our handler was returning 'response.deleted', which doesn't match. Fixed the handler and updated all 5 test assertions.
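
For reference, the payload shape quoted in this commit message can be sketched as below. build_delete_payload is a hypothetical helper name; only the three fields come from the commit message.

```python
import json


def build_delete_payload(response_id: str) -> dict[str, object]:
    """Hypothetical helper returning the delete payload shape quoted above."""
    # "object" is the literal "response"; per the commit message, the old
    # handler returned "response.deleted" here instead.
    return {"id": response_id, "object": "response", "deleted": True}


payload = build_delete_payload("resp_123")
body = json.dumps(payload)
```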

ResponseExecution now carries agent_session_id and conversation_id so that _RuntimeState.to_snapshot can forcibly stamp them (S-038/S-040) on both the response.as_dict() path and the minimal fallback dict. All four orchestrator ResponseExecution creation sites pass both fields from the execution context.

The manual _patch.py override of ResponseObject.output erased the element type (list instead of list[OutputItem]), preventing the model framework from deserializing nested dicts into OutputItem instances. This caused get_history to return plain dicts instead of typed models.

Changes:
- Remove output:list override; use generated list[OutputItem]
- Remove ToolChoiceAllowed override (generated type is identical)
- Move Sphinx docstring fixes into models_patch.py shim so make generate-models preserves them instead of overwriting
- Accept emitter upgrade to model_base.py (XML refactor)
- Regenerate _validators.py from current TypeSpec sources

…type tests

- Fix track_completed_output_item to use OutputItem._deserialize(dict, []) instead of OutputItem(dict) so response.output contains proper discriminated subtypes (OutputItemMessage, OutputItemFunctionToolCall, etc.) instead of base OutputItem instances. This ensures handler devs can use isinstance() and attribute access on output items.
- Add test_public_contract_types.py with 22 tests covering every public handler/consumer surface for type fidelity:
  * context.request → CreateResponse
  * context.get_input_items() → Item subtypes
  * context.get_input_text() → str
  * context.get_history() → OutputItem subtypes (first-ever coverage)
  * stream.response → ResponseObject
  * stream.response.output → OutputItem subtypes
  * Builder emit_* → ResponseStreamEvent subtypes
  * Generator convenience → ResponseStreamEvent subtypes
  * InMemoryProvider round-trip preserves subtypes
- Add isinstance assertions to existing tests in test_builders.py, test_event_stream_generators.py, and test_response_event_stream_builder.py
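
The difference between constructing the base type directly and deserializing through a discriminator can be illustrated with a toy model. Item, MessageItem, and FunctionCallItem are hypothetical classes, and the "type" values are illustrative. This is not the generated model framework or its _deserialize implementation.

```python
from typing import Any


class Item:
    """Hypothetical base type (stand-in for OutputItem)."""

    _subtypes: dict[str, type] = {}

    def __init__(self, data: dict[str, Any]) -> None:
        self.data = data

    def __init_subclass__(cls, *, discriminator: str, **kwargs: Any) -> None:
        super().__init_subclass__(**kwargs)
        # Register each subtype under its discriminator value.
        Item._subtypes[discriminator] = cls

    @classmethod
    def deserialize(cls, data: dict[str, Any]) -> "Item":
        # Pick the subtype from the "type" field, falling back to the base.
        target = cls._subtypes.get(data.get("type", ""), cls)
        return target(data)


class MessageItem(Item, discriminator="message"):
    @property
    def role(self) -> str:
        return self.data["role"]


class FunctionCallItem(Item, discriminator="function_call"):
    @property
    def name(self) -> str:
        return self.data["name"]


raw = {"type": "function_call", "id": "fc_1", "name": "lookup"}
via_constructor = Item(raw)              # stays a plain base instance
via_deserialize = Item.deserialize(raw)  # resolves to FunctionCallItem
```

Only the deserialized instance supports isinstance() checks against the subtype and subtype-specific attribute access, which is the behaviour the fix aims to give handler developers.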

Replace random UUID fallback for agent_session_id with deterministic SHA-256 derivation matching .NET SessionIdDerivation logic.

Priority chain:
1. Explicit agent_session_id from payload (unchanged)
2. Platform env FOUNDRY_AGENT_SESSION_ID (unchanged)
3. Deterministic: SHA256(agent_name:agent_version:partition_hint) where partition_hint is extracted from conversation_id or previous_response_id via IdGenerator.extract_partition_key
4. Random 63-char lowercase hex (one-shot, no conversational context)

This ensures session affinity: the same conversation + agent identity always resolves to the same session ID, enabling stateful backends to route consistently without requiring explicit session IDs.

New functions in _request_parsing.py:
- derive_session_id(): public deterministic derivation
- _compute_hex_hash(): SHA-256 → 63-char hex
- _generate_random_hex(): os.urandom fallback
- _extract_agent_identity(): name/version from agent_reference

Updated _resolve_session_id() signature to accept agent_reference. Updated call site in _endpoint_handler.py to pass agent_reference. Updated all tests (unit + contract) from UUID to 63-char hex format. Added 14 new derivation tests covering determinism, agent isolation, version isolation, priority, and non-standard ID formats.
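
The priority chain can be sketched roughly as follows. The function names and signatures here are illustrative, not the ones in _request_parsing.py. The commit message does not say how a 64-character SHA-256 hex digest becomes 63 characters, so truncation is an assumption, as are the UTF-8 encoding and the exact ":" joining of the inputs.

```python
import hashlib
import os
from typing import Mapping, Optional

SESSION_ID_LENGTH = 63  # length quoted in the commit message


def compute_hex_hash(value: str) -> str:
    # Assumption: truncate the 64-char SHA-256 hex digest to 63 chars.
    return hashlib.sha256(value.encode("utf-8")).hexdigest()[:SESSION_ID_LENGTH]


def generate_random_hex() -> str:
    # 32 random bytes give 64 lowercase hex chars; keep the first 63.
    return os.urandom(32).hex()[:SESSION_ID_LENGTH]


def resolve_session_id(
    explicit: Optional[str],
    agent_name: str,
    agent_version: str,
    partition_hint: Optional[str],
    env: Mapping[str, str] = os.environ,
) -> str:
    if explicit:  # 1. explicit ID from the payload
        return explicit
    env_value = env.get("FOUNDRY_AGENT_SESSION_ID")
    if env_value:  # 2. platform-provided ID
        return env_value
    if partition_hint:  # 3. deterministic derivation
        return compute_hex_hash(f"{agent_name}:{agent_version}:{partition_hint}")
    return generate_random_hex()  # 4. one-shot request, no context


first = resolve_session_id(None, "my-agent", "1", "conv_abc", env={})
second = resolve_session_id(None, "my-agent", "1", "conv_abc", env={})
other_version = resolve_session_id(None, "my-agent", "2", "conv_abc", env={})
```

Passing env explicitly keeps the sketch easy to test; the same inputs always yield the same ID, while a different agent version yields a different one.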

Port .NET pattern: every emit_* method now returns its specific event subtype (e.g. ResponseCreatedEvent, ResponseOutputItemAddedEvent) via typing.cast() instead of the base ResponseStreamEvent.

Covers all builders:
- ResponseEventStream: 6 lifecycle methods
- OutputItemBuilder / BaseOutputItemBuilder: emit_added, emit_done
- OutputItemMessageBuilder, TextContentBuilder, RefusalContentBuilder
- FunctionCallBuilder, FunctionCallOutputBuilder
- ReasoningSummaryPartBuilder, ReasoningItemBuilder
- FileSearchCall, WebSearchCall, CodeInterpreter, ImageGen, McpCall, McpListTools, CustomToolCall builders

Adds test_emit_return_types.py with 70 isinstance assertions covering every public emit_* method across all 16 builder classes.

…tputItem only

Remove dict[str, Any] from the public signature; all item types are generated models. Internal callers use _emit_added/_emit_done directly.

Also: fix handler guide (emit_failed/emit_incomplete kwargs, request= pattern), revert CHANGELOG to initial-release form, remove session ID derivation docs (internal detail).

…del types

- ResponseEventStream constructor: agent_reference, request, response now accept only their respective model types (no dict[str, Any])
- Terminal methods (emit_completed/failed/incomplete): usage accepts only ResponseUsage (no dict[str, Any])
- Convenience generators (output_item_computer_call, _computer_call_output, _local_shell_call, _function_shell_call, _function_shell_call_output, _apply_patch_call): all action/output/environment params accept only their respective generated model types (no dict[str, Any])
- Async mirrors: same tightening as sync counterparts
- emit_annotation_added: annotation accepts only Annotation (no dict)
- _set_terminal_fields: usage tightened
- Internal _build_events: coerce dict→AgentReference before passing to ResponseEventStream
- Tests updated to use model constructors instead of raw dicts
- Docs updated to show ResponseUsage model usage

…[Any] types

- emit_event → _emit_event: internal only, all callers are sibling emit_* methods and _builders subpackage
- with_output_item_defaults → _with_output_item_defaults: internal only, called only by _builders._base
- validate_response_event_stream → _validate_response_event_stream: internal only, called only by _normalize_lifecycle_events
- normalize_lifecycle_events → _normalize_lifecycle_events: internal only, called only by hosting._endpoint_handler
- Removed both from streaming/__init__.py exports
- output_item_custom_tool_call_output: output tightened from str | list[Any] to str | list[FunctionAndCustomToolCallOutput]
- OutputItemFunctionCallOutputBuilder.emit_added/emit_done: output tightened from str | list[Any] to str | list[InputTextContentParam | InputImageContentParamAutoParam | InputFileContentParam]
- Removed unused Any import from _function.py

…alize 22 symbols

- Remove EVENT_TYPE alias: replaced all ~80 usages across 10 files with generated_models.ResponseStreamEventType directly
- Remove from streaming exports: EVENT_TYPE, encode_sse_event, encode_keep_alive_comment
- Remove from hosting exports: CreateSpan, CreateSpanHook, InMemoryCreateSpanHook, RecordedSpan, build_create_span_tags, build_platform_server_header, start_create_span, build_api_error_response, build_invalid_mode_error_response, build_not_found_error_response, parse_and_validate_create_response, parse_create_response, to_api_error_response, validate_create_response
- Remove from models exports: ResponseExecution, StreamEventRecord, StreamReplayState, get_instruction_items, get_output_item_id
- Remove from top-level exports: to_output_item
- Keep public: get_conversation_id, get_input_expanded, get_content_expanded, get_conversation_expanded, get_tool_choice_expanded, all builder classes, ResponseEventStream, TextResponse, all store/Foundry types

Copilot

Pull request overview

Copilot reviewed 34 out of 34 changed files in this pull request and generated 4 comments.

...ntserver/azure-ai-agentserver-responses/azure/ai/agentserver/responses/streaming/_helpers.py

...r-responses/azure/ai/agentserver/responses/models/_generated/sdk/models/_utils/model_base.py

...er/azure-ai-agentserver-responses/azure/ai/agentserver/responses/hosting/_request_parsing.py

...r-responses/azure/ai/agentserver/responses/models/_generated/sdk/models/_utils/model_base.py

ankitbko requested review from RaviPidaparthi and vangarp as code owners April 14, 2026 06:25

Copilot AI review requested due to automatic review settings April 14, 2026 06:25

github-actions bot added the Hosted Agents sdk/agentserver/* label Apr 14, 2026

ankitbko changed the title ~~Always include model field in response payload~~ [agentserver] Always include model field in response payload Apr 14, 2026

Copilot started reviewing on behalf of ankitbko April 14, 2026 06:30

Copilot AI reviewed Apr 14, 2026

RaviPidaparthi approved these changes Apr 14, 2026

RaviPidaparthi added 4 commits April 14, 2026 22:49

Add e2e tests for model field always present in response

ee071e9

RaviPidaparthi force-pushed the fix/responses-model-always-present branch from 6fa2b47 to a141311 Compare April 15, 2026 00:13

RaviPidaparthi added 4 commits April 15, 2026 00:58

RaviPidaparthi changed the title ~~[agentserver] Always include model field in response payload~~ [agentserver-responses] Harden response model, type safety, and builder API Apr 15, 2026

RaviPidaparthi mentioned this pull request Apr 15, 2026

agentserver release #46304

Open

6 tasks

RaviPidaparthi added 3 commits April 15, 2026 02:13

RaviPidaparthi requested a review from Copilot April 15, 2026 03:00

RaviPidaparthi enabled auto-merge (squash) April 15, 2026 03:03

Copilot started reviewing on behalf of RaviPidaparthi April 15, 2026 03:07

Copilot AI reviewed Apr 15, 2026

RaviPidaparthi added 3 commits April 15, 2026 03:11

Merge remote-tracking branch 'origin/main' into fix/responses-model-a…

32ae249

…lways-present

fix: handle None agent_reference, add session ID length comment, remo…

ed734b5

…ve .NET references

fix: fix Sphinx docstring warnings via models_patch (ResponseObject.o…

ce72a94

…utput, ToolChoiceAllowed.tools)

RaviPidaparthi approved these changes Apr 15, 2026

RaviPidaparthi added 2 commits April 15, 2026 05:34

fix: cap aiohttp<4.0.0 to avoid unstable 4.0.0a1 pre-release

937731f

fix: resolve pylint (line-too-long, protected-access) and pyright (in…

55d1151

…valid TYPE_CHECKING import) CI failures

ankitbko wants to merge 17 commits into main from fix/responses-model-always-present

ankitbko commented Apr 14, 2026 • edited by RaviPidaparthi


3 participants
