
How to service agents over HTTP #126

Open
rahilvora opened this issue Mar 13, 2025 · 6 comments
Labels
question Question about using the SDK

Comments

@rahilvora

Please read this first

  • Have you read the docs? Agents SDK docs
  • Have you searched for related issues? Others may have had similar requests

Question

I am looking at this documentation, and it helps me understand how I can create multiple agents and add tools to them.

But I am curious how I can create an application server that serves my agents (created using the SDK) over HTTP, so that different frontend apps can communicate with them.

Can someone point me to appropriate resources/examples that can help me understand the serving part?

@rahilvora rahilvora added the question Question about using the SDK label Mar 13, 2025
@rm-openai
Collaborator

Here's an example I found on the web that uses FastAPI + the OpenAI SDK: https://blog.pamelafox.org/2024/01/using-fastapi-for-openai-chat-backend.html

You can swap out the parts that use the OpenAI SDK for code that uses the Agents SDK.

We can also work on a sample like this in this repo!
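
For reference, here's a minimal sketch of that substitution (the endpoint path, request model, and agent instructions below are illustrative placeholders, not from the linked post):

# pip install fastapi uvicorn openai-agents
from fastapi import FastAPI
from pydantic import BaseModel

from agents import Agent, Runner

app = FastAPI()

# A single agent; swap in your own instructions, tools, and handoffs.
assistant = Agent(name="assistant", instructions="You are a helpful assistant.")


class ChatRequest(BaseModel):
    message: str


@app.post("/chat")
async def chat(request: ChatRequest):
    # Runner.run executes the agent loop once and returns the final output.
    result = await Runner.run(assistant, request.message)
    return {"reply": result.final_output}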

@rahilvora
Author

rahilvora commented Mar 13, 2025

@rm-openai I can help with the sample. I can create one in this repo as a reference, and you can review it. Should I create a new issue and start working on it?

@masterkain

masterkain commented Mar 13, 2025

Very basic example; I suggest reading the guide, since it also covers handoffs, streaming, tool usage, etc.

# uv run python -m uvicorn main:app --reload

import os

os.environ["OPENAI_API_KEY"] = "sk-xxxx"

import openai
from fastapi import FastAPI
from fastapi.responses import HTMLResponse
from pydantic import BaseModel

app = FastAPI()


# ---------------------------
# Example 1: Using the New Responses API
# ---------------------------
@app.get("/responses")
async def responses_endpoint():
    # This example uses the Responses API with the built-in web search tool.
    # Note: this is a synchronous client call inside an async endpoint, so it
    # blocks the event loop; fine for a demo, but consider AsyncOpenAI otherwise.
    response = openai.responses.create(
        model="gpt-4o-mini",  # a model that supports the Responses API
        input="What is a positive news story from today?",
        tools=[{"type": "web_search_preview"}],
    )
    return {"result": response.output_text}


# ---------------------------
# Define a Pydantic model for the outline checker's output.
# ---------------------------
class OutlineCheckerOutput(BaseModel):
    good_quality: bool
    is_scifi: bool


# ---------------------------
# Example 2: Using the New Agents SDK for a Multi-Agent Workflow
# ---------------------------
from agents import Agent, Runner, trace

# Agent to generate a story outline.
story_outline_agent = Agent(name="story_outline_agent", instructions="Generate a short sci-fi story outline based on the user's prompt.")

# Agent to check the outline's quality, now using a proper Pydantic model.
outline_checker_agent = Agent(
    name="outline_checker_agent",
    instructions=("Check if the story outline is of good quality and if it is a sci-fi story. " "Return a JSON object with keys 'good_quality' (boolean) and 'is_scifi' (boolean)."),
    output_type=OutlineCheckerOutput,
)

# Agent to write the final story based on the approved outline.
story_writer_agent = Agent(name="story_writer_agent", instructions="Write a complete short story based on the given outline.")


@app.get("/agents")
async def agents_endpoint(prompt: str):
    with trace("Multi-agent workflow"):
        # Step 1: Generate the outline.
        outline_result = await Runner.run(story_outline_agent, prompt)
        # Step 2: Check the outline quality.
        checker_result = await Runner.run(outline_checker_agent, outline_result.final_output)
        checker_output = checker_result.final_output
        if not (checker_output.good_quality and checker_output.is_scifi):
            return {"result": "Outline did not pass quality check. Workflow stopped."}
        # Step 3: Write the final story.
        story_result = await Runner.run(story_writer_agent, outline_result.final_output)
        return {"result": story_result.final_output}


# ---------------------------
# Home Page: A simple HTML interface
# ---------------------------
@app.get("/")
async def home():
    html_content = """
    <html>
        <head>
            <title>OpenAI Responses & Agents Demo</title>
        </head>
        <body>
            <h1>OpenAI Responses & Agents Demo</h1>
            <ul>
                <li><a href="/responses">Test Responses API</a></li>
                <li>
                    <form action="/agents" method="get">
                        <input name="prompt" type="text" placeholder="Enter story prompt" />
                        <button type="submit">Run Agents Workflow</button>
                    </form>
                </li>
            </ul>
        </body>
    </html>
    """
    return HTMLResponse(content=html_content)

@rahilvora
Author

@rm-openai Created a PR for this: #168

@isaac47

isaac47 commented Mar 18, 2025

Hi,

I'm trying to stream the output of an agent from a FastAPI endpoint, similar to how OpenAI does it, but the following code doesn't work as expected:

import json

# agent is created here

result = Runner.run_streamed(agent, "tell me a joke.")

async for event in result.stream_events():
    json.dumps(event.model_dump(), ensure_ascii=False) + "\n"

I want to receive and process each streamed event progressively, but I'm not getting any output. How can I properly implement streaming in this case?

Any guidance would be appreciated. Thanks!

@TimoVink

TimoVink commented Mar 18, 2025

There are different ways to return an event stream. Personally, I use SSE (server-sent events) in my experiments. It looks something like this:

from fastapi import FastAPI
from fastapi.responses import StreamingResponse
from agents import Runner, Agent
import json

app = FastAPI()

def _sse_response(generator):
    async def _wrapper():
        async for event in generator:
            yield f'data: {json.dumps(event)}\n\n'
    result = StreamingResponse(
        _wrapper(),
        media_type='text/event-stream',
        headers={ 'Content-Encoding': 'none' }
    )
    return result

async def _agent_stream(agent, prompt):
    result = Runner.run_streamed(agent, input=prompt, max_turns=32)
    async for event in result.stream_events():
        # Handle whatever events you're interested in here, `yield` JSON-serializable objects
        if event.type == 'raw_response_event':
            if event.data.type == 'response.output_text.delta':
                yield {
                    'type': 'output_delta',
                    'delta': event.data.delta,
                }

def _agent_response(agent, prompt):
    return _sse_response(_agent_stream(agent, prompt))

@app.get('/api/haiku')
def get_haiku(prompt: str):
    agent = Agent(name="Haiku", instructions="Write a Haiku about the given topic.")
    return _agent_response(agent, prompt)

Note that the events emitted by Runner.run_streamed are not JSON serializable. The events and their properties appear to be a mix of dataclasses and Pydantic models.

In the example above I pick out the events I care about (just response.output_text.delta) and emit my own dicts. Alternatively you could write a custom JSONEncoder or other mechanism to serialize the events.
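
For instance, here's a rough sketch of that custom JSONEncoder approach (untested; it assumes the events are a mix of Pydantic models and dataclasses, as described above):

import dataclasses
import json

from pydantic import BaseModel


class EventEncoder(json.JSONEncoder):
    def default(self, obj):
        if isinstance(obj, BaseModel):
            # mode="json" coerces nested values into JSON-safe types.
            return obj.model_dump(mode="json")
        if dataclasses.is_dataclass(obj) and not isinstance(obj, type):
            # Shallow per-field dict, so json.dumps recurses and calls
            # default() again on any nested non-serializable objects.
            return {f.name: getattr(obj, f.name) for f in dataclasses.fields(obj)}
        return super().default(obj)

# Then in the SSE wrapper:
#     yield f'data: {json.dumps(event, cls=EventEncoder)}\n\n'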

The output for this endpoint looks something like:

[Image: sample SSE output from the /api/haiku endpoint]
