集成与可观测性

在工作流结构清晰之后，接下来的问题是：哪些外部接口应置于智能体循环内部，以及如何检查运行时的实际情况。

选择 SDK 中的内置功能

需求	起始项	原因
让智能体能够访问公开的、远程托管的 MCP 工具	SDK 中托管的 MCP 工具	模型可以通过托管接口调用远程 MCP 服务器
在运行时连接本地或私有 MCP 服务器	通过 stdio 或可流式 HTTP 进行 SDK 管理的 MCP 服务器	由你的运行时掌控连接、审批和网络边界
调试提示、工具、交接或审批	内置跟踪	在正式制定评估体系之前，跟踪记录可提供端到端的完整记录

工具功能的语义仍然存在于使用工具。本页面重点介绍特定于 SDK 的 MCP 连接与可观测性循环。

MCP

当远程服务器应通过模型接口运行时，请使用托管的 MCP 工具。

接入托管的 MCP 服务器

typescript

1
2
3
4
5
6
7
8
9
10
11
12
import { Agent, hostedMcpTool } from "@openai/agents";

const agent = new Agent({
  name: "MCP assistant",
  instructions: "Use the MCP tools to answer questions.",
  tools: [
    hostedMcpTool({
      serverLabel: "gitmcp",
      serverUrl: "https://gitmcp.io/openai/codex",
    }),
  ],
});

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
from agents import Agent, HostedMCPTool

agent = Agent(
    name="MCP assistant",
    instructions="Use the MCP tools to answer questions.",
    tools=[
        HostedMCPTool(
            tool_config={
                "type": "mcp",
                "server_label": "gitmcp",
                "server_url": "https://gitmcp.io/openai/codex",
                "require_approval": "never",
            }
        )
    ],
)

当你的应用程序应直接连接到 MCP 服务器时，请使用本地传输方式。

连接本地 MCP 服务器

typescript

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
import { Agent, MCPServerStdio, run } from "@openai/agents";

const server = new MCPServerStdio({
  name: "Filesystem MCP Server",
  fullCommand: "npx -y @modelcontextprotocol/server-filesystem ./sample_files",
});

await server.connect();

try {
  const agent = new Agent({
    name: "Filesystem assistant",
    instructions: "Read files with the MCP tools before answering.",
    mcpServers: [server],
  });

  const result = await run(agent, "Read the files and list them.");
  console.log(result.finalOutput);
} finally {
  await server.close();
}

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
import asyncio

from agents import Agent, Runner
from agents.mcp import MCPServerStdio


async def main() -> None:
    async with MCPServerStdio(
        name="Filesystem MCP Server",
        params={
            "command": "npx",
            "args": [
                "-y",
                "@modelcontextprotocol/server-filesystem",
                "./sample_files",
            ],
        },
    ) as server:
        agent = Agent(
            name="Filesystem assistant",
            instructions="Read files with the MCP tools before answering.",
            mcp_servers=[server],
        )
        result = await Runner.run(agent, "Read the files and list them.")
        print(result.final_output)


if __name__ == "__main__":
    asyncio.run(main())

The practical split is:

使用 托管 MCP 适用于符合平台信任模型的公开远程服务器。
使用 本地或私有 MCP 当你的运行时应自主掌控连接、过滤或审批时使用。

有关全平台的概念、信任模型和产品支持说明，请将 MCP 与连接器作为权威参考。

跟踪

跟踪功能内置于 Agents SDK 中，并在标准的服务器端 SDK 路径中默认启用。每次运行都可以生成包含模型调用、工具调用、交接、护栏和自定义 Span 的结构化记录，你可以在跟踪面板.

默认跟踪通常会为你提供：

整体运行或工作流
每次模型调用
工具调用及其输出
交接和护栏
你在工作流周围封装的任何自定义 Span

如果你需要减少跟踪数据，请使用 SDK 级别或单次运行的跟踪控制，而不是彻底移除工作流中的所有可观测性。

将多次运行封装在同一个跟踪中

typescript

1
2
3
4
5
6
7
8
9
10
11
12
13
import { Agent, run, withTrace } from "@openai/agents";

const agent = new Agent({
  name: "Joke generator",
  instructions: "Tell funny jokes.",
});

await withTrace("Joke workflow", async () => {
  const first = await run(agent, "Tell me a joke");
  const second = await run(agent, `Rate this joke: ${first.finalOutput}`);
  console.log(first.finalOutput);
  console.log(second.finalOutput);
});

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
import asyncio

from agents import Agent, Runner, trace

agent = Agent(
    name="Joke generator",
    instructions="Tell funny jokes.",
)


async def main() -> None:
    with trace("Joke workflow"):
        first = await Runner.run(agent, "Tell me a joke")
        second = await Runner.run(
            agent,
            f"Rate this joke: {first.final_output}",
        )
        print(first.final_output)
        print(second.final_output)


if __name__ == "__main__":
    asyncio.run(main())

使用跟踪来完成两项任务：

调试单次工作流运行并了解期间发生的具体情况。
一旦你准备好对行为进行系统评分，就将高信号样本输入到智能体工作流评估中。

后续步骤

接入外部接口后，请继续阅读涵盖功能设计、审查边界或评估的指南。

使用工具

了解托管工具、函数工具及智能体即工具如何与 MCP 配合使用。

护栏与人工审查

围绕敏感功能添加审批或验证边界。

智能体工作流评估

在行为稳定后，从单次跟踪转向可重复的评分。

推荐

入门

核心概念

Apps SDK

工具

运行与扩展

评估

实时与音频

模型优化

专业模型

正式上线

旧版 API

资源

入门指南

使用 Codex

配置

管理

自动化

学习

发布

核心概念

规划

构建

部署

转化应用

指南

资源

指南

文件上传

API

衡量

广告主 API

API 参考

最新

主题

主题

贡献

分类

主题

项目

活动

选择 SDK 中的内置功能

MCP

跟踪

后续步骤