交错思考能力(思考和行动交替,即把 ReAct 模式嵌入到模型推理过程中)是 OpenAI 在 2025 年 4 月 16 日的 o3 + Responses API 首次上线的(
openai.com)。「The Responses API supports reasoning summaries, the ability to preserve reasoning tokens around function calls for better performance, and will soon support built-in tools like web search, file search, and code interpreter within the model’s reasoning. 」
Anthropic 是第二个,并且给了 interleaved thinking 这个名字,可能是这个导致很多人以为是 Anthropic 第一个推出,「To enable interleaved thinking, add the beta header interleaved-thinking-2025-05-14 to your API request.」
然后才是 MiniMax M2 (
platform.minimax.io)「M2 natively supports Interleaved Thinking, enabling it to reason between each round of tool interactions. Before every Tool Use, the model reflects on the current environment and the tool outputs to decide its next action.」
接着是 Kimi K2 thinking,Gemini 3 Pro 的 Thought signatures(
ai.google.dev),还有 DeepSeek-V3.2,「DeepSeek-V3.2 is our first model to integrate thinking directly into tool-use, and also supports tool-use in both thinking and non-thinking modes.」