What is ReWOO?

Author

Staff Writer

IBM Think

ReWOO (short for “reasoning without observation”) is a reasoning framework that makes large language models (LLMs) more cost-effective and accurate in some complex reasoning applications. Models with ReWOO engage in a reasoning process around a problem before trying to solve it, leading to much greater efficiency, accuracy and robustness under tool failure.

The first LLMs (like OpenAI’s GPT-1 and GPT-2 models) provided answers directly; the subsequent wave of chain-of-thought models that debuted in 2022 added an element of externalized reasoning, with models essentially “thinking out loud” as they arrived at a response, improving accuracy and explainability.

Next came a generation of augmented language models (“ALM systems”) and AI agents, which added tool-calling capabilities on top of this reasoning. The early ALM frameworks—like ReAct—employ a thought-action-observation pattern, where the system will observe what it generates before beginning to think again. While generally effective, frameworks like ReAct can require heavy token consumption, since each subsequent tool call must include all the conversation history that precedes it—a cost that compounds with each step.

ReWOO breaks away from the think-act-observe pattern by decoupling reasoning from external observations, allowing the model to plan its chain of reasoning internally before selectively invoking tools or retrieving information. This separation reduces unnecessary back-and-forth and allows the model to maintain a plan throughout the task.

Industry newsletter

The latest AI trends, brought to you by experts

Get curated insights on the most important—and intriguing—AI news. Subscribe to our weekly Think newsletter. See the IBM Privacy Statement.

How ReWOO works

ReWOO employs three distinct modules, which divide and conquer complex tasks. First, a Planner module maps out a blueprint for how the model will behave based on the user’s prompt. Second, a Worker module executes the plan, calling external tools (without repeating costly LLM API calls for “thinking,” as in ReAct). Finally, a Solver module takes the plans and evidence, synthesizing the final response.

Though the difference in approach may seem minor, the results are dramatic: ReWOO performs as well (or slightly better) than ReAct against some benchmarks—all while using about 80% fewer tokens. (A token is a unit of semantic meaning for an AI model; the more tokens, the higher the cost of operation.) For instance, on the HotpotQA dataset (one battery of questions used to evaluate AI systems), ReWOO achieves 42.4% accuracy using 2,000 tokens, while ReAct achieves 40.8% accuracy using 10,000 tokens.

Crucially, this optimization of token efficiency makes reasoning models economically viable at scale.

ReAct vs. ReWOO: a real-world example

To illustrate the difference between these two common generative AI frameworks, let’s examine a specific use case. Consider the different ways a ReAct vs. ReWOO system would address a user query asking for help packing for a trip that involves a flight between New York and Chicago tomorrow, followed by a drive to Milwaukee a day later.

A ReAct system would decompose the problem into a sequence of three thought-action-observations cycles before giving its final answer. In the first cycle, it would think, “I need to check tomorrow’s weather in New York,” using retrieval-augmented generation (RAG) to search for that weather (an action), finally observing the result. That result then serves as the input towards another three-step think-act-observe cycle for Chicago’s weather. Third, it would do the same for Milwaukee’s weather. Finally, it would collate its findings into an output (e.g. “Pack layers, because it gets colder in each location”).

A ReWOO-style system, by contrast, would gain efficiencies by doing all the planning upfront. First, it would plan, “I need to get New York’s weather tomorrow, Chicago’s weather tomorrow, and Milwaukee’s weather the day after.” Next, it would work, by calling weather APIs in one tight sequence (or potentially in parallel), without doing any costly “thinking” at this workhorse step. Finally, it would solve, collating the evidence and outputting a final answer.

AI agents

5 Types of AI Agents: Autonomous Functions & Real-World Applications

Learn how goal-driven and utility-based AI adapt to workflows and complex environments.

Build, deploy and monitor AI agents

Benefits and downsides of ReWOO

In addition to token efficiency, ReWOO demonstrates an additional benefit: robustness under tool-failure. If a tool fails under ReAct, for instance, the system can get caught in an infinite loop (as the LLM repeatedly queries a broken database for the weather in Chicago, for example).

ReWOO is nimbler. Even if a tool fails to return a given piece of evidence, the initial overarching plan is still in place: The Worker module can progress, and the Solver module will be able to deliver at least a partial answer. In the weather example, instead of getting caught in an infinite or excessive loop querying a database for Chicago’s weather, the Solver module would at least return an answer informing the user of New York’s and Milwaukee’s weather (assuming the Worker module was able to retrieve those bits of evidence), which might ultimately be sufficiently helpful for the user’s planning needs.

Despite ReWOO’s benefits, it is not a universally superior framework; it is simply better for certain kinds of jobs, particularly when the types and quantities of evidence needed are regular and predictable. Where ReWOO falls short, however, are with less predictable or structured problems that may require creativity, exploration or improvisation. With known unknowns, ReWOO excels, but with unknown unknowns, it flounders.

For instance, ReWOO would not be an optimal for debugging Python code, an exploratory and iterative process where each fix might yield new errors and clues, with the best-laid plans quickly becoming obviated. A more adaptable framework like ReAct, while less token-efficient in the abstract, would ultimately be a better match for such a problem.

How to implement ReWOO

As with most AI systems and frameworks, various approaches are available for the implementation of ReWOO workflows. An “official” implementation of the framework, which was first described by researcher Binfeng Xu (along with his colleagues, in 2023[1]), is available via Github. GenAI frameworks like LangGraph (which calls its modules “nodes”) and the related LangChain are also popular. And a ReWOO-style multi-step reasoning methodology is also available using IBM’s Granite.

One can get started with ReWOO at a conceptual level in any LLM environment with a well-crafted prompt that simply encourages the AI to craft a step-by-step plan to answer subsequent questions before progressing to any tool input.

The paper first describing ReWOO, for instance, includes example prompts, including one that begins: “For the following task, make plans that can solve the problem step by step. For each plan, indicate which external tool together with tool input to retrieve evidence.” The study authors add, however, that “ReWOO is a general paradigm and prompts are not necessarily fixed. We encourage readers and users to adjust the prompts tailored to their own needs.”¹

Building and evaluating AI agents that work in the real world

Join this webinar to explore how to operationalize AI agents with enterprise-grade reliability. Learn how to integrate agentic AI with deterministic workflows to ensure governance, security and consistent outcomes, while applying structured evaluation and observability practices to scale intelligent automation with confidence.

Abstract portrayal of AI agent, shown in isometric view, acting as bridge between two systems

Build, run and manage AI agents with watsonx Orchestrate

Resources

The enterprise in 2030: Engineered for perpetual innovation

Discover our five predictions about what will define the most successful enterprises in 2030, and the steps leaders can take to gain an AI-first advantage.

AI governance imperative: evolving regulations and emergence of agentic AI

Learn how evolving regulations and the emergence of AI agents are reshaping the need for robust AI governance frameworks.

Agentic AI explained

Techsplainers by IBM breaks down the essentials of agentic AI, from key concepts to real‑world use cases. Clear, quick episodes help you learn the fundamentals fast.

Unlock AI ROI: A tactical guide to enterprise productivity

Learn proven strategies to boost productivity and power enterprise transformation with AI and innovation at the core.

IDC MarketScape names IBM a leader in 2025 gen AI evaluation technology

Download the report to learn why IDC MarketScape named IBM a leader in 2025 gen AI evaluation technology, and how watsonx.governance® advances risk management, reporting and integration.

How AI agents and assistants can benefit your organization

Dive into this comprehensive guide that breaks down key use cases and core capabilities, providing step-by-step recommendations to help you choose the right solutions for your business.

Reimagine business productivity with AI agents and assistants

Learn how AI agents and AI assistants can work together to achieve new levels of productivity.

Try watsonx Orchestrate®

Explore how generative AI assistants can lighten your workload and improve productivity.

From AI projects to profits: How agentic AI can sustain financial returns

Learn how organizations are shifting from launching AI in disparate pilots to using it to drive transformation at the core.

Omdia Report on empowered intelligence: The impact of AI agents

Discover how you can unlock the full potential of gen AI with AI agents.

How AI agents will reinvent productivity

Learn ways to use AI to be more creative, efficient and start adapting to a future that involves working closely with AI agents.

Ushering in the agentic enterprise: Putting AI to work across your entire technology estate

Stay updated about the new emerging AI agents, a fundamental tipping point in the AI revolution.

The future of agents, AI energy consumption, Anthropic computer use and Google watermarking AI-generated text

Stay ahead of the curve with our AI experts on this episode of Mixture of Experts as they dive deep into the future of AI agents and more.

How Comparus is using a "banking assistant"

Comparus used solutions from IBM watsonx.ai® and impressively demonstrated the potential of conversational banking as a new interaction model.

Footnotes

1. "ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models," Binfeng Xu, Zhiyuan Peng, Bowen Lei, Subhabrata Mukherjee, Yuchen Liu, Dongkuan Xu. Arxiv.org, 23 May 2023.