ReAct¶

ReAct (Reasoning + Acting) is covered in the RAG and Agents repo under Agent Fundamentals — it is the foundational agentic pattern and is documented there in full.

Quick summary:

ReAct interleaves reasoning traces (Thought) with actions (Action) and their results (Observation) in a loop:

Thought: I need to find the population of Tokyo.
Action: search("Tokyo population 2024")
Observation: Tokyo population is approximately 13.96 million (city) / 37.4 million (metro).
Thought: I have the answer.
Action: finish("13.96 million in the city proper")

Key properties:

Thought steps are not executed — they are internal scratchpad reasoning
Actions call external tools (search, calculator, code executor, database)
The loop continues until the agent emits a finish action
Introduced by Yao et al. (2023); shown to outperform CoT-only and Act-only baselines on HotpotQA, Fever, AlfWorld

In the context of alignment: ReAct reasoning traces serve as interpretable audit logs — each decision step is visible, which supports human oversight and reward model training on reasoning quality.

Reference: Yao et al. (2023) — "ReAct: Synergizing Reasoning and Acting in Language Models." ICLR 2023.