Reading Synthesis · Caladrius Health KB

Introduction to Agents

An agent is not an AI model in a static workflow — it is a complete application that reasons, acts, and observes in a loop to achieve goals. This is the readable synthesis of Google's foundational paper: the anatomy of an agent, a taxonomy of capability, and the production discipline to scale from prototype to an enterprise fleet.

Source Google · Introduction to Agents (54 pp) Authors Blount · Gulli · Saboo · Zimmermann · Vuskovic Updated May 2026 Series Day 1 of 5

"Agents are the natural evolution of Language Models, made useful in software."

From Predictive AI to Autonomous Agents

For years AI focused on passive, discrete tasks — answer a question, translate text, generate an image — each demanding constant human direction. We are now in a paradigm shift: from AI that merely predicts or creates content to a new class of software capable of autonomous problem-solving and task execution.

An agent is not an AI model in a static workflow — it is a complete application that makes plans and takes actions to achieve goals.

It fuses a Language Model's ability to reason with the practical ability to act, handling multi-step tasks a model alone cannot. The defining capability is autonomy: agents figure out the next steps toward a goal without a person guiding every turn.

The developer's role changes too. The traditional developer is a bricklayer, defining every logical step. The agent developer is a director: set the scene (instructions, prompts), select the cast (tools, APIs), supply the context (data), then guide an autonomous "actor." An LM's greatest strength — its flexibility, the capacity to do anything — is also the biggest headache: it is hard to compel it to do one thing reliably. What we called "prompt engineering" is now context engineering.

In essence, an agent is a system dedicated to the art of context-window curation: a relentless loop of assembling context, prompting the model, observing the result, and re-assembling context for the next step.

This is the first of a five-part series — a formal guide for moving from proofs-of-concept to production-grade systems. A prototype is easy; ensuring security, quality, and reliability is the real challenge.

The Agentic Problem-Solving Process

The short definition: "LMs in a loop with tools to accomplish an objective." The core loop breaks into five fundamental steps.

Step	Name	What happens
1	Get the Mission	A high-level goal arrives — from a user ("Organize my team's travel") or an automated trigger ("A high-priority ticket arrived").
2	Scan the Scene	Perceive the environment: read the request, consult memory ("Did I try this before?"), inventory accessible tools (calendars, DBs, APIs).
3	Think It Through	The reasoning model analyzes Mission vs. Scene and devises a plan — often a chain of reasoning, not a single thought.
4	Take Action	The orchestration layer executes the first concrete step — invoke a tool, call an API, run code, query a DB.
5	Observe & Iterate	Observe the outcome, fold it into context/memory, loop back to Step 3 until the plan is complete.

This "Think, Act, Observe" cycle is managed by the Orchestration Layer, reasoned by the Model, executed by the Tools.

Worked example — a Customer Support Agent. For "Where is my order #12345?" the agent first plans: (1) Identify the order in the internal DB; (2) Track via the carrier's API; (3) Report a clear answer. It executes find_order("12345") → observes record + tracking ZYX987 → get_shipping_status("ZYX987") → observes "Out for Delivery" → replies "Your order #12345 is 'Out for Delivery'!"

The 5-Level Taxonomy of Agentic Systems

The same loop can be scaled in complexity into different classes of agent — each building on the last. Scoping which level you need is a key early architectural decision.

LVL0
Core Reasoning SystemThe "Brain" alone
An LM in isolation — no tools, memory, or live environment. Deep at explaining concepts and planning approaches, but blind to anything after its training cutoff (can't tell you last night's game score).
LVL1
Connected Problem-SolverReasoning + Tools
Connecting external tools makes it a functional agent. It recognizes a real-time need, calls Search / a financial API / a database via RAG, observes, and synthesizes. Interacting with the world is the defining capability.
LVL2
Strategic Problem-SolverContext engineering
From simple tasks to strategically planning multi-part goals. The emergent skill is curating focused, high-quality context per step — building new, focused queries from prior output. Enables proactive assistance (flight email → calendar).
LVL3
Collaborative Multi-Agent SystemA team of specialists
The paradigm shifts from a "super-agent" to a team mirroring a human org; agents treat other agents as tools. A Project Manager agent delegates sub-missions to research / marketing / web-dev specialists. The frontier of workflow automation.
LVL4
Self-Evolving SystemAutonomous creation
From delegation to creation: the system identifies gaps in its own capabilities and dynamically builds new tools or agents (meta-reasons → invokes AgentCreator → a new specialist appears on the fly). A truly learning, evolving organization.

Core Agent Architecture: Model, Tools, Orchestration

An AI agent is the combination of a Model, Tools, an Orchestration Layer, and runtime services that use the LM in a loop to accomplish a goal.

🧠

The Brain

Model

The reasoning core. Selection dictates cognition, cost, and speed.

🖐️

The Hands

Tools

Connect reasoning to reality — retrieve information and take actions.

⚙️

The Nervous System

Orchestration

Runs the Think-Act-Observe loop; state, memory, design choices.

BRAINModel

The core LM is the reasoning engine. Picking the highest benchmark score is a common path to failure — production success is rarely set by generic academic benchmarks. Instead:

Test agentic fundamentals — superior multi-step reasoning and reliable tool use, mapped to your business problem (test code on your codebase; claims extraction on your document formats).
Cross-reference cost & latency. The "best" model is the optimal intersection of quality, speed, price.
Team of specialists / model routing — a frontier model (Gemini 3.2 Pro) for heavy planning, a fast/cheap one (Gemini 3.2 Flash, open Gemma 4) for high-volume tasks.
Multimodality — a natively multimodal model (Gemini live mode) or specialized tools (Cloud Vision, Speech-to-Text) that convert the world to text first.
Plan for churn — today's model is superseded in ~6 months. An AgentOps CI/CD pipeline that continuously re-evaluates new models against business metrics de-risks upgrades.

HANDSTools

Tools connect reasoning to reality through a three-part loop: define · invoke · observe.

Retrieving information (grounding). RAG gives a "library card" to external knowledge (vector DBs, knowledge graphs, web via Search); NL2SQL queries structured data. Looking things up before speaking dramatically reduces hallucinations.
Executing actions (changing the world). Wrap APIs/code as tools to send email, schedule meetings, update a CRM, or write & execute code in a secure sandbox. Includes HITL tools (ask_for_confirmation(), ask_for_date_input()) for critical decisions.
Function calling (connecting tools). The OpenAPI spec gives a structured contract; MCP simplifies discovery; some models have native tools (Gemini's native Search) where invocation happens inside the LM call.

NERVOUS SYSTEMOrchestration Layer

The engine that runs the Think, Act, Observe loop — the conductor deciding when to reason, which tool acts, and how results inform the next step.

Core design choices. The degree of autonomy (a spectrum from "a sprinkle of AI" to the LM fully in the driver's seat) and the implementation method (no-code builders vs. code-first frameworks like Google's ADK). A production framework must be open, offer precise control, and above all be built for observability — traces exposing the full reasoning trajectory.
Instruct with domain knowledge & persona. The system prompt is the agent's constitution: identity, constraints, output schema, rules of engagement, tone, and explicit guidance on when/why to use tools. A few example scenarios are highly effective.
Augment with context (memory). Short-term memory is the active scratchpad — running (Action, Observation) history. Long-term memory is persistence across sessions, almost always a RAG system over a vector DB.

Multi-agent design patterns — a team of specialists beats one super-agent:

Pattern	Use it for	How it works
Coordinator	Dynamic / non-linear tasks	A "manager" segments a request, routes sub-tasks to specialists (researcher, writer, coder), then aggregates.
Sequential	Linear workflows	A digital assembly line — one agent's output is the next's input.
Iterative Refinement	Quality	A "generator" creates content; a "critic" evaluates it against standards in a feedback loop.
Human-in-the-Loop	High-stakes / safety	A deliberate pause to get human approval before a significant action.

Agent Deployment & Services

Deployment is the agent's "body and legs" — from a laptop prototype to an always-on server reachable by people and other agents. Production agents need session-history and memory persistence, plus the builder's decisions on logging, data privacy, data residency, and compliance. Two paths:

Managed — the Gemini Enterprise Agent Platform: one destination to build, scale, govern, optimize. Includes Agent Studio (prompting → deployment), a revamped Agent Runtime (sub-second cold starts, autonomous multi-day workflows), and Memory Bank (persistent long-term context).
DIY / DevOps: package any agent into a Docker container on standard runtimes like Cloud Run or GKE.

Quick deploy commands suit early exploration; a secure, production-ready environment requires real investment in CI/CD and automated testing.

AgentOps: Structuring the Unpredictable

Traditional unit tests assert output == expected — useless when an agent's response is probabilistic by design. Judging "quality" (did it do all it should, nothing it shouldn't, with proper tone?) usually requires an LM. AgentOps is the disciplined evolution of DevOps and MLOps — turning unpredictability into a managed, measurable, reliable feature.

Measure what matters — instrument like an A/B experiment. Define "better" for the business first. Track goal-completion, satisfaction, latency, cost per interaction, and — most importantly — business impact (revenue, conversion, retention).
Quality over pass/fail — use an LM judge. A powerful model scores outputs against a rubric over a golden dataset sampled from real interactions. A domain expert must review results before they're accepted.
Metrics-driven development — your go/no-go. Run a new version against the full eval set and compare to production; add A/B deployments for safe, gradual rollout.
Debug with OpenTelemetry traces — answer "why?" A trace records the full trajectory: exact prompt, internal reasoning, tool chosen, parameters, raw observation. For root-cause debugging, not overviews.
Cherish human feedback. A "thumbs down" is a gift — a missed edge case. Close the loop: capture it, replicate it, convert it into a permanent test case — vaccinating against that whole class of error.

Agent Interoperability

High-quality agents must interconnect — the "face" of the agent. Note: agents are not tools.

Agents & Humans. A chatbot at its simplest, or structured JSON powering rich UIs. HITL patterns: intent refinement, goal expansion, confirmation, clarification. Computer use lets the LM drive a UI under oversight; MCP UI / AG UI / A2UI let the agent reshape or generate the UI. Live mode (Gemini Live API) adds bidirectional voice — speak, interrupt, with camera/mic so the agent sees and hears in real time.
Agents & Agents — A2A. Without a standard, connecting many agents means brittle integrations. The twin challenges are discovery and communication. Agent2Agent (A2A) is the open "universal handshake": each agent publishes an Agent Card (JSON "business card" — capabilities, endpoint, credentials). Communication is task-oriented and asynchronous. Where MCP solves transactional requests, A2A is for collaborative problem-solving — enabling Level-3 systems.
Agents & Money — AP2 (+ x402). An autonomous "buy" creates a crisis of trust. The Agent Payments Protocol (AP2) extends A2A with cryptographically-signed "mandates" — verifiable proof of user intent and a non-repudiable audit trail. x402 uses HTTP 402 "Payment Required" for frictionless machine-to-machine micropayments. Together they form the trust layer for the agentic web.

Security, Identity & Governance

Securing a single agent — the trust trade-off

Utility requires power; every ounce of power adds risk — primarily rogue actions and sensitive-data disclosure. Give the agent a leash long enough to do its job, short enough to keep it out of traffic. You can't trust the model's judgment alone (it's vulnerable to prompt injection). Use defense-in-depth: (1) deterministic guardrails — hardcoded rules outside the model (block any purchase over $100); (2) reasoning-based defenses — small "guard models" that inspect a proposed plan before execution.

Agent Identity — a new class of principal

Agents are a third principal beyond humans and services — autonomous actors needing their own verifiable "digital passport", distinct from the user who invoked them and the developer who built them. With a cryptographic identity (e.g. SPIFFE) they get least-privilege permissions, containing the blast radius if one is compromised.

Principal	Authentication / Verification	Notes
Users	OAuth or SSO	Human actors, full autonomy and responsibility.
Agents (new)	Verified with SPIFFE	Delegated authority — act on behalf of users.
Service accounts	Integrated into IAM	Apps/containers, deterministic, not responsible for actions.

Policy = authorization (AuthZ), distinct from authentication (AuthN). Apply least privilege while staying contextually relevant.

Securing an ADK agent

A layered exercise: define identities → enforce access policies at the API-governance layer → build in-tool guardrails that refuse unsafe actions regardless of LM reasoning → add dynamic defenses (ADK Callbacks & Plugins; a before_tool_callback; a "Gemini as a Judge" screen). The Agent Gateway is "air traffic control," natively enforcing Model Armor (prompt injection, jailbreaks, PII leakage, malicious URLs).

Scaling to an enterprise fleet

One or two agents → a security problem. Hundreds → an architecture problem ("agent sprawl").

Security & privacy. The agent is a new attack vector (injection, data poisoning, leakage). Ensure proprietary data never trains base models, protect with VPC Service Controls, apply input/output filtering, and secure IP indemnity.
Governance — a control plane, not sprawl. A central gateway becomes the mandatory entry point for all agentic traffic: (1) Runtime Policy Enforcement via the Agent Gateway (AuthN + AuthZ + single pane of glass); (2) Centralized Governance via the Agent Registry — an enterprise "app store" indexing every agent, tool, and Skill, with security review, versioning, and fine-grained policies.
Cost & reliability. Scale-to-zero for irregular traffic; Provisioned Throughput / 99.9% SLAs for mission-critical workloads.

How Agents Evolve & Learn

In dynamic environments, performance "ages" and decays; manually updating a fleet is slow and uneconomical. The scalable answer: agents that learn and evolve autonomously.

Sources of learning — runtime experience (logs, traces, memory, and authoritative HITL feedback) and external signals (new policies, regulations, peer critiques).
Adaptation techniques — enhanced context engineering (refine prompts, few-shot examples, retrieved memory) and tool optimization & creation (gain, craft, or modify tools). RLHF and dynamic pattern reconfiguration are active research.

Worked example — learning compliance guidelines. A multi-agent loop: a Querying Agent fetches data → a Reporting Agent drafts → a Critiquing Agent reviews against rules and escalates ambiguity to a human → a Learning Agent generalizes the expert's correction into a new reusable guideline the Critiquing Agent applies automatically next time.

Simulation & Agent Gym — the next frontier. Beyond in-line learning, a dedicated Agent Gym optimizes the system offline: (1) not in the execution path; (2) a simulation environment for trial-and-error; (3) synthetic data generators for realistic pressure-testing (red-teaming, critiquing-agent families); (4) a non-fixed optimization arsenal that can adopt or craft tools; (5) a bridge to human experts for "tribal knowledge" edge cases.

Advanced Examples

Google Co-Scientist

A virtual research collaborator that accelerates discovery by systematically exploring complex problem spaces. A researcher defines a goal and grounds the agent in chosen knowledge; the system generates and evaluates a landscape of novel hypotheses. It spawns a whole ecosystem of agents: a "Supervisor" acts as project manager, delegating to specialists and distributing compute. Agents run for hours or days, with loops and meta-loops improving both the hypotheses and the way ideas are judged and created.

AlphaEvolve

An AI agent that discovers and optimizes algorithms, pairing Gemini's code generation with an automated evaluator in an evolutionary loop: generate → score → use the best as inspiration for the next generation. Breakthroughs include more efficient data centers / chip design / AI training, faster matrix multiplication, and new solutions to open math problems. It excels where verifying a solution is far easier than finding it, and is built for human–AI partnership: transparent human-readable code and expert guidance that steers exploration and prevents loophole-exploitation.

Conclusion

Agents are the natural evolution of language models, made useful in software.

Generative AI agents shift AI from a passive content tool to an active, autonomous problem-solving partner. The anatomy is three parts — the reasoning Model (Brain), actionable Tools (Hands), and the governing Orchestration Layer (Nervous System) — integrated in a continuous Think, Act, Observe loop. The 5-level taxonomy lets architects scope ambition to the task.

The central shift is in the developer paradigm: no longer bricklayers defining explicit logic, but architects and directors who guide, constrain, and debug an autonomous entity. The flexibility that makes LMs powerful is also the source of their unreliability — so success lives not in the initial prompt but in engineering rigor across the whole system: robust tool contracts, resilient error handling, sophisticated context management, and comprehensive evaluation. Applied with discipline, these patterns build not mere "workflow automation" but collaborative, capable, adaptable new members of the team.

Synthesized for the Caladrius Health KB from Google's Introduction to Agents (Agents Whitepaper Series · May 2026). Source PDF and full extraction in raw/ai-ml-research/harness-engineering/.
Authors: Alan Blount · Antonio Gulli · Shubham Saboo · Michael Zimmermann · Vladimir Vuskovic. · Synthesis generated 2026-06-19.