Waboom AI
AI Training
AI Automation
AI Voice Agents
Resources
Contact
09 888 0402
Back to BlogArchitecture

The Game-Changer: Agent-to-Agent Transfer with Full Context Preservation

Leonardo Garcia-Curtis06/08/2025
The Game-Changer: Agent-to-Agent Transfer with Full Context Preservation

A real estate agency in Auckland had one AI voice agent handling everything. Valuations, open home bookings, tenant enquiries, maintenance requests, investor callbacks. One agent. One prompt. 2,400 words of instructions.

It worked for 3 months. Then it started routing maintenance calls to the sales team. Investor questions got tenant responses. The agent couldn't tell the difference between "I want to sell my house" and "I want to list my rental."

The problem wasn't the AI. The problem was asking one agent to be 5 specialists at once.

We split it into 5 focused agents. Each one handles a single domain. Calls transfer between them with full context — your caller never notices the handoff.

Zero repeated information. Zero confusion.

That system now handles 800+ calls a week across your typical mix of sales, property management, and investor relations.

Why Single-Agent Architecture Breaks

You've seen this pattern. Your agent starts simple. Handles one task well.

Then your client asks for "just one more thing." And another. And another.

Before you know it, your system prompt is 3,000 tokens. Your agent hallucinates responses from the wrong domain. A tenant asking about a broken heater gets your sales pitch instead.

Single-agent architecture fails for 3 reasons:

  • Prompt bloat — Every new capability adds tokens. Your LLM processes all of them on every turn, even when 80% aren't relevant.
  • Context confusion — Your agent can't reliably distinguish between domains when instructions span 5 business functions.
  • Maintenance nightmare — One change to your sales script risks breaking your maintenance flow. Your developers can't update one without testing all 5.
  • How Agent Transfer Works in Retell

    Retell's Transfer Agent Node lets you hand off a live call from one AI agent to another. Here's what happens:

    No new phone call. Your transfer is instant. No dialling, no connection delay. Your caller hears a seamless transition — often faster than a human transfer.

    Full conversation history. Your destination agent receives the complete transcript of everything said so far. Every word. Every data point your first agent collected.

    Single phone number. Your caller dials one number. Your routing logic decides which specialist handles what. 5 agents, one number. Clean for your callers, flexible for your team.

    Settings that carry over. Your webhook URL, signed URL opt-in, and data storage settings persist from your first agent throughout the entire call. Voice and language settings update to match your destination agent.

    Agent transfer architecture

    One number. Five specialists. Zero context loss.

    Building Your Multi-Agent Architecture

    Here's how we structure multi-agent systems at Waboom AI:

    Step 1: Design Your Routing Agent

    Your first agent is a traffic controller. Short prompt. One job: figure out why your caller is calling and route them to the right specialist.

    Keep this agent lean. 200-300 tokens max in the system prompt. Ask 1-2 qualifying questions, then transfer. The faster your router works, the faster your caller gets to the right agent.

    Step 2: Build Specialist Agents

    Each specialist handles one domain. Give them focused prompts with deep knowledge in their area. A 500-token prompt for one topic outperforms a 2,500-token prompt covering 5 topics.

    Your specialists don't need to handle off-topic questions. If your caller changes direction mid-conversation, transfer them back to the router or to the right specialist.

    Step 3: Configure Transfer Triggers

    In your conversation flow, add Transfer Agent Nodes at the decision points. Your conditions can be prompt-based ("caller wants to discuss a sale") or equation-based (checking a variable you extracted earlier).

    Step 4: Handle the Edge Cases

    What happens when your caller's request spans two domains? Build a handback path. Your specialist handles their piece, then transfers to the next specialist with full context intact.

    What if your caller says "actually, I called about something else"? Your global node catches that and routes back to your traffic controller.

    Warm Transfers to Humans

    Not every transfer goes to another AI. Sometimes your caller needs a human.

    Retell's Call Transfer Node handles this with 2 modes:

    Cold transfer — Immediate handoff. Your caller connects to your human agent. Simple. No frills.

    Warm transfer — Your AI waits for a human to answer (configurable timeout, default 30 seconds). While waiting, your caller hears hold music. When your human picks up, 3 things can happen:

  • Whisper message — Your AI whispers context to your human. The caller doesn't hear it.
  • Three-way introduction — All 3 parties hear each other. Your AI introduces the caller, then drops off.
  • Auto-greeting — Your human picks up and the conversation continues naturally.
  • Pro tip: always configure a failure edge on your transfer node. If your human doesn't answer within the timeout, route your caller to voicemail or back to your AI for rescheduling.

    What Carries Over (and What Doesn't)

    When your agents transfer between each other, your caller's experience should feel continuous. Here's what transfers:

    Carries over:

  • Complete conversation transcript
  • All dynamic variables collected so far
  • Webhook URL and data storage settings
  • Updates to match your destination agent:

  • Voice and language settings
  • LLM model and prompt
  • Stays locked to your first agent:

  • Webhook URL configuration
  • Signed URL opt-in setting
  • Sensitive data storage preferences
  • This means your first agent's webhook receives events for the entire call, regardless of how often your call transfers. Design your webhook handlers accordingly.

    Real-World Deployments

    Healthcare clinic in Christchurch — Routing agent qualifies the call type (booking, prescription, results, emergency). Transfers to the right specialist.

    4 agents total. Your patients never repeat their reason for calling.

    E-commerce brand in Melbourne — Product enquiry agent handles questions about 200+ SKUs using a RAG-powered knowledge base. Returns agent handles refund logic. Order tracking agent checks your shipping API.

    3 specialists, one number.

    Financial adviser in Wellington — Intake agent collects your basic details and KYC info. Investment agent discusses portfolio options.

    Compliance agent handles your disclosure requirements. Each agent has domain-specific compliance guardrails.

    Performance Considerations

    Multi-agent transfers add minimal latency. The handoff itself is near-instant — no telephony overhead.

    But each new agent loads its own prompt and context. If you're transferring 4 times in one call, that's 4 prompt loads.

    Keep your specialist prompts lean to minimise the impact on your overall latency.

    Test your transfer paths with batch simulation before going live. The transfers themselves are reliable.

    What breaks is your routing logic sending callers to the wrong specialist.

    One number. Multiple specialists. Zero dropped context.

    Book a Strategy Call | See the Platform

    Frequently Asked Questions

    Does the caller notice when agents transfer between each other?

    No. The transfer is near-instant with no new phone call created. Your caller hears a seamless transition.

    Voice and language settings update to match your destination agent, but the conversation continues without interruption. Most callers have no idea they've been handed to a different agent.

    What context does the destination agent receive?

    Your destination agent gets the complete conversation transcript — every word from the start of the call. It also receives all dynamic variables your previous agents collected.

    Webhook and data storage settings stay locked to your first agent for the entire call.

    How do I handle transfers to human agents?

    Use Retell's Call Transfer Node with warm or cold transfer modes. Warm transfers let your AI wait for a human to answer, play hold music, and deliver a whisper message with context.

    Cold transfers hand off immediately. Always configure a failure edge in case your human doesn't answer within the timeout.

    What's the maximum number of agent transfers in one call?

    Retell doesn't impose a hard limit on transfers per call. We've built flows with up to 4 transfers without issues.

    Each transfer adds minimal latency, but your specialist prompts should stay lean to keep response times under your target. Test your full transfer chain before deploying to production.

    LG

    Leonardo Garcia-Curtis

    Founder & CEO at Waboom AI. Building voice AI agents that convert.

    Ready to Build Your AI Voice Agent?

    Let's discuss how Waboom AI can help automate your customer conversations.

    Book a Free Demo
    Waboom AI

    Empowering New Zealand and Australian businesses with AI voice agents and automation that deliver real, measurable value.

    hello@waboom.ai+64 9 888 0402
    Level 8, 139 Quay Street
    Auckland CBD, New Zealand

    Solutions

    • AI Training
    • AI Strategy
    • AI Automation
    • AI Voice Agents
    • AI Champion Workshop

    Resources

    • AI Voice Agent Pricing
    • AI Voice Demos
    • Resources
    • Blog

    Company

    • About Us
    • Contact
    • Privacy Policy
    • Terms of Service

    Powered by leading AI technologies

    VAPIRetell AIOpenAIZapierMakeStripe

    © 2026 Waboom.ai. All rights reserved.

    PrivacyTermsSecurity