Home Technologies Voice agents and ElevenLabs MCP built to run in production
Voice AI we build with

Voice agents and ElevenLabs MCP built to run in production

What it is & where it fits

How QuantalAI uses Voice agents and ElevenLabs MCP built to run in production.

ElevenLabs is a voice platform. It turns text into speech that sounds human, clones a consented voice, dubs audio across languages, and through ElevenLabs MCP it lets an agent reach your tools to actually do something on a call. That much you can buy in an afternoon. The part that decides whether callers trust it is the work nobody demos. Connecting the agent to your booking system and account records so it answers about your business. Recording every prompt and routing decision so you can find and fix a bad answer. Building the call flow as a service that survives a Monday morning rush rather than a notebook that stalls at the second concurrent caller. That groundwork is where a natural voice becomes a dependable one.

Book a discovery call

Where the voice project stalls

You have heard how good AI speech has become, and you can see the use. A phone line that answers after hours. Booking changes handled without tying up the front desk. Training audio you do not re-record every time a price changes. The demo sounds almost human, and for a few minutes it feels solved.

Then the questions start. The voice sounds great, but it does not know your opening hours, your products or which suburbs you service. It cannot actually rebook an appointment, it can only talk about rebooking. When a caller asks something awkward, it answers confidently and wrong, and you cannot see why. By the time three people call at once, the prototype that worked on your laptop is dropping calls. The gap between a clip that impresses and a line a customer can rely on is wide.

Why buying ElevenLabs on its own under-delivers

ElevenLabs gives you a remarkable voice. It does not give you an agent that knows your business, behaves predictably, or stays standing under load. Those three things decide whether callers trust the line, and none arrive in the subscription.

The first gap is knowledge. A voice agent is only useful if it answers about you. Ask it whether a faulty item bought on sale can be returned, and a stock model will invent a plausible policy rather than quote yours. We close that by connecting the agent to your real information, the AI-accessible internal data that lets it read your actual policies, opening hours and account records and answer from them. Through ElevenLabs MCP and the API the agent reaches your booking system or CRM during the call, so it does the task instead of describing it.

The second gap is traceability. When a voice agent gives a wrong answer, you need to know why and change it the same day. We keep the prompts, call flows and tools the agent can use under version-controlled prompts and decisions, with eval runs over recorded calls so behaviour is measured rather than guessed. Every change is logged, and a tweak that makes things worse gets rolled back. That record matters when calls touch customer data or a regulator might ask.

The third gap is that a notebook is not a phone system. A script that handles one caller is a long way from a line that holds up at nine on a Monday. We build the voice flow as a quality internal platform, engineered for concurrent calls, retries, logging and graceful failure, so a busy hour does not become a queue of dropped calls.

An ElevenLabs phone agent answering a customer enquiry while routing a sensitive call to a staff member

How we deliver an ElevenLabs voice agent

We work in small, reviewable stages so risk stays low and you hear something real early.

  1. Pick the calls worth automating. We find one set of high-volume, low-risk enquiries where a voice answers well and a mistake is recoverable, then agree what a good call sounds like before we build.
  2. Ground it in your data. We connect the agent to the knowledge, records and systems it needs, through retrieval and ElevenLabs MCP, so its answers and actions come from your business with the source behind them.
  3. Choose and tune the voice. We select the ElevenLabs TTS voice, settings and model, then check pronunciation of your product names, place names and any tricky terms so the audio matches how you actually speak.
  4. Set the handoff and the guardrails. We define exactly what the agent handles alone and what goes to a person, with a warm transfer that carries the context so the caller never starts over.
  5. Test on real calls, then scale. We replay your past enquiries, measure where the agent is right and wrong, run a small pilot, and expand only once the numbers and the cost per call hold.

When ElevenLabs is the right call, and when it is not

ElevenLabs is the right choice when a natural voice genuinely improves the experience. After-hours and overflow phone lines, IVR that does not sound like a machine reading a menu, narration for training and accessibility, and multilingual audio for customers who do not read English easily are all strong fits. If your callers would rather speak than tap through a form, voice earns its place.

It is the wrong choice when voice is added for novelty and a chat or a form would serve the listener better. It is wrong when the conversation needs human empathy on every call, such as a complaint or a vulnerable customer, where a bot frustrates more than it helps. And we will not clone a voice without the consent of the person it belongs to. On the newest voice-agent products around ElevenLabs we are equally plain, because much of that ecosystem is young and the lock-in is real, so we choose what we can support rather than what is loudest this month. We will tell you straight which of these your case is before you spend anything.

Services and industries where voice fits

A voice agent is one shape of a wider capability. See how it connects to AI agents that act inside your systems and to broader AI and automation work. It earns its keep differently across sectors, so see it applied in Healthcare, Retail & Ecommerce and Professional Services.

Capabilities

What we build on ElevenLabs

01

ElevenLabs MCP tool calling

We wire the agent to your scheduling, CRM and order systems through ElevenLabs MCP, so a caller can rebook an appointment or check an order status and the agent does the action, not just reads a script back.

02

Production voice agents with human handoff

Spoken agents for phone lines, kiosks and in-app calls that answer the routine enquiries and pass anything sensitive or angry to a person, with the warm transfer carrying the context across so nobody repeats themselves.

03

Grounded text-to-speech from your own content

ElevenLabs TTS narration for training, accessibility and IVR prompts, generated from your approved scripts and product names so the audio matches what you actually sell rather than a generic blurb.

04

Consented voice cloning with safeguards

A brand or presenter voice cloned only with written permission from the person it belongs to, set up with access controls and an audit record of who can generate speech in that voice and when.

05

Multilingual dubbing for Australian audiences

Existing recordings re-voiced across languages for multicultural customers and regional teams, with a native reviewer checking pronunciation of local place names and terms before anything ships.

About Voice agents and ElevenLabs MCP built to run in production

Voice agents and ElevenLabs MCP built to run in production is a voice ai that QuantalAI builds and integrates for Australian organisations. Learn more at the official source: https://elevenlabs.io.

No stupid questions

Frequently asked.

Is ElevenLabs AI free to use?
There is a free tier that gives you a small monthly allowance of generated speech for trying it out. It is fine for testing a voice or a short clip, but it is not built for a business running a phone line at volume, and it carries usage limits and attribution conditions. For anything customer-facing you move to a paid plan, and we estimate your volume before you commit so the bill holds no surprises.
What does ElevenLabs do?
It converts written text into natural-sounding speech, clones a voice with permission, dubs audio into other languages, and provides the speech layer for conversational voice agents. Through ElevenLabs MCP and its API it can also connect to other systems so an agent acts on a call. We use it as the voice in projects where a spoken answer beats a typed one, and pair it with the retrieval and integrations that make the agent know your business.
Can I use ElevenLabs for commercial use?
Yes, on the paid plans. The commercial plans grant rights to use the generated speech in your products, content and services, and remove the attribution that the free tier requires. The detail that matters is voice cloning. Cloning a real person needs their consent, and using a generated voice that imitates someone without permission is both a licence breach and a legal risk. We build only with consented voices and keep the permission on record.
What does ElevenLabs do for a business?
It lets you add a natural voice where one earns its place. That might be a phone agent handling after-hours enquiries, spoken IVR prompts that do not sound robotic, narration for training and accessibility, or multilingual audio for customers who do not read English easily. The value is staff time saved and faster answers for callers. We help you judge where voice genuinely helps and where a form or chat would serve the listener better.
What ElevenLabs voice is this?
People often ask this when they hear a clip and want to match it. ElevenLabs has a library of stock voices plus any custom or cloned voices an account holds, each with a name and an identifier. When we build for you we record exactly which voice, settings and model produced each piece of audio, so a voice can be reproduced or replaced later without guesswork and you keep a clear record of what you published.
What is usage-based billing in ElevenLabs?
ElevenLabs charges mainly on how much speech you generate, measured in characters or credits, on top of a plan tier. Generate more audio and the cost rises with it. For a busy voice agent this means cost tracks call volume, so we model your expected minutes and characters up front, design the call flow to avoid wasteful re-generation, and give you a forecast in AUD before you scale.
Take the next step

Tell us where a voice would earn its place

Describe the calls or audio eating your team's hours. We will tell you honestly whether ElevenLabs fits, how we would ground and govern it, and what it costs to run at your volume.

Book a discovery call