Configure your first AI phone agent

Complete guide to create, configure, test, and publish an AI phone agent that handles customer conversations automatically. In this guide, you'll set up a professional voice agent that can make sales calls, provide customer support, schedule appointments, or handle any other voice interaction use case.

Overview

An AI agent in Agoralia is a voice-enabled AI assistant that can:

Answer inbound calls automatically
Make outbound calls for campaigns
Handle conversations in natural language
Extract structured data from conversations
Integrate with knowledge bases for enhanced responses
Use tools and functions for external integrations

Agents are configured through a 5-step wizard that guides you through all settings, from basic configuration to advanced features.

Prerequisites

Before creating an agent, ensure you have:

✅ Admin or editor access to your Agoralia workspace
✅ Clear understanding of your use case (sales, support, appointments, etc.)
✅ Optional: Knowledge base documents prepared (for enhanced responses)
✅ Optional: Phone number configured (for testing)

Step 1: Base Configuration

The first step sets up the fundamental properties of your agent.

Agent Name

Required: Give your agent a descriptive name that reflects its purpose.

Examples:

"Enterprise Sales Rep"
"Tech Support Assistant"
"Appointment Scheduler"
"Customer Onboarding Agent"

Best Practices:

Use clear, descriptive names
Include the use case or department
Avoid generic names like "Agent 1"

Language

Required: Select the primary language for your agent.

Available Languages: Agoralia supports 150+ languages including:

English (US, UK, Canada, Australia, etc.)
Italian, Spanish, French, German
Chinese, Japanese, Korean
Arabic, Hindi, Portuguese
And many more...

How it works:

Agoralia automatically selects the best voice engine for your chosen language
English languages use optimized voice engines
All other languages are fully supported with high-quality voices

Note: You can enable multilingual mode in campaigns to auto-detect the caller's language.

Voice ID

Required: Select the voice that will speak for your agent.

Voice Selection:

High-quality voices are available for all supported languages
Voices are automatically filtered based on your selected language
You can preview voices before selecting them

Voice Properties:

Gender: Male, Female, Neutral
Accent: Regional variations (e.g., US English vs. UK English)
Style: Professional, Casual, Friendly, etc.

Preview: You can preview voices before selecting them using the voice selector dialog.

Best Practices:

Choose a voice that matches your brand personality
Test multiple voices to find the best fit
Consider your target audience's preferences

LLM Model

Required: Select the Large Language Model (LLM) that powers your agent's intelligence.

Available Models:

GPT-4o: Most capable, best for complex conversations
GPT-4o-mini: Faster and more cost-effective, good for most use cases
GPT-3.5-turbo: Legacy model, still supported
Additional models may be available based on your plan

Model Selection Guide:

Complex use cases (technical support, sales): Use GPT-4o
Simple use cases (appointments, FAQs): Use GPT-4o-mini
Cost-sensitive: Use GPT-4o-mini
Speed-critical: Use GPT-4o-mini

Note: Available models may vary. Contact support if you need a specific model.

Step 2: Character Configuration

Define your agent's personality, role, and conversation style.

Primary Role

Options:

Inbound: Agent handles incoming calls only
Outbound: Agent makes outbound calls only
Both: Agent handles both inbound and outbound calls

Default: both

Use Cases:

Inbound only: Customer support, help desk
Outbound only: Sales outreach, appointment reminders
Both: General-purpose agents

Mission

Required: Describe what your agent should accomplish.

Examples:

"Help customers troubleshoot technical issues with our product"
"Schedule appointments for our dental practice"
"Qualify leads for our sales team"
"Provide information about our services and pricing"

Best Practices:

Be specific and clear
Include the primary goal
Mention any constraints or requirements
Keep it concise (1-2 sentences)

Additional Instructions (Custom Prompt)

Optional: Provide additional instructions for the agent's behavior.

Use Cases:

Tone and style guidelines
Specific phrases to use or avoid
Handling of edge cases
Compliance requirements

Example:

Always be polite and professional. If the customer seems frustrated, 
acknowledge their feelings and offer to escalate to a human agent. 
Never make promises about pricing or delivery dates without checking 
our system first.

Welcome Message

Optional: Custom greeting message that the agent says when answering a call.

Examples:

"Hello! Thank you for calling Acme Corp. How can I help you today?"
"Hi, this is Sarah from Tech Support. What seems to be the issue?"
"Welcome to our appointment booking service. I'm here to help you schedule your visit."

If not provided: The agent will generate a greeting based on its mission.

Who Speaks First

Options:

Agent: Agent greets the caller first (default)
User: Agent waits for the caller to speak first

Default: agent

Use Cases:

Agent first: Most common, provides immediate greeting
User first: Useful for inbound calls where caller might have urgent questions

Pause Before Speaking

Range: 0-5000 milliseconds

Default: 0ms

Purpose: Adds a delay before the agent starts speaking (useful for ensuring the call is connected).

Recommendations:

0ms: For most use cases
500-1000ms: If you experience connection issues
2000-5000ms: For very slow connections (not recommended)

Step 3: Voice & Behavior

Fine-tune how your agent sounds and behaves during conversations.

Voice Settings

Voice Model

Optional: Advanced voice model configuration (usually auto-detected from voice ID).

Voice Temperature

Range: 0.0 - 2.0

Default: 1.0

Effect: Controls the randomness and creativity of the voice.

Lower (0.0-0.7): More consistent, predictable voice
Higher (1.3-2.0): More varied, expressive voice

Recommendations:

Professional/Formal: 0.7-1.0
Friendly/Casual: 1.0-1.3
Creative/Expressive: 1.3-1.7

Voice Speed

Range: 0.5 - 2.0

Default: 1.0

Effect: Controls how fast the agent speaks.

Slower (0.5-0.8): Easier to understand, good for complex information
Normal (0.9-1.1): Standard speaking pace
Faster (1.2-2.0): Quick delivery, use sparingly

Recommendations:

Technical support: 0.8-1.0 (slower for clarity)
Sales: 1.0-1.2 (energetic but clear)
Appointments: 0.9-1.1 (balanced)

Volume

Range: 0.0 - 2.0

Default: 1.0

Effect: Controls the audio volume level.

Recommendations:

Most cases: 1.0 (default)
Quiet environments: 1.2-1.5
Noisy environments: 0.8-1.0

Behavior Settings

Responsiveness

Range: 0.0 - 1.0

Default: 1.0

Effect: How quickly the agent responds to the user's speech.

Lower (0.0-0.5): Waits longer before responding (more natural pauses)
Higher (0.7-1.0): Responds quickly (more conversational)

Recommendations:

Formal/Professional: 0.6-0.8
Casual/Friendly: 0.8-1.0

Interruption Sensitivity

Range: 0.0 - 1.0

Default: 1.0

Effect: How easily the agent allows the user to interrupt.

Lower (0.0-0.5): Agent continues speaking, harder to interrupt
Higher (0.7-1.0): Agent stops quickly when user speaks

Recommendations:

Information-heavy: 0.5-0.7 (less interruptions)
Conversational: 0.8-1.0 (more interruptions allowed)

Enable Backchannel

Default: Disabled

Effect: Agent makes small verbal acknowledgments ("uh-huh", "I see", "okay") during user speech.

Use Cases:

Enabled: More natural, conversational feel
Disabled: More formal, professional feel

When Enabled:

Backchannel Frequency

Range: 0.0 - 1.0

Default: 0.8

Effect: How often the agent makes backchannel sounds.

Normalize for Speech

Default: Disabled

Effect: Automatically converts text to speech-friendly format (e.g., "Dr." → "Doctor", "$100" → "one hundred dollars").

Use Cases:

Enabled: When agent reads structured data (dates, numbers, abbreviations)
Disabled: For natural conversation

Background Sound (Ambient Sound)

Options:

None (default)
Coffee Shop: Subtle café ambiance
Convention Hall: Large room reverb
Summer Outdoor: Outdoor environment
Mountain Outdoor: Natural outdoor setting
Static Noise: Background static
Call Center: Office/call center ambiance

Use Cases:

None: Most professional use cases
Coffee Shop: More casual, friendly feel
Call Center: Traditional call center experience

When Enabled:

Ambient Sound Volume

Range: 0.0 - 2.0

Default: 1.0

Effect: Volume of the background sound relative to the agent's voice.

Recommendations: Keep below 0.5 to avoid distracting from the conversation.

Step 4: Transcription & Call Settings

Configure how the agent processes speech and manages calls.

Transcription Settings

Transcription Mode

Options:

Fast: Lower latency, good for real-time conversations (default)
Accurate: Higher accuracy, slightly higher latency

Recommendations:

Fast: For most use cases, real-time conversations
Accurate: For technical terms, medical terminology, or when accuracy is critical

Vocabulary Specialization

Options:

General (default): Standard vocabulary
Medical: Optimized for medical terminology

Use Cases:

General: Most use cases
Medical: Healthcare, medical appointments, pharmaceutical sales

Denoising Mode

Options:

Noise Cancellation: Removes background noise only
Noise and Background Speech Cancellation: Removes both noise and background speech

Recommendations:

Noise Cancellation: For most environments
Noise and Background Speech: For noisy call centers or open offices

Boosted Keywords

Optional: Comma-separated list of keywords that should be recognized with higher priority.

Format: keyword1, keyword2, keyword3

Examples:

Product names: AcmePro, Enterprise Edition, Premium Plan
Technical terms: API, SDK, webhook, integration
Industry terms: GDPR, HIPAA, compliance

Use Cases:

Product names or brand names
Technical terminology
Industry-specific terms
Acronyms

Call Settings

End Call After Silence

Range: 10 seconds minimum

Default: 600 seconds (10 minutes)

Effect: Automatically ends the call if there's no speech for this duration.

Recommendations:

Short calls (appointments, FAQs): 30-60 seconds
Medium calls (support, sales): 60-120 seconds
Long calls (consultations): 300-600 seconds

Maximum Call Duration

Range: 60 seconds (1 minute) - 7200 seconds (2 hours)

Default: 3600 seconds (60 minutes)

Effect: Maximum length of a call before automatic termination.

Recommendations:

Appointments: 300-600 seconds (5-10 minutes)
Support: 600-1800 seconds (10-30 minutes)
Sales: 1800-3600 seconds (30-60 minutes)
Consultations: 3600-7200 seconds (60-120 minutes)

Ring Duration

Range: 5-90 seconds

Default: 30 seconds

Effect: How long the phone rings before the call is considered unanswered.

Recommendations:

Most cases: 30 seconds
High-volume: 20 seconds
Important calls: 60-90 seconds

DTMF (Touch-Tone) Support

Default: Enabled

Effect: Allows users to press numbers on their phone keypad to interact with the agent.

Use Cases:

Menu navigation ("Press 1 for Sales, Press 2 for Support")
Confirmation ("Press 1 to confirm")
Data entry (phone numbers, account numbers)

When Enabled: You can configure DTMF options in the agent's advanced settings.

Voicemail Detection

Default: Disabled

Effect: Automatically detects when a call goes to voicemail and leaves a message.

When Enabled:

Voicemail Message

Optional: Custom message to leave on voicemail.

Example:

Hi, this is Sarah from Acme Corp. I'm calling to follow up on your 
inquiry. Please call us back at 1-800-555-1234 or visit our website 
at acme.com. Thank you!

If not provided: The agent will generate a message based on its mission.

Step 5: Advanced & Knowledge Base

Configure advanced features like knowledge bases, webhooks, tools, and integrations.

Knowledge Base

Optional: Connect your agent to knowledge bases for enhanced responses.

How it works:

Agent can reference knowledge base content during conversations
Knowledge bases can contain FAQs, product information, policies, etc.
Multiple knowledge bases can be connected to a single agent

Available Knowledge Bases:

General KB: Automatically available to all agents
Custom KBs: Created in the Knowledge Base section

Selection:

Check the boxes next to knowledge bases you want to connect
Knowledge bases are filtered by language (agent language must match KB language)
Synced knowledge bases are marked with "✓ Synced"

Best Practices:

Connect relevant knowledge bases for your use case
Keep knowledge bases up-to-date
Use language-specific knowledge bases for multilingual agents

Webhook Settings

Optional: Configure webhooks to receive call events in real-time.

Webhook URL

Format: Valid HTTPS URL

Example: https://your-backend.com/webhooks/agoralia

Events Sent:

Call started
Call ended
Call transcript updates
Extracted data
Call outcomes

Use Cases:

Real-time call monitoring
CRM integration
Custom analytics
Alert systems

Webhook Timeout

Range: 0+ milliseconds

Default: 5000ms (5 seconds)

Effect: Maximum time to wait for webhook response before timing out.

Recommendations:

Fast webhooks: 3000-5000ms
Slow webhooks: 10000-30000ms

Tools & Functions

Optional: Add custom tools/functions that the agent can call during conversations.

Available Tools:

Custom Functions: Define your own functions
MCP Servers: Connect to Model Context Protocol servers
API Integrations: Connect to external APIs

Use Cases:

Calendar Integration: Check availability, book appointments
CRM Integration: Look up customer data, update records
Database Queries: Retrieve information
API Calls: External service integration

Configuration:

Tools are configured in the Functions Editor
Each tool has a name, description, and parameters
Agent automatically decides when to use tools based on conversation context

MCP Servers

Optional: Connect Model Context Protocol (MCP) servers for advanced integrations.

MCP Servers:

Provide structured access to external data and functions
Enable agents to interact with complex systems
Support authentication and secure connections

Configuration:

Select MCP servers from your configured servers
Servers must be set up in Settings → Integrations → MCP Servers
Agent can use multiple MCP servers simultaneously

Post-Call Analysis

Optional: Configure automatic analysis of call transcripts after calls end.

Analysis Types:

Sentiment Analysis: Determine caller sentiment
Topic Extraction: Identify main topics discussed
Action Items: Extract follow-up actions
Summary Generation: Create call summaries

Model: Select the LLM model to use for analysis (default: GPT-4o-mini)

Use Cases:

Quality assurance
Call categorization
Performance metrics
Training data generation

Data Storage

Options:

Everything: Store all call data including transcripts and audio
Everything Except PII: Store data but redact personally identifiable information
Basic Attributes Only: Store only metadata (duration, outcome, etc.)

Default: Everything

Recommendations:

GDPR/Privacy Compliance: Use "Everything Except PII" or "Basic Attributes Only"
Full Analytics: Use "Everything"
Cost Optimization: Use "Basic Attributes Only"

Opt-In Signed URL

Default: Disabled

Effect: Generate signed URLs for accessing call recordings (requires opt-in).

Use Cases:

Secure access to call recordings
Time-limited access links
Compliance with data retention policies

Testing Your Agent

Before publishing, test your agent to ensure it works correctly.

Test Call

How to Test:

Go to the agent's edit page
Click "Test Call" button
Enter a phone number (your own number for testing)
Click "Start Test Call"
Answer the call and have a conversation

What to Test:

✅ Agent answers correctly
✅ Voice quality is good
✅ Agent understands your questions
✅ Agent responds appropriately
✅ Agent follows instructions correctly
✅ Call ends properly

Test Chat (Simulation)

How to Test:

Go to the agent's edit page
Click the "Simulation" tab
Type messages as if you were calling
Review the agent's responses

What to Test:

✅ Agent understands context
✅ Responses are relevant
✅ Agent follows instructions
✅ Knowledge base content is used correctly
✅ Tools/functions are called when needed

Cost Estimate

How to View:

Cost estimate is shown automatically when configuring the agent
Based on selected language, voice, and LLM model
Updates in real-time as you change settings

What's Included:

Per-minute rates for voice
Per-minute rates for LLM
Estimated cost per call
Estimated cost per 1000 calls

Publishing Your Agent

Once your agent is configured and tested, publish it to make it available for campaigns.

How to Publish

Complete all 5 steps of the agent configuration
Review all settings
Click "Save" or "Publish" button
Agent is now available for use in campaigns

Agent Status

Draft: Agent is saved but not published (can be edited) Published: Agent is available for campaigns (can still be edited)

Note: Changes to published agents take effect immediately for new calls. Existing calls continue with the previous configuration.

Best Practices

Agent Configuration

✅ Start simple: Begin with basic settings, add complexity as needed
✅ Test thoroughly: Test with real calls before deploying to production
✅ Monitor performance: Review call transcripts and adjust settings
✅ Iterate: Improve agents based on real-world usage

Voice Selection

✅ Match brand: Choose voices that match your brand personality
✅ Consider audience: Test with your target audience
✅ Test multiple: Try different voices to find the best fit

Language & Model

✅ Use appropriate model: GPT-4o for complex, GPT-4o-mini for simple
✅ Consider costs: Balance capability with cost
✅ Test performance: Ensure model handles your use case well

Knowledge Bases

✅ Keep updated: Regularly update knowledge base content
✅ Be specific: Include detailed, accurate information
✅ Test relevance: Ensure KB content is actually used

Webhooks & Integrations

✅ Handle errors: Implement proper error handling in webhooks
✅ Test endpoints: Verify webhooks work before deploying
✅ Monitor performance: Track webhook response times

Troubleshooting

Agent Doesn't Answer Calls

Possible Causes:

Agent is not published
Phone number not configured correctly
Agent configuration incomplete

Solutions:

Verify agent status is "Published"
Check phone number configuration
Test agent using "Test Call" feature

Poor Voice Quality

Possible Causes:

Voice selection mismatch
Network issues
Voice settings configuration

Solutions:

Try different voice
Check network connection
Adjust voice settings (speed, volume)

Agent Doesn't Understand Questions

Possible Causes:

Language mismatch
Transcription issues
LLM model limitations

Solutions:

Verify language is correct
Check transcription settings
Try a more capable LLM model (GPT-4o)

Knowledge Base Not Used

Possible Causes:

Knowledge base not connected
Language mismatch
Content not relevant

Solutions:

Verify knowledge base is selected in Step 5
Ensure KB language matches agent language
Review KB content for relevance

Next Steps

Once your agent is configured and published:

Create a calling campaign to use this agent
Manage knowledge bases for better responses
Monitor calls to see it in action
Review campaign results to analyze performance

Additional Resources

Agent API Reference - Create agents programmatically
Knowledge Base Guide - Create and manage knowledge bases
Create Calling Campaign - Complete campaign creation guide

Was this page helpful? Yes | No

Configure your first AI phone agent

Overview​

Prerequisites​

Step 1: Base Configuration​

Agent Name​

Language​

Voice ID​

LLM Model​

Step 2: Character Configuration​

Primary Role​

Mission​

Additional Instructions (Custom Prompt)​

Welcome Message​

Who Speaks First​

Pause Before Speaking​

Step 3: Voice & Behavior​

Voice Settings​

Voice Model​

Voice Temperature​

Voice Speed​

Volume​

Behavior Settings​

Responsiveness​

Interruption Sensitivity​

Enable Backchannel​

Backchannel Frequency​

Normalize for Speech​

Background Sound (Ambient Sound)​

Ambient Sound Volume​

Step 4: Transcription & Call Settings​

Transcription Settings​

Transcription Mode​

Vocabulary Specialization​

Denoising Mode​

Boosted Keywords​

Call Settings​

End Call After Silence​

Maximum Call Duration​

Ring Duration​

DTMF (Touch-Tone) Support​

Voicemail Detection​

Voicemail Message​

Step 5: Advanced & Knowledge Base​

Knowledge Base​

Webhook Settings​

Webhook URL​

Webhook Timeout​

Tools & Functions​

MCP Servers​

Post-Call Analysis​

Data Storage​

Opt-In Signed URL​

Testing Your Agent​

Test Call​

Test Chat (Simulation)​

Cost Estimate​

Publishing Your Agent​

How to Publish​

Agent Status​

Best Practices​

Agent Configuration​

Voice Selection​

Language & Model​

Knowledge Bases​

Webhooks & Integrations​

Troubleshooting​

Agent Doesn't Answer Calls​

Poor Voice Quality​

Agent Doesn't Understand Questions​

Knowledge Base Not Used​

Next Steps​

Additional Resources​

Overview

Prerequisites

Step 1: Base Configuration

Agent Name

Language

Voice ID

LLM Model

Step 2: Character Configuration

Primary Role

Mission

Additional Instructions (Custom Prompt)

Welcome Message

Who Speaks First

Pause Before Speaking

Step 3: Voice & Behavior

Voice Settings

Voice Model

Voice Temperature

Voice Speed

Volume

Behavior Settings

Responsiveness

Interruption Sensitivity

Enable Backchannel

Backchannel Frequency

Normalize for Speech

Background Sound (Ambient Sound)

Ambient Sound Volume

Step 4: Transcription & Call Settings

Transcription Settings

Transcription Mode

Vocabulary Specialization

Denoising Mode

Boosted Keywords

Call Settings

End Call After Silence

Maximum Call Duration

Ring Duration

DTMF (Touch-Tone) Support

Voicemail Detection

Voicemail Message

Step 5: Advanced & Knowledge Base

Knowledge Base

Webhook Settings

Webhook URL

Webhook Timeout

Tools & Functions

MCP Servers

Post-Call Analysis

Data Storage

Opt-In Signed URL

Testing Your Agent

Test Call

Test Chat (Simulation)

Cost Estimate

Publishing Your Agent

How to Publish

Agent Status

Best Practices

Agent Configuration

Voice Selection

Language & Model

Knowledge Bases

Webhooks & Integrations

Troubleshooting

Agent Doesn't Answer Calls

Poor Voice Quality

Agent Doesn't Understand Questions

Knowledge Base Not Used

Next Steps

Additional Resources