Configure your first AI phone agent
Complete guide to create, configure, test, and publish an AI phone agent that handles customer conversations automatically. In this guide, you'll set up a professional voice agent that can make sales calls, provide customer support, schedule appointments, or handle any other voice interaction use case.
Overview
An AI agent in Agoralia is a voice-enabled AI assistant that can:
- Answer inbound calls automatically
- Make outbound calls for campaigns
- Handle conversations in natural language
- Extract structured data from conversations
- Integrate with knowledge bases for enhanced responses
- Use tools and functions for external integrations
Agents are configured through a 5-step wizard that guides you through all settings, from basic configuration to advanced features.
Prerequisites
Before creating an agent, ensure you have:
- ✅ Admin or editor access to your Agoralia workspace
- ✅ Clear understanding of your use case (sales, support, appointments, etc.)
- ✅ Optional: Knowledge base documents prepared (for enhanced responses)
- ✅ Optional: Phone number configured (for testing)
Step 1: Base Configuration
The first step sets up the fundamental properties of your agent.
Agent Name
Required: Give your agent a descriptive name that reflects its purpose.
Examples:
- "Enterprise Sales Rep"
- "Tech Support Assistant"
- "Appointment Scheduler"
- "Customer Onboarding Agent"
Best Practices:
- Use clear, descriptive names
- Include the use case or department
- Avoid generic names like "Agent 1"
Language
Required: Select the primary language for your agent.
Available Languages: Agoralia supports 150+ languages including:
- English (US, UK, Canada, Australia, etc.)
- Italian, Spanish, French, German
- Chinese, Japanese, Korean
- Arabic, Hindi, Portuguese
- And many more...
How it works:
- Agoralia automatically selects the best voice engine for your chosen language
- English languages use optimized voice engines
- All other languages are fully supported with high-quality voices
Note: You can enable multilingual mode in campaigns to auto-detect the caller's language.
Voice ID
Required: Select the voice that will speak for your agent.
Voice Selection:
- High-quality voices are available for all supported languages
- Voices are automatically filtered based on your selected language
- You can preview voices before selecting them
Voice Properties:
- Gender: Male, Female, Neutral
- Accent: Regional variations (e.g., US English vs. UK English)
- Style: Professional, Casual, Friendly, etc.
Preview: You can preview voices before selecting them using the voice selector dialog.
Best Practices:
- Choose a voice that matches your brand personality
- Test multiple voices to find the best fit
- Consider your target audience's preferences
LLM Model
Required: Select the Large Language Model (LLM) that powers your agent's intelligence.
Available Models:
- GPT-4o: Most capable, best for complex conversations
- GPT-4o-mini: Faster and more cost-effective, good for most use cases
- GPT-3.5-turbo: Legacy model, still supported
- Additional models may be available based on your plan
Model Selection Guide:
- Complex use cases (technical support, sales): Use GPT-4o
- Simple use cases (appointments, FAQs): Use GPT-4o-mini
- Cost-sensitive: Use GPT-4o-mini
- Speed-critical: Use GPT-4o-mini
Note: Available models may vary. Contact support if you need a specific model.
Step 2: Character Configuration
Define your agent's personality, role, and conversation style.
Primary Role
Options:
- Inbound: Agent handles incoming calls only
- Outbound: Agent makes outbound calls only
- Both: Agent handles both inbound and outbound calls
Default: both
Use Cases:
- Inbound only: Customer support, help desk
- Outbound only: Sales outreach, appointment reminders
- Both: General-purpose agents
Mission
Required: Describe what your agent should accomplish.
Examples:
- "Help customers troubleshoot technical issues with our product"
- "Schedule appointments for our dental practice"
- "Qualify leads for our sales team"
- "Provide information about our services and pricing"
Best Practices:
- Be specific and clear
- Include the primary goal
- Mention any constraints or requirements
- Keep it concise (1-2 sentences)
Additional Instructions (Custom Prompt)
Optional: Provide additional instructions for the agent's behavior.
Use Cases:
- Tone and style guidelines
- Specific phrases to use or avoid
- Handling of edge cases
- Compliance requirements
Example:
Always be polite and professional. If the customer seems frustrated,
acknowledge their feelings and offer to escalate to a human agent.
Never make promises about pricing or delivery dates without checking
our system first.
Welcome Message
Optional: Custom greeting message that the agent says when answering a call.
Examples:
- "Hello! Thank you for calling Acme Corp. How can I help you today?"
- "Hi, this is Sarah from Tech Support. What seems to be the issue?"
- "Welcome to our appointment booking service. I'm here to help you schedule your visit."
If not provided: The agent will generate a greeting based on its mission.
Who Speaks First
Options:
- Agent: Agent greets the caller first (default)
- User: Agent waits for the caller to speak first
Default: agent
Use Cases:
- Agent first: Most common, provides immediate greeting
- User first: Useful for inbound calls where caller might have urgent questions
Pause Before Speaking
Range: 0-5000 milliseconds
Default: 0ms
Purpose: Adds a delay before the agent starts speaking (useful for ensuring the call is connected).
Recommendations:
- 0ms: For most use cases
- 500-1000ms: If you experience connection issues
- 2000-5000ms: For very slow connections (not recommended)
Step 3: Voice & Behavior
Fine-tune how your agent sounds and behaves during conversations.
Voice Settings
Voice Model
Optional: Advanced voice model configuration (usually auto-detected from voice ID).
Voice Temperature
Range: 0.0 - 2.0
Default: 1.0
Effect: Controls the randomness and creativity of the voice.
- Lower (0.0-0.7): More consistent, predictable voice
- Higher (1.3-2.0): More varied, expressive voice
Recommendations:
- Professional/Formal: 0.7-1.0
- Friendly/Casual: 1.0-1.3
- Creative/Expressive: 1.3-1.7
Voice Speed
Range: 0.5 - 2.0
Default: 1.0
Effect: Controls how fast the agent speaks.
- Slower (0.5-0.8): Easier to understand, good for complex information
- Normal (0.9-1.1): Standard speaking pace
- Faster (1.2-2.0): Quick delivery, use sparingly
Recommendations:
- Technical support: 0.8-1.0 (slower for clarity)
- Sales: 1.0-1.2 (energetic but clear)
- Appointments: 0.9-1.1 (balanced)
Volume
Range: 0.0 - 2.0
Default: 1.0
Effect: Controls the audio volume level.
Recommendations:
- Most cases: 1.0 (default)
- Quiet environments: 1.2-1.5
- Noisy environments: 0.8-1.0
Behavior Settings
Responsiveness
Range: 0.0 - 1.0
Default: 1.0
Effect: How quickly the agent responds to the user's speech.
- Lower (0.0-0.5): Waits longer before responding (more natural pauses)
- Higher (0.7-1.0): Responds quickly (more conversational)
Recommendations:
- Formal/Professional: 0.6-0.8
- Casual/Friendly: 0.8-1.0
Interruption Sensitivity
Range: 0.0 - 1.0
Default: 1.0
Effect: How easily the agent allows the user to interrupt.
- Lower (0.0-0.5): Agent continues speaking, harder to interrupt
- Higher (0.7-1.0): Agent stops quickly when user speaks
Recommendations:
- Information-heavy: 0.5-0.7 (less interruptions)
- Conversational: 0.8-1.0 (more interruptions allowed)
Enable Backchannel
Default: Disabled
Effect: Agent makes small verbal acknowledgments ("uh-huh", "I see", "okay") during user speech.
Use Cases:
- Enabled: More natural, conversational feel
- Disabled: More formal, professional feel
When Enabled:
Backchannel Frequency
Range: 0.0 - 1.0
Default: 0.8
Effect: How often the agent makes backchannel sounds.
Normalize for Speech
Default: Disabled
Effect: Automatically converts text to speech-friendly format (e.g., "Dr." → "Doctor", "$100" → "one hundred dollars").
Use Cases:
- Enabled: When agent reads structured data (dates, numbers, abbreviations)
- Disabled: For natural conversation
Background Sound (Ambient Sound)
Options:
- None (default)
- Coffee Shop: Subtle café ambiance
- Convention Hall: Large room reverb
- Summer Outdoor: Outdoor environment
- Mountain Outdoor: Natural outdoor setting
- Static Noise: Background static
- Call Center: Office/call center ambiance
Use Cases:
- None: Most professional use cases
- Coffee Shop: More casual, friendly feel
- Call Center: Traditional call center experience
When Enabled:
Ambient Sound Volume
Range: 0.0 - 2.0
Default: 1.0
Effect: Volume of the background sound relative to the agent's voice.
Recommendations: Keep below 0.5 to avoid distracting from the conversation.
Step 4: Transcription & Call Settings
Configure how the agent processes speech and manages calls.
Transcription Settings
Transcription Mode
Options:
- Fast: Lower latency, good for real-time conversations (default)
- Accurate: Higher accuracy, slightly higher latency
Recommendations:
- Fast: For most use cases, real-time conversations
- Accurate: For technical terms, medical terminology, or when accuracy is critical
Vocabulary Specialization
Options:
- General (default): Standard vocabulary
- Medical: Optimized for medical terminology
Use Cases:
- General: Most use cases
- Medical: Healthcare, medical appointments, pharmaceutical sales
Denoising Mode
Options:
- Noise Cancellation: Removes background noise only
- Noise and Background Speech Cancellation: Removes both noise and background speech
Recommendations:
- Noise Cancellation: For most environments
- Noise and Background Speech: For noisy call centers or open offices
Boosted Keywords
Optional: Comma-separated list of keywords that should be recognized with higher priority.
Format: keyword1, keyword2, keyword3
Examples:
- Product names:
AcmePro, Enterprise Edition, Premium Plan - Technical terms:
API, SDK, webhook, integration - Industry terms:
GDPR, HIPAA, compliance
Use Cases:
- Product names or brand names
- Technical terminology
- Industry-specific terms
- Acronyms
Call Settings
End Call After Silence
Range: 10 seconds minimum
Default: 600 seconds (10 minutes)
Effect: Automatically ends the call if there's no speech for this duration.
Recommendations:
- Short calls (appointments, FAQs): 30-60 seconds
- Medium calls (support, sales): 60-120 seconds
- Long calls (consultations): 300-600 seconds
Maximum Call Duration
Range: 60 seconds (1 minute) - 7200 seconds (2 hours)
Default: 3600 seconds (60 minutes)
Effect: Maximum length of a call before automatic termination.
Recommendations:
- Appointments: 300-600 seconds (5-10 minutes)
- Support: 600-1800 seconds (10-30 minutes)
- Sales: 1800-3600 seconds (30-60 minutes)
- Consultations: 3600-7200 seconds (60-120 minutes)
Ring Duration
Range: 5-90 seconds
Default: 30 seconds
Effect: How long the phone rings before the call is considered unanswered.
Recommendations:
- Most cases: 30 seconds
- High-volume: 20 seconds
- Important calls: 60-90 seconds
DTMF (Touch-Tone) Support
Default: Enabled
Effect: Allows users to press numbers on their phone keypad to interact with the agent.
Use Cases:
- Menu navigation ("Press 1 for Sales, Press 2 for Support")
- Confirmation ("Press 1 to confirm")
- Data entry (phone numbers, account numbers)
When Enabled: You can configure DTMF options in the agent's advanced settings.
Voicemail Detection
Default: Disabled
Effect: Automatically detects when a call goes to voicemail and leaves a message.
When Enabled:
Voicemail Message
Optional: Custom message to leave on voicemail.
Example:
Hi, this is Sarah from Acme Corp. I'm calling to follow up on your
inquiry. Please call us back at 1-800-555-1234 or visit our website
at acme.com. Thank you!
If not provided: The agent will generate a message based on its mission.
Step 5: Advanced & Knowledge Base
Configure advanced features like knowledge bases, webhooks, tools, and integrations.
Knowledge Base
Optional: Connect your agent to knowledge bases for enhanced responses.
How it works:
- Agent can reference knowledge base content during conversations
- Knowledge bases can contain FAQs, product information, policies, etc.
- Multiple knowledge bases can be connected to a single agent
Available Knowledge Bases:
- General KB: Automatically available to all agents
- Custom KBs: Created in the Knowledge Base section
Selection:
- Check the boxes next to knowledge bases you want to connect
- Knowledge bases are filtered by language (agent language must match KB language)
- Synced knowledge bases are marked with "✓ Synced"
Best Practices:
- Connect relevant knowledge bases for your use case
- Keep knowledge bases up-to-date
- Use language-specific knowledge bases for multilingual agents
Webhook Settings
Optional: Configure webhooks to receive call events in real-time.
Webhook URL
Format: Valid HTTPS URL
Example: https://your-backend.com/webhooks/agoralia
Events Sent:
- Call started
- Call ended
- Call transcript updates
- Extracted data
- Call outcomes
Use Cases:
- Real-time call monitoring
- CRM integration
- Custom analytics
- Alert systems
Webhook Timeout
Range: 0+ milliseconds
Default: 5000ms (5 seconds)
Effect: Maximum time to wait for webhook response before timing out.
Recommendations:
- Fast webhooks: 3000-5000ms
- Slow webhooks: 10000-30000ms
Tools & Functions
Optional: Add custom tools/functions that the agent can call during conversations.
Available Tools:
- Custom Functions: Define your own functions
- MCP Servers: Connect to Model Context Protocol servers
- API Integrations: Connect to external APIs
Use Cases:
- Calendar Integration: Check availability, book appointments
- CRM Integration: Look up customer data, update records
- Database Queries: Retrieve information
- API Calls: External service integration
Configuration:
- Tools are configured in the Functions Editor
- Each tool has a name, description, and parameters
- Agent automatically decides when to use tools based on conversation context
MCP Servers
Optional: Connect Model Context Protocol (MCP) servers for advanced integrations.
MCP Servers:
- Provide structured access to external data and functions
- Enable agents to interact with complex systems
- Support authentication and secure connections
Configuration:
- Select MCP servers from your configured servers
- Servers must be set up in Settings → Integrations → MCP Servers
- Agent can use multiple MCP servers simultaneously
Post-Call Analysis
Optional: Configure automatic analysis of call transcripts after calls end.
Analysis Types:
- Sentiment Analysis: Determine caller sentiment
- Topic Extraction: Identify main topics discussed
- Action Items: Extract follow-up actions
- Summary Generation: Create call summaries
Model: Select the LLM model to use for analysis (default: GPT-4o-mini)
Use Cases:
- Quality assurance
- Call categorization
- Performance metrics
- Training data generation
Data Storage
Options:
- Everything: Store all call data including transcripts and audio
- Everything Except PII: Store data but redact personally identifiable information
- Basic Attributes Only: Store only metadata (duration, outcome, etc.)
Default: Everything
Recommendations:
- GDPR/Privacy Compliance: Use "Everything Except PII" or "Basic Attributes Only"
- Full Analytics: Use "Everything"
- Cost Optimization: Use "Basic Attributes Only"
Opt-In Signed URL
Default: Disabled
Effect: Generate signed URLs for accessing call recordings (requires opt-in).
Use Cases:
- Secure access to call recordings
- Time-limited access links
- Compliance with data retention policies
Testing Your Agent
Before publishing, test your agent to ensure it works correctly.
Test Call
How to Test:
- Go to the agent's edit page
- Click "Test Call" button
- Enter a phone number (your own number for testing)
- Click "Start Test Call"
- Answer the call and have a conversation
What to Test:
- ✅ Agent answers correctly
- ✅ Voice quality is good
- ✅ Agent understands your questions
- ✅ Agent responds appropriately
- ✅ Agent follows instructions correctly
- ✅ Call ends properly
Test Chat (Simulation)
How to Test:
- Go to the agent's edit page
- Click the "Simulation" tab
- Type messages as if you were calling
- Review the agent's responses
What to Test:
- ✅ Agent understands context
- ✅ Responses are relevant
- ✅ Agent follows instructions
- ✅ Knowledge base content is used correctly
- ✅ Tools/functions are called when needed
Cost Estimate
How to View:
- Cost estimate is shown automatically when configuring the agent
- Based on selected language, voice, and LLM model
- Updates in real-time as you change settings
What's Included:
- Per-minute rates for voice
- Per-minute rates for LLM
- Estimated cost per call
- Estimated cost per 1000 calls
Publishing Your Agent
Once your agent is configured and tested, publish it to make it available for campaigns.
How to Publish
- Complete all 5 steps of the agent configuration
- Review all settings
- Click "Save" or "Publish" button
- Agent is now available for use in campaigns
Agent Status
Draft: Agent is saved but not published (can be edited) Published: Agent is available for campaigns (can still be edited)
Note: Changes to published agents take effect immediately for new calls. Existing calls continue with the previous configuration.
Best Practices
Agent Configuration
- ✅ Start simple: Begin with basic settings, add complexity as needed
- ✅ Test thoroughly: Test with real calls before deploying to production
- ✅ Monitor performance: Review call transcripts and adjust settings
- ✅ Iterate: Improve agents based on real-world usage
Voice Selection
- ✅ Match brand: Choose voices that match your brand personality
- ✅ Consider audience: Test with your target audience
- ✅ Test multiple: Try different voices to find the best fit
Language & Model
- ✅ Use appropriate model: GPT-4o for complex, GPT-4o-mini for simple
- ✅ Consider costs: Balance capability with cost
- ✅ Test performance: Ensure model handles your use case well
Knowledge Bases
- ✅ Keep updated: Regularly update knowledge base content
- ✅ Be specific: Include detailed, accurate information
- ✅ Test relevance: Ensure KB content is actually used
Webhooks & Integrations
- ✅ Handle errors: Implement proper error handling in webhooks
- ✅ Test endpoints: Verify webhooks work before deploying
- ✅ Monitor performance: Track webhook response times
Troubleshooting
Agent Doesn't Answer Calls
Possible Causes:
- Agent is not published
- Phone number not configured correctly
- Agent configuration incomplete
Solutions:
- Verify agent status is "Published"
- Check phone number configuration
- Test agent using "Test Call" feature
Poor Voice Quality
Possible Causes:
- Voice selection mismatch
- Network issues
- Voice settings configuration
Solutions:
- Try different voice
- Check network connection
- Adjust voice settings (speed, volume)
Agent Doesn't Understand Questions
Possible Causes:
- Language mismatch
- Transcription issues
- LLM model limitations
Solutions:
- Verify language is correct
- Check transcription settings
- Try a more capable LLM model (GPT-4o)
Knowledge Base Not Used
Possible Causes:
- Knowledge base not connected
- Language mismatch
- Content not relevant
Solutions:
- Verify knowledge base is selected in Step 5
- Ensure KB language matches agent language
- Review KB content for relevance
Next Steps
Once your agent is configured and published:
- Create a calling campaign to use this agent
- Manage knowledge bases for better responses
- Monitor calls to see it in action
- Review campaign results to analyze performance
Additional Resources
- Agent API Reference - Create agents programmatically
- Knowledge Base Guide - Create and manage knowledge bases
- Create Calling Campaign - Complete campaign creation guide