Customer Support Agent Swarm¶

An interactive, console-based customer support system using multi-agents and free vector database (ChromaDB).

Overview¶

This example demonstrates a practical customer support system with:

Console-Based Interface: Simple terminal interaction, no web services required
Free Vector Database: ChromaDB for knowledge storage and retrieval
Multi-Agent Workflow: Triage and support agents working together
Real-Time Knowledge Retrieval: Agents query relevant information from vector DB
Interactive & Demo Modes: Test with your own queries or use pre-defined examples

Key Features¶

✅ No Cloud Services: Runs completely locally ✅ Free Dependencies: ChromaDB is open-source and free ✅ Semantic Search: Vector similarity for intelligent knowledge retrieval ✅ Intelligent Caching: Automatic response caching saves tokens and improves speed ✅ Production-Ready: Easy to extend and deploy

Architecture¶

graph TB
    A[Customer Query] --> B[ChromaDB Vector Search]
    B --> C[Relevant Knowledge Retrieved]
    C --> D[Triage Agent]
    D --> E[Support Agent]
    E --> F[Final Response]
    G[Knowledge Base] -.->|8 Entries| B

Workflow: 1. Customer asks a question via console 2. System queries ChromaDB for relevant knowledge 3. Triage agent categorizes the inquiry 4. Support agent formulates response using retrieved knowledge 5. Customer receives answer

Components¶

1. ChromaDB Vector Database¶

Purpose: Stores and retrieves company knowledge using semantic search

Contents: - Products: CloudSync Pro (\(29.99/mo), DataVault Enterprise (\)99.99/mo) - Troubleshooting: Login issues, sync problems - Billing: Refunds, upgrades, pricing - Policies: Privacy (GDPR/SOC2), SLA (99.9% uptime)

Features: - Free and open-source - Runs locally (no cloud required) - In-memory or persistent storage - Semantic similarity search

2. Triage Agent¶

Purpose: First point of contact, categorizes inquiries

Capabilities: - Understands customer issues - Categorizes by type (technical, billing, product) - Provides concise summary - Uses knowledge base context

3. Support Agent¶

Purpose: Provides solutions using retrieved knowledge

Capabilities: - Technical troubleshooting - Billing assistance - Product information - Contextual responses from vector DB

4. Response Caching System¶

Purpose: Automatically cache and reuse responses for similar queries

How It Works: 1. Every customer query is checked against cached responses 2. If a similar query exists (85%+ similarity), cached response is returned 3. New responses are automatically saved to cache 4. Saves API tokens and improves response time

Benefits: - Token Savings: Reuses responses instead of calling LLM - Faster Responses: Instant retrieval from vector DB - Consistent Answers: Same questions get same quality answers - Learning System: Gets better over time as cache grows

Quick Start¶

Installation¶

# Install ChromaDB (free vector database)
pip install chromadb

# Install Swarms (if not already installed)
pip install swarms

Setup¶

# Set your OpenAI API key
export OPENAI_API_KEY="your-api-key-here"

Run Interactive Mode¶

cd examples/multi_agent
python customer_support_swarm.py

This starts an interactive console where you can ask questions:

🎯 TECHCORP CUSTOMER SUPPORT - INTERACTIVE CONSOLE
================================================================================

Powered by:
  • Multi-Agent System (Swarms)
  • ChromaDB Vector Database (free, local)
  • Real-time Knowledge Retrieval

Type 'quit' or 'exit' to end the session
Type 'help' to see example questions
================================================================================

--------------------------------------------------------------------------------

👤 You: I can't log into my account

🔍 Searching knowledge base...
🤖 Processing with support agents...

================================================================================
🤖 SUPPORT AGENT:
================================================================================

I understand you're having trouble logging in. Here's how to resolve this:

DIAGNOSIS: Login authentication issue

SOLUTION:
1. Clear your browser cache and cookies
2. Use the password reset feature at cloudpro.com/reset
3. Verify your email address is correct
4. Ensure your account is still active

FOLLOW_UP: If these steps don't resolve the issue, contact support@techcorp.com
Our 24/7 support team is available via chat, email, or phone.

================================================================================

Run Demo Mode¶

python customer_support_swarm.py --demo

Runs 3 pre-defined queries to demonstrate the system

Code Overview¶

Initialize ChromaDB¶

import chromadb
from chromadb.config import Settings

# Initialize ChromaDB (in-memory, free, no setup)
chroma_client = chromadb.Client(Settings(
    anonymized_telemetry=False,
    allow_reset=True
))

# Create collection for company knowledge
knowledge_collection = chroma_client.get_or_create_collection(
    name="company_knowledge",
    metadata={"description": "TechCorp product information"}
)

Load Knowledge into Vector DB¶

KNOWLEDGE_ENTRIES = [
    {
        "id": "product_cloudsync",
        "text": "CloudSync Pro is priced at $29.99/month with unlimited storage...",
        "metadata": {"category": "product", "product": "CloudSync Pro"}
    },
    # ... more entries
]

# Populate database
for entry in KNOWLEDGE_ENTRIES:
    knowledge_collection.add(
        ids=[entry["id"]],
        documents=[entry["text"]],
        metadatas=[entry["metadata"]]
    )

Caching System¶

# Create conversation history collection for caching
conversation_history = chroma_client.get_or_create_collection(
    name="conversation_history",
    metadata={"description": "Past queries and responses"}
)

def check_cached_response(query: str, similarity_threshold: float = 0.85):
    """Check if similar query was already answered"""
    results = conversation_history.query(
        query_texts=[query],
        n_results=1
    )

    # If similarity > 85%, return cached response
    if results["distances"][0][0] < 0.3:  # Low distance = high similarity
        return True, results["metadatas"][0][0]["response"]

    return False, ""

def save_to_cache(query: str, response: str):
    """Save query-response pair for future reuse"""
    conversation_history.add(
        ids=[f"conv_{hash(query)}_{time.time()}"],
        documents=[query],
        metadatas=[{"response": response, "timestamp": time.time()}]
    )

Query Vector Database¶

def query_knowledge_base(query: str, n_results: int = 3) -> str:
    """Query the vector database for relevant knowledge"""
    results = knowledge_collection.query(
        query_texts=[query],
        n_results=n_results
    )

    # Format and return relevant knowledge
    knowledge = "Relevant Knowledge Base Information:\n\n"
    for doc, metadata in zip(results["documents"][0], results["metadatas"][0]):
        knowledge += f"[{metadata['category']}] {doc}\n\n"

    return knowledge

Create Agents¶

from swarms import Agent

def create_support_agents():
    triage_agent = Agent(
        agent_name="Triage-Agent",
        system_prompt="You are a customer support triage specialist...",
        max_loops=1,
        verbose=False,
    )

    support_agent = Agent(
        agent_name="Support-Agent",
        system_prompt="You are a TechCorp support specialist...",
        max_loops=1,
        verbose=False,
    )

    return [triage_agent, support_agent]

Handle Support Query¶

from swarms.structs.sequential_workflow import SequentialWorkflow

def handle_support_query(customer_query: str, agents: list) -> str:
    # Step 1: Query vector database
    relevant_knowledge = query_knowledge_base(customer_query, n_results=3)

    # Step 2: Enrich query with knowledge
    enriched_query = f"""Customer Query: {customer_query}

{relevant_knowledge}

Based on the customer query and knowledge base information, provide a response."""

    # Step 3: Run through agent workflow
    workflow = SequentialWorkflow(
        name="Support-Workflow",
        agents=agents,
        max_loops=1,
    )

    return workflow.run(enriched_query)

Example Queries¶

The system can handle various types of customer support queries:

Technical Support¶

👤 You: I can't log into my account

Vector DB Retrieves: - Troubleshooting guide for login issues - Account verification steps - Contact information for support

Agent Response: - Diagnoses the issue - Provides step-by-step solution - Offers escalation path

Product Information¶

👤 You: What's the difference between CloudSync Pro and DataVault Enterprise?

Vector DB Retrieves: - Product specifications - Pricing information - Feature comparisons

Agent Response: - Explains key differences - Recommends based on user needs - Mentions pricing and trials

Billing Questions¶

👤 You: How do I get a refund?

Vector DB Retrieves: - Refund policy - Process steps - Timeline expectations

Agent Response: - Explains 30-day money-back guarantee - Provides refund request steps - Sets timeline expectations

Caching in Action¶

First Query:

👤 You: I can't log into my account

💾 Checking cache for similar queries...
❌ No similar query found in cache. Processing with agents...
🔍 Searching knowledge base...
🤖 Processing with support agents...

🤖 SUPPORT AGENT:
I understand you're having trouble logging in...
[Full response]

💾 Saving response to cache for future queries...
✅ Cached successfully!

Similar Query Later:

👤 You: Can't sign in to my account

💾 Checking cache for similar queries...
✅ Found cached response! (Saving tokens 🎉)

🤖 SUPPORT AGENT (from cache):
I understand you're having trouble logging in...
[Same cached response - instant, no LLM call!]

💡 This response was retrieved from cache. No tokens used! 🎉

Customization¶

Adding More Knowledge¶

Expand the knowledge base with more entries:

KNOWLEDGE_ENTRIES.append({
    "id": "feature_api_access",
    "text": "API access is available on DataVault Enterprise plan. Includes REST API, webhooks, and 10,000 requests/month. Documentation at api.techcorp.com",
    "metadata": {"category": "product", "feature": "api"}
})

Adding More Agents¶

Create specialized agents for specific needs:

def create_expanded_support_agents():
    # ... existing agents ...

    escalation_agent = Agent(
        agent_name="Escalation-Agent",
        system_prompt="Review cases and determine if human intervention is needed.",
        max_loops=1,
    )

    return [triage_agent, support_agent, escalation_agent]

Adjusting Cache Sensitivity¶

Control when to use cached responses:

# More aggressive caching (70% similarity)
response, cached = handle_support_query(query, agents, cache_threshold=0.70)

# Stricter caching (90% similarity)
response, cached = handle_support_query(query, agents, cache_threshold=0.90)

# Disable caching
response, cached = handle_support_query(query, agents, use_cache=False)

Recommended Thresholds: - 0.85 (default) - Good balance, catches paraphrased questions - 0.90 - Strict, only very similar questions - 0.70 - Aggressive, broader matching (more token savings, less accuracy)

Persistent Storage¶

For production, save ChromaDB to disk:

from chromadb import PersistentClient

# Use persistent storage instead of in-memory
chroma_client = PersistentClient(path="./chroma_db")

# Database persists between runs (including cached responses!)

Load Knowledge from Files¶

import json

# Load knowledge from JSON file
with open("knowledge_base.json", "r") as f:
    knowledge_entries = json.load(f)

# Populate database
for entry in knowledge_entries:
    knowledge_collection.add(
        ids=[entry["id"]],
        documents=[entry["text"]],
        metadatas=[entry["metadata"]]
    )

Production Considerations¶

1. Knowledge Management¶

Regular Updates: Keep knowledge base current with product changes
Version Control: Track knowledge base changes in Git
Quality Control: Review and validate new entries before adding
Categorization: Use consistent metadata categories for better retrieval

2. Performance Optimization¶

Batch Loading: Load knowledge in batches for faster startup
Cache Results: Cache frequently queried knowledge
Index Optimization: ChromaDB handles indexing automatically
Retrieval Tuning: Adjust n_results based on query complexity

3. Monitoring & Analytics¶

Track Queries: Log all customer queries for analysis
Measure Accuracy: Monitor if retrieved knowledge is relevant
Response Time: Track end-to-end response latency
User Feedback: Collect satisfaction ratings

4. Security¶

API Key Protection: Never hardcode API keys
Input Validation: Sanitize user inputs
Rate Limiting: Prevent abuse with rate limits
Data Privacy: Don't store sensitive customer data in logs

Best Practices¶

1. Vector Database¶

Start with in-memory for development
Use persistent storage for production
Regularly backup your database
Monitor database size and performance

2. Agent Design¶

Keep prompts focused and specific
Use verbose=False in production for cleaner output
Test agents independently before integration
Monitor agent response quality

3. Knowledge Base¶

Structure entries clearly and consistently
Use meaningful IDs and metadata
Include examples in knowledge entries
Update based on common queries

Why ChromaDB?¶

Advantages: - ✅ Free and open-source - ✅ No external dependencies - ✅ Simple 3-line setup - ✅ Fast semantic search - ✅ Works offline - ✅ Perfect for prototypes and production

Alternatives: - FAISS (Facebook) - Faster for very large datasets - Pinecone - Managed cloud service (paid) - Weaviate - Full-featured vector DB (more complex) - Qdrant - Open-source with advanced features

Next Steps¶

Try the demo: Run python customer_support_swarm.py --demo
Interactive mode: Run without flags for console chat
Customize: Add your own knowledge entries
Extend: Add more specialized agents
Deploy: Switch to persistent storage for production

Sequential Workflow - Linear agent workflows
Agent Rearrange - Dynamic agent routing
Multi-Agent Collaboration - Team coordination patterns

Support¶

Documentation: docs.swarms.world
Discord: Join the community
GitHub: Report issues

Built with ❤️ using Swarms and ChromaDB