A production-ready, enterprise-grade multi-agent AI system deployed on Microsoft Azure using Infrastructure as Code (Terraform). This platform provides a secure, scalable, and cost-optimized environment for running containerized AI agents with comprehensive networking, data storage, monitoring, and security features.
Architecture diagram showing the complete Azure infrastructure
- β 5 Containerized AI Agents on Azure Container Instances with auto-scaling
- β Azure Cosmos DB for conversation state storage (400-4000 RU/s autoscale)
- β Azure Cache for Redis for session management (250MB - 4GB configurable)
- β Application Gateway with Web Application Firewall (WAF)
- β Container Registry with geo-replication support
- β Key Vault for secrets management
- β Application Insights + Log Analytics for observability
- π Private Virtual Network with isolated subnets
- π Network Security Groups with least-privilege rules
- π Private endpoints for all backend services
- π Managed Identities (zero credential management)
- π TLS 1.3 enforcement
- π WAF with OWASP 3.2 ruleset
- π Customer-managed encryption keys
- π Independent agent scaling (2-10 instances per agent)
- π Cosmos DB autoscaling (400-4000 RU/s)
- π Redis capacity scaling (Basic to Premium tiers)
- π Application Gateway autoscaling
- π Application Insights for APM
- π Centralized logging with Log Analytics
- π Custom metrics and dashboards
- π Pre-configured alerts (CPU, memory, errors, latency)
- π Budget alerts at 80%, 95%, and 100% thresholds
- Azure Subscription with appropriate permissions
- Terraform 1.5.0 or later (Install)
- Azure CLI 2.40 or later (Install)
- Docker Desktop (for building agent images) (Install)
- Git for version control
# Clone the repository
git clone https://github.com/Remaker-Digital/multi-agent-service-platform.git
cd multi-agent-service-platform/terraform
# Authenticate with Azure
az login
az account set --subscription "YOUR_SUBSCRIPTION_ID"
# Make scripts executable (Unix/Mac/Git Bash)
chmod +x scripts/*.sh
# Deploy to development environment
./scripts/init.sh dev
./scripts/plan.sh dev
./scripts/apply.sh devFor detailed deployment instructions, see the Deployment Guide.
multi-agent-service-platform/
βββ README.md # This file
βββ LICENSE # MIT License
βββ CONTRIBUTING.md # Contribution guidelines
βββ claude.md # Project context for AI assistants
β
βββ terraform/ # Infrastructure as Code
β βββ main.tf # Root module
β βββ variables.tf # Variable definitions
β βββ outputs.tf # Output values
β βββ modules/ # 7 Terraform modules
β βββ environments/ # Environment configs (dev/staging/prod)
β βββ scripts/ # Deployment automation
β βββ docs/ # Detailed documentation
β
βββ agents/ # Agent applications
β βββ conversation-agent/ # Conversational AI agent
β βββ analysis-agent/ # Data analysis agent
β βββ recommendation-agent/ # Recommendation engine
β βββ knowledge-agent/ # Knowledge base manager
β βββ orchestration-agent/ # Multi-agent orchestrator
β
βββ docker/ # Docker configurations
β βββ docker-compose.yml # Local development setup
β βββ base/ # Base Docker images
β
βββ .github/ # GitHub configurations
β βββ workflows/ # CI/CD pipelines
β
βββ docs/ # Additional documentation
βββ architecture.md # Architecture details
βββ setup/ # Setup guides
βββ images/ # Diagrams and screenshots
| Environment | Monthly Cost | Use Case |
|---|---|---|
| Development | ~$500 | Development and testing |
| Staging | ~$2,000 | Pre-production validation |
| Production | ~$5,000 | Live production workloads |
Cost optimization features:
- Environment-specific SKUs (Basic for dev, Premium for prod)
- Autoscaling to match demand
- Budget alerts and monitoring
- Optional geo-replication (production only)
The platform consists of 7 modular Terraform components:
- Networking - VNet, subnets, NSGs, private DNS zones
- Security - Key Vault, managed identities, RBAC
- Container Registry - ACR with geo-replication
- Data Layer - Cosmos DB and Redis
- Observability - Application Insights and Log Analytics
- Agent Infrastructure - Container instances with autoscaling
- Gateway - Application Gateway with WAF
For detailed architecture documentation, see Architecture Guide.
Create a terraform/environments/<env>/terraform.tfvars file:
# Core Configuration
project_name = "multiagent-ai"
environment = "dev"
location = "eastus"
# Alert Configuration
alert_email_addresses = ["info@remakerdigital.com"]
# Budget
monthly_budget_amount = 500
# Agent Configuration
agents = {
agent1 = {
name = "conversation-agent"
description = "Handles conversational interactions"
port = 8080
}
# Add more agents...
}See terraform.tfvars.example for all available options.
Build and push agent images:
# Build agent image
cd agents/conversation-agent
docker build -t conversation-agent:latest .
# Login to Azure Container Registry
az acr login --name <your-registry-name>
# Tag and push
docker tag conversation-agent:latest <registry>.azurecr.io/conversation-agent:latest
docker push <registry>.azurecr.io/conversation-agent:latestFor local development:
cd docker
docker-compose upAccess Application Insights for real-time monitoring:
APP_INSIGHTS_URL=$(terraform output -raw app_insights_id)
echo "https://portal.azure.com/#resource/$APP_INSIGHTS_URL"Query logs using KQL:
az monitor log-analytics query \
--workspace $(terraform output -raw log_analytics_workspace_id) \
--analytics-query "ContainerInstanceLog_CL | take 100"- Zero Trust Architecture - No public access to backend services
- Managed Identities - Eliminates credential management
- Private Endpoints - All data services isolated from internet
- WAF Protection - OWASP 3.2 + Bot detection + Rate limiting
- Encryption - TLS 1.3 in transit, customer-managed keys at rest
For security best practices, see Security Guide.
GitHub Actions workflows are included for:
- Terraform validation and formatting
- Infrastructure deployment (with approvals)
- Container image building and pushing
- Security scanning
See .github/workflows/ for workflow definitions.
- Deployment Guide - Step-by-step deployment instructions
- Quick Reference - Common commands
- Files Summary - Complete file inventory
- Architecture Guide - Detailed architecture documentation
- Security Guide - Security best practices
We welcome contributions! Please see CONTRIBUTING.md for details on:
- Code of conduct
- Development setup
- Pull request process
- Coding standards
This project is licensed under the MIT License - see the LICENSE file for details.
- Built with Terraform
- Deployed on Microsoft Azure
- Documentation generated with assistance from Claude AI
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Email: info@remakerdigital.com
- Terraform infrastructure modules
- Environment configurations (dev/staging/production)
- Security hardening with private endpoints
- Comprehensive monitoring and alerting
- Sample AI agent implementations
- Kubernetes alternative (AKS)
- Multi-region deployment support
- Advanced autoscaling policies
- Terraform Cloud integration
If you find this project helpful, please consider giving it a star on GitHub!
Built with β€οΈ for the AI community