Skip to main content
Skip to main content

High Availability and Disaster Recovery

IronWiFi's cloud RADIUS infrastructure is designed for high availability out of the box. Every Network provides two RADIUS servers for failover, and multi-region deployment supports geographic redundancy. This guide covers how to maximize uptime with proper configuration, failover planning, backup strategies, and disaster recovery procedures.

IronWiFi's Built-In Redundancy

Dual RADIUS Servers

Every Network in IronWiFi provides two RADIUS server IP addresses:

  • Primary server -- Handles authentication and accounting requests
  • Backup server -- Automatically available if the primary is unreachable

Both servers share the same configuration, user data, and policies. There is no manual synchronization required -- IronWiFi manages this in the cloud.

How Failover Works

  1. Your access point sends an authentication request to the primary RADIUS server
  2. If the primary does not respond within the timeout period, the AP retries
  3. After the configured number of retries, the AP switches to the backup server
  4. The backup server processes the request using the same user database
  5. When the primary recovers, the AP returns to using it (behavior varies by vendor)
warning

Failover only works if you configure both RADIUS servers on your access points. If you only configure the primary, there is no automatic failover. Always configure both.

Cloud Infrastructure Resilience

IronWiFi's cloud platform provides:

  • Geographic distribution -- RADIUS servers in multiple data centers
  • Automatic scaling -- Capacity adjusts to handle authentication load
  • Data replication -- User and configuration data replicated across infrastructure
  • 24/7 monitoring -- IronWiFi monitors platform health continuously

Multi-Region Deployment

For organizations with access points in multiple geographic regions, create separate Networks in each region to minimize latency and provide geographic redundancy.

Architecture

Setting up multi-region

  1. Navigate to Networks in the IronWiFi Console
  2. Click Create Network and select a region (e.g., US East)
  3. Create a second Network and select another region (e.g., Europe West)
  4. Configure US access points to use the US Network's RADIUS servers
  5. Configure EU access points to use the EU Network's RADIUS servers

Users and Groups are account-wide -- they work across all Networks automatically.

tip

For critical deployments, you can configure access points to use a remote region's RADIUS servers as a tertiary fallback. For example, US access points could use the EU RADIUS servers as a last resort if both US servers are unreachable.

Cross-region failover

Most enterprise access points support configuring multiple RADIUS servers with priority. Use this to set up cross-region failover:

PriorityServerRegionPurpose
1 (primary)US RADIUS PrimaryUS EastNormal operation
2 (backup)US RADIUS BackupUS EastSame-region failover
3 (tertiary)EU RADIUS PrimaryEurope WestCross-region failover

Check your access point documentation for the maximum number of RADIUS servers supported and how priority is configured.

Access Point Configuration for HA

Timeout and retry settings

Proper timeout configuration is critical for fast failover:

SettingRecommended ValueReason
RADIUS timeout3-5 secondsTime to wait for a response before retrying
Retry count3Number of retries before switching to backup
Dead time300 seconds (5 min)How long to avoid a failed server before retrying it
Accounting interim300 seconds (5 min)Frequency of accounting updates

With these settings, failover to the backup server happens within 9-15 seconds (3 retries x 3-5 seconds each).

Vendor-specific HA configuration

Cisco Meraki

Meraki supports primary and secondary RADIUS servers natively:

  1. Navigate to Wireless > Access Control
  2. Under RADIUS servers, add both IronWiFi Primary and Backup IPs
  3. Meraki automatically fails over to the secondary if the primary is unresponsive

Ubiquiti UniFi

  1. In the UniFi Controller, navigate to Settings > WiFi
  2. Edit your SSID and expand RADIUS settings
  3. Add the primary server (IP, port, secret)
  4. Click Add RADIUS Server to add the backup
  5. UniFi tries servers in order, failing over automatically

Aruba / HPE

  1. Configure primary and backup RADIUS servers in the server group
  2. Set the
    dead-time
    parameter for failed server recovery
  3. Enable RADIUS server health monitoring

MikroTik

MikroTik tries RADIUS servers in order. If the first fails, it moves to the next.

See the Configuration Guides for complete setup instructions for your specific hardware.

RADIUS caching for offline resilience

Enable RADIUS caching on access points that support it. Caching stores recently authenticated credentials locally so users can reconnect even during a complete RADIUS outage.

Benefits:

  • Previously authenticated users reconnect without reaching IronWiFi
  • Handles brief network outages transparently
  • Reduces authentication latency for repeat connections

See RADIUS Caching & Failover for detailed caching configuration.

Recovery Time and Recovery Point Objectives

Definitions

TermDefinitionIronWiFi Context
RTO (Recovery Time Objective)Maximum acceptable downtimeHow quickly authentication resumes after a failure
RPO (Recovery Point Objective)Maximum acceptable data lossHow much configuration/user data you can afford to lose

IronWiFi HA targets

ScenarioRTORPOMitigation
Single RADIUS server failure9-15 seconds0 (no data loss)AP fails over to backup server
Regional outage9-15 seconds0Cross-region failover (if configured)
RADIUS caching active0 (transparent)0Local cache handles authentication
Complete platform outageDepends on caching0 (cloud-replicated)RADIUS caching + IronWiFi platform recovery
Configuration errorMinutes0Restore from backup (see below)

Improving your RTO

  1. Configure both RADIUS servers -- Reduces single-server RTO to seconds
  2. Enable RADIUS caching -- Eliminates perceived downtime for cached users
  3. Use multi-region Networks -- Provides geographic resilience
  4. Lower AP timeout values -- Faster failover (but avoid timeouts so short they cause false failovers)
  5. Set up monitoring -- Detect issues before users report them (see Monitoring and Alerting)

Backup Strategies

What to back up

DataBackup MethodFrequency
User accounts and attributesAPI exportWeekly or after bulk changes
Group configurationsAPI export or manual documentationAfter policy changes
Network settingsDocument RADIUS IPs, ports, secretsAfter Network creation
Captive portal configurationDocument settings and custom HTMLAfter portal changes
Access point RADIUS settingsNetwork management tool backupAfter configuration changes

Automated backup via API

Use the REST API to export your IronWiFi configuration:

Schedule these exports using cron or a similar scheduler. See Backup and Restore for complete backup procedures.

Documenting your configuration

Maintain a configuration document that records:

  • IronWiFi account details (not credentials -- store those in a password manager)
  • Network names, regions, RADIUS server IPs, ports
  • Group names and their attribute configurations
  • Captive portal settings, splash page URLs, walled garden entries
  • Access point RADIUS configuration for each site
  • Identity provider integration details (Connector or SCIM settings)

Store this document securely and update it whenever configuration changes are made.

Testing Your HA Setup

Failover test procedure

  1. Connect a test device to your WiFi network and verify authentication succeeds
  2. On your access point, temporarily block the primary RADIUS server IP using a firewall rule
  3. Disconnect and reconnect the test device
  4. Verify authentication succeeds via the backup RADIUS server
  5. Check IronWiFi Reports to confirm the authentication went through the backup
  6. Remove the firewall rule and verify the AP returns to the primary server

Monitoring during tests

  • Watch the IronWiFi Console for authentication events during failover
  • Measure the time between blocking the primary and successful backup authentication
  • Verify that RADIUS accounting continues to function during failover

Test schedule

TestFrequencyWho
AP failover (primary to backup)QuarterlyNetwork team
Cross-region failoverSemi-annuallyNetwork team
RADIUS caching (disconnect from cloud)QuarterlyNetwork team
Full DR drill (simulate platform outage)AnnuallyIT operations
Backup restore testSemi-annuallyIT operations

Incident Communication

During an outage:

  1. Check status.ironwifi.com for platform-level incidents
  2. Subscribe to status page notifications for real-time updates
  3. Contact IronWiFi support via live chat at ironwifi.com or email support@ironwifi.com
  4. Communicate with affected users through your internal channels

Was this page helpful?