Skip to main content
Version: 8.1

Ignition Redundancy

Inductive University

How Redundancy Works

Watch the video

Ignition redundancy supports a 2-node system, meaning there are two copies of the Gateway running. One node is the Master Gateway and the other is the Backup Gateway or backup node. All projects, Gateway settings, etc., are shared between nodes. The master node manages the configuration then replicates it to the backup node.

How to Update Redundant Servers

Refer to the Updating or Patching a Redundant Ignition Pair Knowledge Base Article to learn more about updating redundant servers and how to make the process a success.

" "

When you have redundant systems in place, you can get detailed status information by going to Gateway webpage and selecting Status > Redundancy to view the system's status and events.

" "

Redundancy Status Page Overview

New in 8.1.21
The Redundancy Status page was overhauled in 8.1.21. This section breaks down the Redundancy Status Page, describing each feature.

Metrics and Information

" "

The top section of the Redundancy Status page gives information about the Gateway and your redundancy setup, including:

  • Redundancy configuration settings
  • The current Gateway's role in the Redundant pair
  • If the Gateway has a peer connected
  • The current node's uptime after failover
  • Current redundancy settings
  • Force re-sync and failover options

The following image is the Redundancy Status page from the Master Gateway's perspective. Many features of the Redundancy Status page remain the same, with a few notable exceptions.

Redundancy Status Page Overview - Master

  • The Role is different, reflecting the Gateway's own role in the redundant pair.
  • There is a Config Updates Disk Cache section, which can be adjusted to display information for different times using the date range slider. This section will only show on the Master Gateway, as you cannot make config updates on the Backup Gateway.

Redundancy Providers Statistics

" "

The next section of the Redundancy Status page details metrics about applicable redundancy providers.

Data that is presented include:

  • The name of the provider
  • When the provider was last pulled
  • The latest time duration (or how long it takes) for the Gateway to get data from the provider
  • The latest time duration (or how long it takes) for the Gateway to apply the data received from the provider
  • Whether a full sync is needed
  • Whether a system restart is needed

System Event Information

" "

The third section of the Redundancy Status page displays a table that will log system events whenever a full sync is required, helping to establish a timeline of when full sync events were requested.

Information displayed in this table includes:

  • How severe the system event was/is
  • When the system event occurred
  • The reason for the system event

Logging Activity

" "

The final section of the Redundancy Status page shows logger activity and allows users to enable DEBUG and TRACE logs for a specific redundancy provider.

Features of the log activity table include:

  • Minimum logging level. Options are:
    • INFO
    • DEBUG
    • TRACE
  • An option to merge logs to the main diagnostic log viewer
  • The specified logger
  • The log's timestamp
  • The issue being logged

Node Communication

The master and backup nodes communicate over TCP/IP. Therefore, they must be able to see each other over the network, through any firewalls that might be in place. All communication goes from the backup to the master node over the gateway network (default port 8088 without SSL, port 8060 with SSL). Therefore, that port must allow TCP listening on the master machine.

Configuration Synchronization

The master node maintains the official version of the system configuration. You must make all changes to the system on the master Gateway, the backup Gateway does not allow you to edit properties. Similarly, the Designer only connects to the master node.

When changes are made on the master, they are queued up to be sent to the backup node. When the backup connects, it retrieves these updates, or downloads a full system backup if it is too far out of date.

If the master node has modules that aren't present on the backup, they are sent across. Both types of backup transfers, data only and full, will trigger the Gateway to perform a soft reboot.

Runtime State Synchronization

Information that is only relevant to the running state, such as current alarm states, is shared between nodes on a differential basis so that the backup can take over with the same state that the master had.

On first connection or if the backup node falls too far out of sync, a full state transfer is performed. This information is light-weight and does not trigger a Gateway restart.

New in 8.1.19
After the Master Gateway and Backup Gateway reestablishes a redundancy connection, the Backup Gateway will check if it has any conflicting data compared to the Master Gateway's data. If the Backup Gateway has instances of conflicting data, the Backup Gateway will drop those instances in favor of the Master Gateway's data.

Changed in 8.1.31
The above behavior changes depending on if the Use Active Uptime to Resolve Conflicts property is toggled. If this property is toggled, the redundancy system will compare the Master and Backup nodes' active uptimes to each other, and use the data from the node with higher active uptime. This behavior can result in the Master overwriting its own data with data from an active Backup node.

Status Monitoring

Once connected, the nodes begin monitoring each other for liveliness and configuration changes. While the master is up, the backup runs according to the stand by activity level in the settings.

When the master cannot be contacted by the backup for the specified amount of time, it is determined to be down and the backup assumes responsibility. When the master becomes available again, responsibility is dictated by the recovery mode and the master either takes over immediately or waits for user interaction.

Historical Logging

Historical data presents a unique challenge when working with redundancy because it is never possible for the backup node to know whether the master is truly down or simply unreachable. If the master was running, but unreachable due to a network failure, the backup node becomes active and begins to log history at the same time as the master, who is still active.

In some cases this is OK because the immediate availability of the data is more important than the fact that duplicate entries are logged. But in other cases, it's desirable to avoid duplicates, even at the cost of not having the data available until information about the master state is available.

Ignition redundancy provides for both of these cases, with the backup history level, which can be either Partial or Full.

  • In Full mode, the backup node logs data directly to the database.
  • In Partial mode, however, all historical data is cached until a connection is reestablished with the master. At that time, the backup and master communicate about the uptime of the master, and only the data that was collected while the master was truly down is forwarded to the database.

Client Failover

Changed in 8.1.17
Failover to the other redundant node is now allowed if the nodes have different platform versions, which will allow attached clients to remain connected to at least one node during a redundant pair upgrade.

Vision Clients

All Vision clients connect to the active node. When this system fails and is no longer available, they automatically re-target to the other node. The reconnection and session establishment procedures are handled automatically, but the user is notified that they have been transferred to a different node so that they can notify the system administrator that the system may need attention.

Perspective Sessions

Like Vision clients, Perspective sessions connect to the active node. When connection to the active node is lost, or the activity level of the Gateway changes from active, the session will simultaneously attempt to:

  1. Re-establish the connection to the Gateway it was connected to, and check to make sure its activity level is active.

  2. Monitor the backup Gateway. If the backup Gateway becomes reachable and active before the connection to the active Gateway can be re-established, the Perspective session navigates in the browser to the same project and page on the backup Gateway.