ia02 - system redundancy

Redundancy

• What does Redundancy mean to you?

• Definition: – the quality or state of being redundant – serving as a duplicate for preventing failure of an entire system (as a spacecraft) upon

failure of a single component

• Why is redundancy needed?– “My plant runs 24x7x365”

Can affect overall financial commitment and engineering development– “Zero Downtime”– “Zero Data Loss”

Source: Merriam-Webster

Different Redundancy Solutions Exist• Network Media Redundancy

– “What if my network cable gets cut ?”– “What if my network card fails?”

• Controller Redundancy– “What if my controller has a hiccup ?”– DE10: High Availability Control Systems

• Computer Hardware Redundancy– “What if my computer fails ?”– “What if some components of the PC fail?”

• Software Application Redundancy– “What if the software product faults ?”– Native feature of the product such as RSViewSE & FactoryTalk Data servers– Implemented with features built into the product. Example - A8934 - Redundancy Method Using

Cooperating RSSql Applications and A9067 - RSSql Redundancy Method Using PLC Logic• User-project “Redundancy”

– Specific checks/safeguards built into the project by the user

Which Solution Should I Use ?• May vary on a case-by-case basis

• Some applications may require several redundancy solutions used together to provide system-wide protection

– Beware of statements like “Zero Downtime” or “Zero Data Loss” These are virtually impossible to deliver How much $$ are you willing to spend

• Clearly understand:– What each solution was designed to protect against– The cost associated with a given solution– Application considerations associated with a given solution

What is important to the solution? Does this solution solve the problem?

• By the end of this session, you should have a better understanding of Rockwell’s redundancy solutions.

Redundancy Solutions OverviewRedundancy Solutions Overview

The Architecture

IA02 System Redundancy Architecture - Logical

Secondary RSSql

Terminal Server 2

Secondary HMI/Data

Primary HMI/Data

Primary RSSql

Terminal Server 1

Factory Talk Directory

Student Clients

Redundant CLXPrimary ENet

Secondary Enet

Redundant ControlNet

Student Clients

IA02 System Redundancy Architecture – Physical

Primary HMI/Data

Primary RSSql

Terminal Server 1

Secondary RSSql

Terminal Server 2

Secondary HMI/Data

Student Clients

•Microsoft SQL Server *•RSBizWare Historian

•Factory Talk Directory

Redundant CLX

Redundant ControlNet

Fiber Optic Ring

* Location for demo purposes only

Network Media RedundancyNetwork Media Redundancy

Redundant Switches and Network Cards

Network Media Redundancy• What is it ?

– Automatic switching of physical media such that the network automatically switches to a different cable path in the event of a problem

• How does it work ?– The network transceivers perform a “diagnostic” of each cable path to determine the

“best” physical cable to use– This is transparent to controllers, computers, software, user project, etc.

Its all handled by the network.– “Spanning Tree” – Hirschmann HIPER-Ring

• Redundant topology in that it provides network redundancy instead of just path redundancy while preventing loops in a network.

• For Ethernet to function properly only one active path can exist between devices.• To provide redundancy, Spanning Tree relies on having multiple paths or

connections to different switches and configures some of these paths into standby (Blocked) state.

• If a network segment becomes unreachable, spanning tree reconfigures and reestablishes link by activating the "Blocked" links.

• IEEE standardized (Most existing company IT architectures)• Demonstrates importance of managed switches instead of “home grown” networks

Media Redundancy Types

Spanning Tree / Rapid Spanning Tree

HIPER-Ring

• Typically used when downtime is critical• Available in all Hirschmann Managed Switches• Up to 50 switches in a ring supported• Maximum reconfiguration time of 300mS – reducing downtime• No software required to configure – just set DIP switch on 1 switch in ring

(Redundancy Manager)

500 ms maximum network “downtime” with 100BASE ring

50 ms maximum network “downtime” with 100BASE ring

(up to 50 switches in ring and 4,000 connected MAC addresses)

ToPLC 7

From PC 10

RM: ON (ACTIVE)RM: OFF RM: ON (STANDBY)RM: OFF

P1 P2 P1 P2 P1 P2

P3

PC 10

P3

PLC 3

P3

PLC 7

ToPLC 7

From PC 10

ToPLC 7

From PC 10

ToPLC 7

From PC 10

ToPLC 7

From PC 10

Redundant Connection

HIPER-Ring Redundancy

ToPLC 7

From PC 10

ToPLC 7

From PC 10

ToPLC 7

From PC 10

SW 1 Address TablePORT 1 PORT 2 PORT 3

PLC 3 - PC 10PLC 7 - -


- PLC 7 PLC 3- PC 10 -


PLC 3 PC 10 PLC 7- - -

HIPER-Ring(Reconfiguration < 0,5sec.)

HIPER-Ring(Reconfiguration < 0,5sec.)

Media Redundancy – Combinations

Redundant Link(Spanning Tree)

Spanning Tree / Rapid Spanning Tree

HIPER-Ring(Hirschmann Only)

Hirschmann Switches workIn Spanning Tree, Rapid Spanning Tree, And HIPER-Ringnetwork architectures

Network Media Redundancy (con’t.)• When should I use it ?

– To protect against media failures• What products supports it ?

– ControlNet 1756-CNBR, 1786-PCICS

– Ethernet Thru the use of Hirschmann switches Encompass Partner

• Instructor– Use HIDiscovery to show Hirschmann configuration software samples– Disconnect a fiber-optic connection– Clients continue to get data and successfully navigate screens– Reconnect the fiber-optic cable

• Questions?

Network Interface Card Redundancy• What is it?

– Automatic switching of NIC such that the network communications automatically switches to a different hardware component of the same PC in the event of a failure

• How does it work?– Typically, software ‘utility pack’ is used to ‘team’ a pair (or more) of NIC’s to appear as a single IP

address to the rest of the network. Should any NIC fail, the rest of the team carries the load.

Controller RedundancyController Redundancy

Redundant CLX with SRM Module

Controller Redundancy• What is it ?

– Duplicate chassis hosting controllers and communications modules such that if one controller faults, the other controller takes over.

– System is “bumpless” from the standpoint of I/O – no uncontrolled I/O states– DE10: High Availability Control Systems– System may or may not be bumpless from a supervisory/HMI perspective – temporary

loss of communications may exist depending upon media type.• How does it work ?

– Controller pairs sync their program scans, and data from supervisory systems, program edits, etc. written to the primary controllers are automatically cross-loaded to the secondary controllers Note: The bandwidth and memory required for successful synchronization should

be taken into account when estimating communications throughput.– Communications cards in the primary & secondary chassis automatically “swap” node

addresses so that the primary & secondary chassis remain at the same node addresses Times for communications to be re-established after a node swap vary by network

type, system loading, etc.

RSLogix 5000 Configuration• Configuration check box option within RSLogix 5000

Recipe Display• Students to return back to the “Welcome” display (press buttons or F3 key) and then open the

Recipe Demo display• Instructor to download values to the registers with a ‘full control’ client• Students verify they see the downloaded values & navigate back to the previous displays (F3)

as instructed in the beginning of the lab based on seat location

1 person in each row to connect in a different manner:“Rich” client on EtherNet“Rich” client on ControlNet

“Thin” client on EtherNet“Thin” client on ControlNet

Computer Hardware RedundancyComputer Hardware Redundancy

Marathon Endurance

Computer Hardware Redundancy• What is it ?

– Automatic switching of PC hardware devices such that a failure of the device does not interrupt the O/S nor applications running on the PC.

Motherboard Hard Drive Network Card

• How does it work ?– Performs similarly to Controller Redundancy– The PC performs self checks of system components.– If a component fails, the system switches over to use the secondary system component provided

by the other co-server. – This is transparent to the application software, networks, etc.

• Software Faults?– Does not protect against faulty code, “hang ups”, or software “glitches”– If system gets out of synch, it is possible to have to start from scratch in order to rebuild

Computer Hardware Redundancy (con’t.)• When should I use it ?

– To protect against PC hardware failures• What products supports it ?

– RAID– Clustering

Although not supported by RSI products, it can be used for database components of a system– Marathon Technologies Endurance system

Although not currently supported by RSI products, it is being reviewed for platform support in the very near future…so stay tuned

– How is it configured? A pair of hardware ‘co-server’ systems share a ‘virtual’ system that is synchronized via a 1

Gb/s Ethernet backbone Requires Server class hardware

Marathon Configuration

Co-Server 1

Co-Server 2

Virtual ServerVirtual Server

• 2 Co-Server PC’s host a virtual server

• This is the most costly example. Can be used with less network connections

Computer Hardware Redundancy (con’t.)• Hands-On:

– RSSql configuration running within the synchronized ‘virtual’ server– Demonstrate the GUI provided by Marathon– Disable the network card on a co-server– Co-server #2 detects the network card failure and uses its component– RSSql configuration remains running and inserts data into the database– Students notice the Virtual Server Manager indicates the component failure inside the provided GUI

Software Application RedundancySoftware Application Redundancy

RSView SE & RSLinx Enterprise

Software Application Redundancy• What is it ?

– Automatic failover from a software application running on one computer (primary) to an identical software application running on another computer (secondary) should the primary software application fail

• How does it work ?– The health of both primary & secondary software application is checked, and client-side

applications will automatically switch to the secondary server-side application should the primary server-side application become unavailable

Example – RSView SE & FactoryTalk Data Server allow redundancy configuration

Software Application Redundancy (con’t.)• Hands-On:

– Please navigate to an HMI display 1 person in each row to connect in a different manner

“Rich” client on EtherNet connect to Alarm Summary “Rich” client on ControlNet connect to a Segment display “Thin” client on EtherNet connect to a Segment display “Thin” client on ControlNet connect to Alarm Summary

– Using examples from the RSView SE Design Guide, Fail the primary HMI/Data Server by disconnecting the network cables and shutting down

power– Please note the behavior of the system and compare with your neighbor’s system

Alarm States are synchronized between primary & secondary HMI Servers

Summary• Many forms of redundancy exist

– Each were created to solve a specific application need– Be sure to use the correct redundancy solution for the application– Multiple solutions may be used concurrently based on application needs

• Redundancy is dependent upon software and hardware solutions working in tandem• Many times the software is a messenger to problems with hardware• Redundancy is not a method to ‘cover up’ poor application implementation

Questions ?

• G102753810 – RSView SE 3.20 Distributed System Design Considerations• OP07 – RSView SE Distributed Design Considerations• GN03 – FactoryTalk Distributed Design Considerations

• Thanks for attending IA02 - Visualization Redundancy for Real World Applications at RSTechEd 2005

• Please tidy up your area, complete the survey, and have a nice evening!

ia02 - system redundancy

Documents

network redundancy

controller redundancy

hiperring redundancy

path cost

ring redundancy manager

network cable

active plc

network transceivers