hp nonstop –r p m real-time process monitor product overview

20
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP NonStop – R P M Real-time Process Monitor Product Overview HP NonStop Enterprise Division [email protected]

Upload: others

Post on 05-Jan-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

© 2008 Hewlett-Packard Development Company, L.P.The information contained herein is subject to change without notice

HP NonStop – R P MReal-time Process Monitor Product Overview

HP NonStop Enterprise Division

[email protected]

Page 2

RPM Overview – What is RPM?

•RPM - Real-time Process Monitor

• New Low-cost Point-product from HP NonStop

• Provides ultra-fast discovery and display of busiest Cpus and Processes

−By Cpu

−By Node

−By selected sets of Nodes

−Across entire cluster of NonStop nodes

Page 3

RPM Benefits

• Ultra-fast discovery of Cpu and Process busy activity by Cpu, Node, or Clusters of nodes

• Continuously finds & shows busiest Cpus & Processes

• Simple, fast startup and display in less than 1 second with low overhead, eg TACL 1 > RPM

•RPM provides user defined color-encoded busy alerts−Critical - Red

−Warning – Yellow

− Informative – Blue

Page 4

RPM Applicability• RPM was developed, evolved, and refined over multi-year period in HP Labs• Applicable regardless of what products or applications currently used• Runs on multiple nodes, multiple operating-systems, and multiple device-types• Supports

− J-, H-, G-, D-series - Operating Systems − Runs in mixed-version, multi-node, mixed-hardware networks

• Monitors many Cpus and/or Nodes at once

− Built based on long-term development requirement to understand real-time dynamics of software operating in large clusters of disparate NonStop servers

− Provides fast, real-time, super-cluster monitoring capabilities

• Customizable− RPM can support small, medium, and super-scalar display types:− From small 80x24 x16-Cpu displays− To medium 132x54 x64-Cpu displays− To giant 200x300 x1024-Cpu displays

• Device Types supported− TTY, T6530, VT100, ANSI− Disk file output− Process file output

Page 5

RPM General Features

• Understands both NSK and OSS file/path names

• Install Wizard provides fast setup and configuration eliminates startup learning curve

• Supports many different device types including:ANSI, VT100, T6530, TTY, Disk, and Process−Understands T6530 devices for support of legacy

displays including 24x80 and 54x132 displays

−Understands ANSI/VT100 Super-Sized200x300 character displays which are native (and free) on all Windows and Linux boxes

−Supports generic TTY, Disk, Process I/O allowing smart client front-ends and historical stats archival

Page 6

RPM Analysis Features

• Real-time - analysis-and-display of Cpu and Process busy activity by Cpu, node, selected nodes, or entire cluster.

• ByCpu – displays busiest processes in one or more Cpus

• ByNode - displays busiest processes in one or more nodes

• ADD node - commands allow multiple node analysis-display

• Results – are sorted, filtered, and color-encoded in real-time

• Statistics - synchronized across Cpus, nodes, and users(does NOT require Measure)

• Command line – configurable, 1 second response, example:− TACL 1 > RPM P\*, Rate 5, Entries 10, ByNode

− Extensive options provide wide-range of analysis-display capabilities

− Examples follow...

Page 7

RPM Example #1 – Monitor all processes in one Cpu

• PB 3, ENTRIES 10, RATE 5

− PB 3 – displays busiest processes in Cpu 3

− ENTRIES 10 – displays the 10 busiest processes

− RATE 5 – updates every 5 seconds ( can sample down to 1 sec )

• Commands can be abbreviated, commas optional, for example: P3 E10 R5

TACL 1 > RPM PB 3, ENTRIES 10, RATE 5

Process Cpu,Pin Busy% Name RPM Programs ET= 5.0 Top Pri User

-------- ------- ----- -------- -------------------------- --- --- -------

\A 3,73 87.82 $STEP $DATA.DATABASE.QUERY 1 1 66,1

11:32:10 3,320 10.13 $QAZ08 $DATA.DATABASE.QADRV1T 2 168 255,34

3,263 1.20 $HIT $SYSTEM.SYS03.TSYSDP2 3 220 255,255

3,319 .15 $QAZ07 $RPM.RPMQA.QADRV1T 4 168 255,34

3,314 .12 $QAZ06 $RPM.RPMQA.QADRV1T 5 168 255,34

3,175 .09 $ZOOH3 $DATA2.R0403V02.ASAPXMON 6 189 255,255

3,0 .06 $MON $SYSTEM.SYS03.OSIMAGE 7 201 255,255

3,192 .04 $X11W $SYSTEM.SYSTEM.RPMVT 8 168 66,1

3,43 .02 $ZOOM3 $SYSTEM.SYSTEM.ASAPMON 9 160 255,255

3,312 .02 $QAZ04 $RPM.RPMQA.QADRV1T 10 168 255,34

Page 8

RPM Example #2 – All processes in a node ByNode

• PB *, ENTRIES 10, RATE 5, ByNode

− PB * – displays Process Busy for all Cpus in a node

− ENTRIES 10 – displays the 10 busiest processes

− RATE 5 – updates every 5 seconds ( can sample down to 1 sec )

− ByNode – displays busiest processes for all Cpus in node & sorts by node

TACL 1 > RPM PB *, ENTRIES 10, RATE 5, ByNode

Process Cpu,Pin Busy% Name RPM Programs ET= 5.0 Top Pri User-------- ------- ----- -------- -------------------------- --- --- -------\A 3,73 66.24 $STEP $DATA.DATABASE.DRIVER 1 1 66,111:59:05 0,41 11.25 $ZEXP $DATA.DATABASE.QUERY 2 170 255,255

1,159 1.82 $Y7Q6 $SYSTEM.SYS03.NSKCOM 3 159 255,2551,69 1.70 $ZOO $DATA.DATABASE.LISTREC 4 160 255,2552,271 .58 $HIT $SYSTEM.SYS03.TSYSDP2 5 220 255,2552,279 .52 $HIT $SYSTEM.SYS03.TSYSDP2 6 220 255,2552,28 .18 $X90N $SYSTEM.SYSTEM.RPM 7 168 66,11,153 .31 $DATA1 $SYSTEM.SYS03.TSYSDP2 8 167 66,13,263 .31 $HIT $SYSTEM.SYS03.TSYSDP2 9 220 255,2550,0 .30 $MON $SYSTEM.SYS03.OSIMAGE 10 201 255,255

Page 9

RPM Example #3 – All processes in a node ByCpu

• PB *, Entries 5, Rate 5, ByCpu

− PB * – displays Process Busy for all processes in all Cpus in this node− ENTRIES 5 – displays the 5 busiest processes in each Cpu− RATE 5 – updates every 5 seconds – integrate stats over that interval− ByCpu – displays busy processes grouped by Cpu

Process Cpu,Pin Busy% Name RPM T0877(15MAY08) ET= 5.0 Top Pri User

-------- ------- ----- -------- -------------------------- --- --- -------

\SANFRAN 0,177 71.17 $Y567 $DATA.DATABASE.DRIVER 1 200 255,255

11:30:20 0,1021 14.24 $Y568 $DATA.DATABASE.QUERY 2 201 255,255

0,256 5.12 $T569 $DATA.DATABASE.LISTREC 3 205 255,255

0,257 .10 $TST1 $DATA.DATABASE.QUERY 4 220 255,255

0,15 .08 $NCP $SYSTEM.SYS03.NCPOBJ 5 199 255,255

\SANFRAN 1,182 11.32 $Z121 $DATA1.ANALYSIS.TOOL 1 168 255,255

11:30:20 1,47 5.31 $Y3AL $DATA1.ANALYSIS.QUERY 2 168 66,1

1,14 .23 $NCP $SYSTEM.SYS03.NCPOBJ 3 199 255,255

1,0 .16 $MON $SYSTEM.SYS03.OSIMAGE 4 201 255,255

1,87 .13 $ZOOS $SYSTEM.SYSTEM.RPM 5 160 255,255

\SANFRAN 2,271 1.08 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255

11:30:20 2,178 .31 $Z147 $SYSTEM.SYSTEM.TEDIT 2 160 255,255

2,0 .04 $MON $SYSTEM.SYS03.OSIMAGE 3 201 255,255

2,303 .02 $TT54 $SYSTEM.SAPP.DRV 4 80 255,255

2,275 .01 $TT62 $SYSTEM.SAPP.DRV 5 80 255,255

\SANFRAN 3,324 3.37 $QAZ08 $DATA1.DATABASE.QADRV1T 1 168 255,34

11:30:20 3,263 .28 $HIT $SYSTEM.SYS03.TSYSDP2 2 220 255,255

3,323 .18 $QAZ07 $ASAP.ASAPXQA.QADRV1T 3 168 255,34

3,322 .12 $QAZ06 $ASAP.ASAPXQA.QADRV1T 4 168 255,34

3,0 .08 $MON $SYSTEM.SYS03.OSIMAGE 5 201 255,255

ByCpu

- Used for observing load across Cpus

- Visual persistenceallows viewing ofchanging load

Page 10

RPM Example #4 – All processes on all nodes

• ADD \Node – Adds as many \nodes as you want in RPM analysis and display

• PB \*, Entries 5, Rate 5

− PB \* – \* indicates analysis and display is for all ADD-ed Nodes

− ENTRIES 5 – displays the 5 busiest processes in each Node

− RATE 5 – updates every 5 seconds – RPM can sample down to 1 second

TACL 1 > RPM PB \*, ENTRIES 5, RATE 5

Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08)

+ADD \A

+ADD \B

+ADD \CProcess Cpu,Pin Busy% Name RPM Programs ET= 5.0 Top Pri User

-------- ------- ----- -------- -------------------------- --- --- -------

\A 3,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1

12:07:05 1,134 51.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255

1,175 19.19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1

1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1

1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1

\B 2,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255

12:07:05 1,254 2.44 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1

1,153 .43 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1

1,74 .18 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1

3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34

\C 0,290 3.76 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255

12:07:05 1,195 2.48 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255

1,0 1.82 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255

0,0 .55 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255

0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1

Page 11

RPM Example #5 – Composite Cpu + Process reports

• ZOOM \* – Displays both Cpu and Process stats in same display

TACL 1 > RPM ZOOM \*, ENTRIES 5, RATE 5Realtime Process Monitor (RPM) - T0877V01.00 - (15APR08)

+ADD \A

+ADD \B

+ADD \CCpus Cp hh:mm ss Busy Secs QLen Disp Disk Chit Swap MLock% Pcb PcbX-------- -- -------- ---- ---- ---- ----- ----- ----- ---- ------ --- ----\A 0 11:30:45 78 5 0 406 0 4 0 8.00 81 107

1 11:30:45 15 5 0 346 0 0 0 4.29 84 1062 11:30:45 0 5 0 176 0 40 0 6.73 28 593 11:30:45 0 5 0 28 0 0 1 3.54 30 66

\B 0 11:30:45 9 5 0 145 0 15 4 12.51 37 961 11:30:45 0 5 0 114 0 0 0 12.32 48 922 11:30:45 0 5 0 85 0 0 0 8.32 18 413 11:30:45 0 5 0 84 0 0 0 8.33 21 39

\C 0 11:30:45 4 5 0 131 0 4 0 8.01 33 971 11:30:45 0 5 0 29 0 0 1 7.65 50 84

Process Cpu,Pin Busy% Name RPM Programs ET= 5.0 Top Pri User

-------- ------- ----- -------- -------------------------- --- --- -------

\A 0,35 74.63 $SPI1 $DATA.DATABASE.DRIVER 1 1 66,1

11:30:45 1,134 11.82 $ZYOB $DATA.DATABASE.QUERY 2 160 255,255

1,175 .19 $ZERX $DATA.DATBASE.LISTREC 3 167 66,1

1,42 .18 $Z23V $SYSTEM.SYSTEM.DRIVE 4 166 66,1

1,249 .17 $Z244 $SYSTEM.SYSTEM.TEST 5 166 66,1

\B 0,271 4.83 $HIT $SYSTEM.SYS03.TSYSDP2 1 220 255,255

11:30:45 1,254 .34 $Z08N $SYSTEM.SYSTEM.TEST 2 167 66,1

1,153 .13 $X11Z $SYSTEM.SYSTEM.DRIVE 3 167 66,1

1,74 .08 $RPMX $SYSTEM.SYSTEM.SEEVIEW 4 168 66,1

3,320 .28 $QAZ08 $ASAP.ASAPXQA.QADRV1T 5 168 255,34

\C 0,290 2.79 $SQL $SYSTEM.SYS00.TSYSDP2 1 220 255,255

11:30:45 1,195 .18 $ZOOL $SYSTEM.SYSTEM.ASAPFIL 2 160 255,255

1,0 .62 $MON $SYSTEM.SYS00.OSIMAGE 3 201 255,255

0,0 .15 $MON $SYSTEM.SYS00.OSIMAGE 4 201 255,255

0,85 .07 $RPMX $SYSTEM.SYSTEM.SEEVIEW 5 168 66,1

Page 12

RPM Sample - RATE discussion

• RPM - Easily and efficiently supports update rates down to 1 second

• Fast 1 Second rate possible due to low-overhead linear-scalable design

• Default is 10 seconds because rapid time varying data difficult for some people to follow

• When RPM RATE is 10 seconds the digits to right of %Busy decimal point %0.NN

represent Cpu milliseconds used. No other RATE provides this numerical convenience:

Milliseconds SET RATE Percent Cpu Busy

100ms 10 seconds 1.00% Cpu Busy

90ms 10 seconds 0.90% Cpu Busy

80ms 10 seconds 0.80% Cpu Busy

70ms 10 seconds 0.70% Cpu Busy

60ms 10 seconds 0.60% Cpu Busy

... ... ...

Page 13

RPM Installation – Made easy using RPM Wizard

• RPM includes a configuration/install wizard: TACL 1 > RUN RPMWIZ

• Wizard interviews user and builds configuration

− Enter TYPE of device VT100 | T6530 | TTY ?

− Enter RATE interval in seconds 2, 3, 6, [10], .. ?

− Enter ENTRIES to display per Cpu or Node [3], 5, 10, ... ?

− Enter report Type Process | Cpu | Zoom ?

− Enter whether you want to sort by Cpu or [Node] ?

− Enter whether you want USEC time displayed Y/[N] ?

− Enter %BUSY threshold value [0] ?

− Enter INFO Busy% threshold value for INFO [1] ?

− Enter WARN Busy% threshold value for WARN [10] ?

− Enter CRIT Busy% threshold value for CRIT [50] ?

− ADD \sysname ? \A

− ADD \sysname ? \B

− ADD \sysname ? \C

− ADD \sysname ?

This slide shows only partial wizard interview,complete example of a Wizard session is at:

http://NonstopRPM.com/Example-Wizard.htm

Page 14

RPM Installation – Use RPM Wizard

• At End of Interview, the wizard creates RPMCNF configuration file!======================================================

!== RPM Config created by RPMWIZ - 08/02/22 07:20:08

!======================================================

SET TERM VT100

SET RATE 10

SET ENTRIES 10

SET SORT BYNODE

SET USEC OFF

SET CRIT 50

SET WARN 10

SET INFO 1

ADD \CENTDIV

ADD \SOLAR

ADD \WINE

!======================================================

SAVE to $DATA.RPMV1.RPMCNF Y/N ? Y

$DATA.RPMV1.RPMCNF save started.

$DATA.RPMV1.RPMCNF saved.

Alert Thresholds

Page 15

RPM – Interfaces

• RPM Supports wide-variety of device types− T6530

− ANSI/VT100 x Super-Sized

− TTY

− File or Process output

Page 16

RPM – T6530 Device Interface - Example

• RPM supports T6530 Devices and emulators

• Example

− RPM 6530 video high lights >>

− P\*R10 E7Analyzes 3 nodes, shows 7 busiest processes per node

− RATE 10 aka R10 means update every 10 seconds

T6530 – ExampleP \*, ByNode, RATE 10, Entries 7

Page 17

RPM – Super-Size ANSI displays

• RPM ANSI/VT100 device support:

• ANSI represents important class of display with very significant benefits:

− ANSI/VT100 provides ultra-high performance Super-Sized displays.

− No need for emulator on Windows or Linux. VT100 is natively supported on all Windows or Linux computers.

− ANSI/VT100 natively supported on virtually ANY Windows PC. From Windows just enter TELNET <ip-addr>

• Example

− C:\> TELNET <IP-address/DNS-name>

− Supports 200 line x 300 char displays

− Allows viewing busiest processes on 1000s of Cpus or 100s of Nodes in single screen

− RPM example to right shows busiest Cpus and Processes across many Cpus/Nodes

• How to create Windows TELNET shortcuts for NonStop servers see:http://NonstopRPM.com/Faqs.htm

Page 18

RPM – TTY mode Interface - Example

• RPM supports TTY mode allowing output to generic devices, processes, and files

• RPM USEC option displays Time-of-Dayin Micro-Seconds− Nodes in TimeSync− Wine in different Zone− 100ms delta is I/O time

• RPM supports both NSK and OSSfile and path names

RPM TTY Mode - ExampleP \* TTY, ByNode, RATE 10, Entries 15, USEC

Page 19

RPM Web ResourcesRPM Web Resources

−HP NonStop RPM - Technical Portal

http://NonstopRPM.com/

− RPM - Examples http://NonstopRPM.com/Examples.htm

− RPM FAQS - Frequently Asked Questionshttp://NonstopRPM.com/Faqs.htm

− RPM Documentshttp://NonstopRPM.com/Document.htm

− RPM Downloadshttp://NonstopRPM.com/Download.htm

Page 20

http://NonstopRPM.com

• HP Technology Forum 2008