VCS Troubleshooting Job Aid


Upload: rbolanoso

Post on 04-Sep-2015


  • Cluster Communications

    Service Groups and Resources

    hastatus -sum

    o Cannot connect . . .

    o Systems in wait state

    o Resources offline

    gabconfig -a

    o No port a: GAB problem

    o No port h: HAD problem

    lltconfig -a

    o LLT not running: LLT problem
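    The port check above can be scripted. The following is a minimal sketch, assuming `gabconfig -a` prints one line per registered port that begins with the word "Port" followed by the port letter; the function name `missing_gab_ports` is hypothetical and not part of VCS.

```shell
# Report which of the two ports named in the job aid are absent:
# port a (GAB) and port h (HAD).
# Reads `gabconfig -a` output on stdin and prints the missing port letters.
# An empty line means both ports are present.
missing_gab_ports() {
    awk '
        $1 == "Port" { seen[$2] = 1 }
        END {
            out = ""
            if (!("a" in seen)) out = out "a"
            if (!("h" in seen)) out = out "h"
            print out
        }
    '
}

# Typical use on a cluster node (requires VCS to be installed):
#   gabconfig -a | missing_gab_ports
```

    A result of "a" or "ah" points to the GAB checks under Startup; a result of "h" alone points to the HAD checks.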

    Service group cannot come online

      hagrp -display group

    o Check AutoStart and AutoStartList attributes

    o Check AutoDisabled attribute

    WARNING! See doc before clearing

    o Reprobe unprobed resources
      hares -probe res -sys sys

    o Unfreeze frozen service groups
      hagrp -unfreeze group [-persistent]

    o Take service group offline elsewhere and flush
      hagrp -offline group -sys sys
      hagrp -flush group -sys sys
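    When working through this checklist for a specific group, it can help to print the exact commands first and review them before running anything. The sketch below only echoes the commands; the function name `group_recovery_plan` and the example names in the usage line are hypothetical.

```shell
# Dry-run helper: prints the checklist commands for one group and system.
# Nothing is executed; review the output and run the lines by hand as needed.
group_recovery_plan() {
    group=$1   # service group name (placeholder)
    sys=$2     # system where the group should come online (placeholder)
    res=$3     # optional: an unprobed resource in the group
    echo "hagrp -display $group"
    if [ -n "$res" ]; then
        echo "hares -probe $res -sys $sys"
    fi
    echo "hagrp -unfreeze $group"
    echo "hagrp -flush $group -sys $sys"
}

# Example: group_recovery_plan websg sys1 webip
```

    The unfreeze line applies only if the group is actually frozen, and clearing AutoDisabled is deliberately left out because the job aid says to consult the documentation first.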

    ArgList corrupted in types.cf

      hatype -display res_type

    o Stop VCS on all systems
      hastop -all -force

    o Fix or replace types.cf
      cd /etc/VRTSvcs/conf/config
      cp types.cf.previous types.cf

    o Restart VCS on all systems
      hastart (each system)
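    The replace step can be wrapped so that the damaged file is kept for later inspection. This is a sketch under the assumption that a usable types.cf.previous exists in the configuration directory; the function name `restore_types_cf` and the `.corrupt` suffix are arbitrary choices, not VCS conventions.

```shell
# Replace types.cf with types.cf.previous, keeping the damaged file for review.
# $1 = configuration directory (defaults to the path given in the job aid).
# Run this only after VCS has been stopped on all systems.
restore_types_cf() {
    conf_dir=${1:-/etc/VRTSvcs/conf/config}
    if [ ! -f "$conf_dir/types.cf.previous" ]; then
        echo "no types.cf.previous in $conf_dir" >&2
        return 1
    fi
    # Keep the suspect file; the .corrupt suffix is an arbitrary choice.
    if [ -f "$conf_dir/types.cf" ]; then
        cp "$conf_dir/types.cf" "$conf_dir/types.cf.corrupt" || return 1
    fi
    cp "$conf_dir/types.cf.previous" "$conf_dir/types.cf"
}

# Example: restore_types_cf /etc/VRTSvcs/conf/config
```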

    Inconsistent system name

      uname -a

    o Correct any mismatched names in llttab, llthosts, sysname, main.cf
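    A quick way to spot a mismatch is to compare the node name against the sysname file and the LLT host table. The sketch below assumes the sysname file holds a single name and that each llthosts line has the form "node-id name"; the function name `check_node_name` is hypothetical, and the file locations in the usage line are the usual defaults, which may differ on a given installation.

```shell
# Compare a node name with the VCS sysname file and the LLT host table.
# $1 = node name to check, $2 = sysname file, $3 = llthosts file.
# Prints one line per mismatch and returns non-zero if any is found.
check_node_name() {
    node=$1; sysname_file=$2; llthosts_file=$3
    status=0
    if [ "$(cat "$sysname_file")" != "$node" ]; then
        echo "sysname mismatch: $(cat "$sysname_file")"
        status=1
    fi
    # llthosts lines are "<node-id> <name>"; look for the name in column 2.
    if ! awk -v n="$node" '$2 == n { found = 1 } END { exit !found }' "$llthosts_file"; then
        echo "$node not listed in $llthosts_file"
        status=1
    fi
    return $status
}

# Example: check_node_name "$(uname -n)" /etc/VRTSvcs/conf/sysname /etc/llthosts
```

    The set-node entry in llttab and the system names in main.cf still need to be checked by hand.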

    Startup

    GAB problem

    o Check seed number in /etc/gabtab

    o Start GAB:
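    The seed-number check can be sketched as a comparison between the seed in gabtab and the number of nodes listed in llthosts. This assumes the gabtab file contains a gabconfig line carrying the seed as `-nN` (for example `-n2`), which is typical but should be confirmed on the system in question; the function name `check_gab_seed` is hypothetical.

```shell
# Compare the seed number in gabtab with the number of nodes in llthosts.
# $1 = gabtab path, $2 = llthosts path (normally /etc/gabtab and /etc/llthosts).
# Returns non-zero if no seed is found or the seed exceeds the node count.
check_gab_seed() {
    # Assumes a gabconfig line with -nN, e.g. "/sbin/gabconfig -c -n2".
    seed=$(sed -n 's/.*gabconfig.*-n[ ]*\([0-9][0-9]*\).*/\1/p' "$1" | head -n 1)
    # Count llthosts lines that start with a numeric node ID.
    nodes=$(awk '$1 ~ /^[0-9]+$/ && NF >= 2 { c++ } END { print c + 0 }' "$2")
    echo "seed=${seed:-none} nodes=$nodes"
    [ -n "$seed" ] && [ "$seed" -le "$nodes" ]
}

# Example: check_gab_seed /etc/gabtab /etc/llthosts
```

    A seed larger than the number of configured nodes means the cluster cannot seed on its own, which is one reason port a may be missing.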

    HAD problem

    o Verify main.cf syntax
      hacf -verify conf_dir

    o Check had/hashadow processes
      ps -ef | grep ha
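    A bare `grep ha` also matches unrelated processes whose names contain "ha". The sketch below is a stricter variant, assuming `ps -e -o comm=` is available and prints one command name (or path) per line; the function name `check_had_processes` is hypothetical.

```shell
# Read one command name or path per line on stdin and report whether
# both had and hashadow are present. Only exact basenames are matched.
check_had_processes() {
    awk '
        { n = split($1, p, "/"); name[p[n]] = 1 }
        END {
            ok = 1
            if (!("had" in name))      { print "had missing"; ok = 0 }
            if (!("hashadow" in name)) { print "hashadow missing"; ok = 0 }
            exit !ok
        }
    '
}

# Example: ps -e -o comm= | check_had_processes
```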

    LLT problem

    o Check console and logs for missing or misconfigured LLT files

    o Check LLT configuration files llttab, llthosts, sysname

    o Start LLT
      lltconfig -c

    o Ensure systems are visible on LLT
      lltstat -nvv

    o Check physical network components
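    Before starting LLT, the llttab file can be screened for the basic directives. This is a minimal sketch that only looks for `set-node`, `set-cluster`, and `link` lines; it does not validate device names or IDs, and the function name `check_llttab` and the two-link threshold are assumptions rather than VCS requirements.

```shell
# Screen an llttab file for the basic LLT directives.
# $1 = path to llttab (normally /etc/llttab). Prints one line per finding
# and returns non-zero if anything is reported.
check_llttab() {
    awk '
        $1 == "set-node"    { node = 1 }
        $1 == "set-cluster" { cluster = 1 }
        $1 == "link"        { links++ }
        END {
            bad = 0
            if (!node)     { print "missing set-node"; bad = 1 }
            if (!cluster)  { print "missing set-cluster"; bad = 1 }
            if (links < 2) { print "fewer than two link lines"; bad = 1 }
            exit bad
        }
    ' "$1"
}

# Example: check_llttab /etc/llttab
```

    Low-priority links (`link-lowpri`) are not counted by this sketch.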

    ADMIN_WAIT or STALE_ADMIN_WAIT

    o Visually inspect main.cf and restore or fix if necessary

    o Check and fix main.cf syntax
      hacf -verify conf_dir

    o Start VCS

      o Force VCS on the node with fixed main.cf
        hasys -force sys

      o Start VCS on other nodes
        hastart
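    Because forcing a node makes its local main.cf the cluster configuration, it is worth tying the force step to a successful syntax check. The sketch below does that; the function name `force_from_fixed_config` and the `HACF`/`HASYS` override variables are hypothetical conveniences for previewing the commands.

```shell
# Verify main.cf, then force the node to build the cluster config from it.
# HACF and HASYS can be overridden (e.g. HASYS="echo hasys") to preview
# what would be run without touching the cluster.
force_from_fixed_config() {
    conf_dir=$1   # directory containing the repaired main.cf
    sys=$2        # node whose local main.cf should be used
    ${HACF:-hacf} -verify "$conf_dir" || {
        echo "main.cf failed verification; not forcing $sys" >&2
        return 1
    }
    ${HASYS:-hasys} -force "$sys"
}

# Example: force_from_fixed_config /etc/VRTSvcs/conf/config sys1
```

    Once the forced node is running, the remaining nodes are started with hastart as listed above.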
