SUNW.HAStorageplus

rgm_rt_SUNW.HAStoragePlus

(updated from a sunsolve document)

HAStoragePlus supersedes HAStorage. The HAStoragePlus resource type has standard resource type properties, in order for the Resource Group Manager to manage it:

Property Name

Explanation

Value

ccr_gennum

Generation number of the file so we always use the latest

integer (usually 1)

ccr_checksum

A checksum value to validate file integrity

hex # (meaningless except to cluster)

RT_basedir

The base birectory to find all the cluster-related executables

/usr/cluster/lib/rgm/rt/hastorageplus

RT_description

A description of the resource type

HA Storage Plus - A Resource Type which subsumes HA Storage

START

The name of the start script/program

hastorageplus_start

STOP

The name of the stop script/program

hastorageplus_stop

VALIDATE

The name of the validate script/program

hastorageplus_validate

UPDATE

The name of the update script/program

hastorageplus_update

MONITOR_START

The name of the monitor start script/program

hastorageplus_monitor_start

MONITOR_STOP

The name of the monitor stop script/program

hastorageplus_monitor_stop

MONITOR_CHECK

The name of the monitor check program

hastorageplus_monitor_check

PRENET_START

The name of the prenet startup program

hastorageplus_prenet_start

POSTNET_STOP

The name of the postnet shutdown program

hastorageplus_postnet_stop

Single_instance

True if failover, false if scalable

FALSE

Init_nodes

Where the dataservice can be started

RG_PRIMARIES

Installed_nodes

A comma delimited list of nodes where the RT is installed

\* means all nodes

\*

Failover

True if a failover only service, false otherwise

FALSE

API_version

Version of the API (Application Programming Interface)

2

RT_version

Version of the resource type

1.0

Pkglist

A list of the packages installed to get this RT

SUNWscu



# pwd
/usr/cluster/lib/rgm/rt/hastorageplus
# ls -l
total 6320
-r-xr-xr-x 1 root bin 344544 Jan 28 06:36 hastorageplus_init
-r-xr-xr-x 1 root bin 273104 Jan 28 06:36 hastorageplus_monitor_check
-r-xr-xr-x 1 root bin 273100 Jan 28 06:36 hastorageplus_monitor_start
-r-xr-xr-x 1 root bin 273096 Jan 28 06:36 hastorageplus_monitor_stop
-r-xr-xr-x 1 root bin 347616 Jan 28 06:36 hastorageplus_postnet_stop
-r-xr-xr-x 1 root bin 346492 Jan 28 06:36 hastorageplus_prenet_start
-r-xr-xr-x 1 root bin 273020 Jan 28 06:36 hastorageplus_start
-r-xr-xr-x 1 root bin 273016 Jan 28 06:36 hastorageplus_stop
-r-xr-xr-x 1 root bin 351680 Jan 28 06:36 hastorageplus_update
-r-xr-xr-x 1 root bin 347696 Jan 28 06:36 hastorageplus_validate



HAStoragePlus uses the following properties, which will be displayed in the rgm_rt file,as well as in the comma delimited list of properties for the HAStoragePlus resource in a resource group.

Property

Common/Extension

Description

START_TIMEOUT

Common

Time in seconds for the start script/program to finish.

Default=90

Min=60

Tunable: Anytime

STOP_TIMEOUT

Common

Time in seconds for the stop script/program to finish.

Default=1800

Min=60

Tunable: Anytime

VALIDATE_TIMEOUT

Common

Time in seconds for the validate script/program to finish. Validate is called at resource creation and during updates of the resource or its resource group.

Default=1800

Min=60

Tunable: Anytime

UPDATE_TIMEOUT

Common

Time in seconds for the update script/program to finish.Update is called when the administrator updates properties of the resource or its resource group.Default=90

Min=60

Tunable: Anytime

MONITOR_START_TIMEOUT

Common

Time in seconds for the monitor_start script/program to finish. Monitor_start is called just after a resource has started.

Default=90

Min=60

Tunable: Anytime

MONITOR_STOP_TIMEOUT

Common

Time in seconds for the monitor_stop script/program to finish. Monitor_stop is called just before a resource is stopped.

Default=90

Min=60

Tunable: Anytime

MONITOR_CHECK_TIMEOUT

Common

Time in seconds for the monitor_check script/program to finish. Monitor_check is called when the probe script/program attempts to fail the RG to a new node.

Default=90

Min=60

Tunable: Anytime

PRENET_START_TIMEOUT

Common

Time in seconds for the prenet_start script/program to finish. Prenet_start is called just before the network addresses in the resource group are started.

Default=1800

Min=60

Tunable: Anytime

POSTNET_STOP_TIMEOUT

Common

Time in seconds for the postnet_stop script/program to finish. Postnet_stop is called jest after the network interfaces in the resource group are stopped.

Default=1800

Min=60

Tunable: Anytime

Failover_mode

Common

A property that indicates what to do if start or stop fails for the resource.

HARD means relocate the resource to another node if start fails, and abort the node if stop fails.

SOFT means relocate the resource to another node in the cluster if start fails; do nothing if stop fails.

NONE means do nothing on either a start failure or stop failure.

Default: SOFT

Tunable: Anytime

Retry_count

Common

The number of times a monitor attempts to restart a resource if it fails.

Default:2

Min 1 Max 10

Tunable: Anytime

Retry_interval

Common

The number of seconds over which to count attempts to restart a resource.

Default 300

Min 30 Max 3600

Tunable: Anytime

GlobalDevicePaths

Extension

The list of HA global device paths

Tunable: When Disabled

FilesystemMountPoints

Extension

The list of file system mountpoints

Tunable: Anytime

AffinityOn

Extension

Specify whether this resource needs to do affinity switchover.

When AffinityOn is set to False, the HAStorage resource waits for the

specified global services to become available. As a result, the primary

of each online global service might not be the same node that

is the primary of the resource group.

The purpose of an affinity switchover is to enhance performance by having data services and

their dependent global services run on the same node.

For each global service, the HAStorageresource attempts affinity switchover only once.

If switchover fails, nothing is affected and the availability check occurs normally.

Default: True

Tunable: When disabled

FilesystemCheckCommand

Extension

Command string to be executed for file system checks

Tunable: Anytime

Zpools

Extension

List of Zpools

Tunable: Anytime

ZpoolsSearchDir

Extension

Directory location to search devices for zpools

Tunable: When disabled





This is an example of the resource type as it exists in the rgm_rt_SUNW.HAStoragePlus file in the CCR:



# cat rgm_rt_SUNW.HAStoragePlus:6
cccr_gennum      1
ccr_checksum    9EA886E0FFCBD213CBC24726D552FDF7
RT_basedir      /usr/cluster/lib/rgm/rt/hastorageplus
RT_description  HA Storage Plus
VALIDATE        hastorageplus_validate
UPDATE  hastorageplus_update
INIT    hastorageplus_init
MONITOR_START   hastorageplus_monitor_start
MONITOR_STOP    hastorageplus_monitor_stop
MONITOR_CHECK   hastorageplus_monitor_check
PRENET_START    hastorageplus_prenet_start
POSTNET_STOP    hastorageplus_postnet_stop
Single_instance FALSE
Init_nodes      RG_PRIMARIES
Installed_nodes \*
Failover        FALSE
Proxy   FALSE
Global_zone     TRUE
RT_system       FALSE
API_version     2
RT_version      6
Pkglist SUNWscu
Upgrade
Upgrade_from.5  ANYTIME
Upgrade_from.4  ANYTIME
Upgrade_from.3  ANYTIME
Upgrade_from.2  ANYTIME
Upgrade_from.1.0        ANYTIME
p.VALIDATE_TIMEOUT      Tunable=ANYTIME;Type=INT;Default=1800;Min=60;
p.UPDATE_TIMEOUT        Tunable=ANYTIME;Type=INT;Default=1800;Min=60;
p.INIT_TIMEOUT  Tunable=ANYTIME;Type=INT;Default=1800;Min=60;
p.MONITOR_START_TIMEOUT Tunable=ANYTIME;Type=INT;Default=90;Min=60;
p.MONITOR_STOP_TIMEOUT  Tunable=ANYTIME;Type=INT;Default=90;Min=60;
p.MONITOR_CHECK_TIMEOUT Tunable=ANYTIME;Type=INT;Default=90;Min=60;
p.PRENET_START_TIMEOUT  Tunable=ANYTIME;Type=INT;Default=1800;Min=60;
p.POSTNET_STOP_TIMEOUT  Tunable=ANYTIME;Type=INT;Default=1800;Min=60;
p.Failover_mode Tunable=ANYTIME;Type=ENUM;Default=SOFT;Enumlist=NONE,HARD,SOFT,RESTART_ONLY,LOG_ONLY;
p.Retry_count   Tunable=ANYTIME;Type=INT;Default=2;Min=1;Max=10;
p.Retry_interval        Tunable=ANYTIME;Type=INT;Default=300;Min=30;Max=3600;
x.GlobalDevicePaths     Tunable=WHEN_DISABLED;Type=STRINGARRAY;Default=;Max=4194303;Description="The list of HA global device paths";
x.FilesystemMountPoints Tunable=ANYTIME;Type=STRINGARRAY;Default=;Max=4194303;Description="The list of file system mountpoints";
x.AffinityOn    Tunable=WHEN_DISABLED;Type=BOOLEAN;Default=TRUE;Description="For specifying affinity switchover";
x.FilesystemCheckCommand        Tunable=ANYTIME;Type=STRINGARRAY;Default=;Max=4194303;Description="Command string to be executed for file system checks";
x.Zpools        Tunable=ANYTIME;Type=STRINGARRAY;Default=;Max=4194303;Description="The list of zpools";
x.ZpoolsSearchDir       Tunable=WHEN_DISABLED;Type=STRING;Default=;Max=4194303;Description="Directory location to search devices for zpools";



How to enable debug mode for HAstorageplus resource :


-Configure syslog with new options:

add daemon.debug messages login, to /var/adm/debug for example.

in syslog.conf :

daemon.debug        /var/adm/debug

# touch /var/adm/debug

Restart syslogd with:

# pkill -HUP syslogd

Test the addition to /etc/syslog.conf with:

# logger -p daemon.debug test

You should see output with "test" at the end of /var/adm/debug

       The file that controls the debug level is called "loglevel"
       To turn on debugging for a the HAStoragePlus type, you would do the following:

# mkdir -p /var/cluster/rgm/rt/SUNW.hastorageplus 
# echo 9 > /var/cluster/rgm/rt/SUNW.hastorageplus/loglevel




restart the resource

Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 314356 daemon.notice] resource test4-rs enabled.
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group test4-rg state on node v4u-4800e-dombc-sin06 change to RG_PENDING_ONLINE
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource test4-rs state on node v4u-4800e-dombc-sin06 change to R_PRENET_STARTING
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_prenet_start> for resource <test4-rs>, resource group <test4-rg>, node <v4u-4800e-dombc-sin06>, timeout <1800> seconds
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource test4-rs status on node v4u-4800e-dombc-sin06 change to R_FM_UNKNOWN
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource test4-rs status msg on node v4u-4800e-dombc-sin06 change to <Starting>
Mar 10 18:00:11 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 863274 daemon.notice] 40 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/rgm/rt/hastorageplus/hastorageplus_prenet_start>:tag=<test4-rg.test4-rs.10>: Calling security_clnt_connect(..., host=<v4u-4800e-dombc-sin06>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Mar 10 18:00:11 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 474256 daemon.info] Validations of all specified global device services complete.
Mar 10 18:00:11 v4u-4800e-dombc-sin06 java[1977]: [ID 697678 daemon.debug] sysevent_jni(): arg timestamp of type 18 ignored
Mar 10 18:00:11 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 664089 daemon.info] All global device services successfully switched over to this node.

Mar 10 18:00:11 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 123984 daemon.info] All specified global device services are available.
Mar 10 18:00:11 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 127065 daemon.info] About to perform file system check of /dev/md/test4-ds/rdsk/d0 (/test4) using command /usr/sbin/fsck -o p.
Mar 10 18:00:12 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 831031 daemon.info] File system check of /test4 (/dev/md/test4-ds/rdsk/d0) successful.
Mar 10 18:00:12 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 127930 daemon.info] About to mount /test4.
Mar 10 18:00:12 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 880921 daemon.info] /test4 is confirmed as mounted.
Mar 10 18:00:12 v4u-4800e-dombc-sin06 SC[,SUNW.HAStoragePlus:6,test4-rg,test4-rs,hastorageplus_prenet_start]: [ID 885688 daemon.info] Mounting the file systems is completed successfully.
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 515159 daemon.notice] method <hastorageplus_prenet_start> completed successfully for resource <test4-rs>, resource group <test4-rg>, node <v4u-4800e-dombc-sin06>, time used: 0% of timeout <1800 seconds>
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource test4-rs state on node v4u-4800e-dombc-sin06 change to R_JUST_STARTED
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource test4-rs state on node v4u-4800e-dombc-sin06 change to R_ONLINE_UNMON
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource test4-rs status on node v4u-4800e-dombc-sin06 change to R_FM_ONLINE
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource test4-rs status msg on node v4u-4800e-dombc-sin06 change to <>
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource test4-rs state on node v4u-4800e-dombc-sin06 change to R_MON_STARTING
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group test4-rg state on node v4u-4800e-dombc-sin06 change to RG_PENDING_ON_STARTED
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_monitor_start> for resource <test4-rs>, resource group <test4-rg>, node <v4u-4800e-dombc-sin06>, timeout <90> seconds
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 863274 daemon.notice] 40 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/rgm/rt/hastorageplus/hastorageplus_monitor_start>:tag=<test4-rg.test4-rs.7>: Calling security_clnt_connect(..., host=<v4u-4800e-dombc-sin06>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 515159 daemon.notice] method <hastorageplus_monitor_start> completed successfully for resource <test4-rs>, resource group <test4-rg>, node <v4u-4800e-dombc-sin06>, time used: 0% of timeout <90 seconds>
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource test4-rs state on node v4u-4800e-dombc-sin06 change to R_ONLINE
Mar 10 18:00:12 v4u-4800e-dombc-sin06 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group test4-rg state on node v4u-4800e-dombc-sin06 change to RG_ONLINE
Comments:

Post a Comment:
  • HTML Syntax: NOT allowed
About

Jean-Christophe Lamoure

Search

Archives
« April 2014
SunMonTueWedThuFriSat
  
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
   
       
Today