UXMON: SSHD Daemon is not running or not doing it properly, please check

Node : solaris.setaoffice.com
Node Type : Sun SPARC (HTTPS)
Severity : normal
OM Server Time: 2016-09-10 08:03:10
Message : UXMON: SSHD Daemon is not running or not doing it properly, please check
Msg Group : OS
Application : sshd_mon
Object : sshd
Event Type :
not_found

Instance Name :
not_found

Instruction : It has been detected an SSH installation but the SSHD is not running
Please check SSH status, because it might happen also there are still some ssh spawned processes running but the father has died.

Note that if the SSH is not available this might prevent users log in the server and even impact some applications.

HPOM complains that sshd is not running, but it obviously is, because you are connected to the server over ssh

root@solaris:~ # /var/opt/OV/bin/instrumentation/UXMONbroker -check sshdmon
Fri Sep 23 11:45:46 2016 : INFO : UXMONsshdmon is running now, pid=2250
Fri Sep 23 11:45:46 2016 : SSHDMON: SSHD – Not running
mv: /dev/null and /dev/null are identical
Fri Sep 23 11:45:46 2016 : INFO : UXMONsshdmon end, pid=2250

Check directory /var/run

root@solaris:/var/run # ls -la
total 16
drwxr-xr-x 4 root other 5 Sep 23 11:50 .
drwxr-xr-x 44 root sys 50 Aug 16 11:04 ..
-rw------- 1 root root 6 Jul 7 11:24 ds_agent.pid
drwxr-xr-x 13 root root 13 Aug 10 15:33 install_engine
drwx--x--x 2 root sys 2 Jul 6 14:27 sudo

/var/run should hold many more files than that, including sshd.pid. A typical listing looks like this:

root@solaris:/var/run # ls -l
total 272
-rw------- 1 root root 0 Sep 10 21:27 AdDrEm.lck
drwxr-xr-x 3 root sys 183 Sep 10 21:43 cacao
-rw-rw-rw- 1 root bin 14 Sep 23 09:20 cdrom_rcm.conf
drwxr-xr-x 2 daemon daemon 183 Sep 23 12:18 daemon
-rw-r----- 1 root root 6 Sep 23 10:41 did_reloader.lock
-rw------- 1 root root 5 Sep 10 21:27 ds_agent.pid
Drw-r----- 1 root root 0 Sep 10 21:28 event_listener_proxy_door
Drw-r--r-- 1 root root 0 Sep 10 21:40 fed_doorglobal
Drw-r--r-- 1 root root 0 Sep 10 21:27 hotplugd_door
Drw-r--r-- 1 root root 0 Sep 10 21:28 ifconfig_proxy_doorglobal
-rw------- 1 root root 0 Sep 10 21:26 ipsecconf.lock
Dr--r--r-- 1 daemon daemon 0 Sep 10 21:26 kcfd_door
-rw------- 1 root root 0 Sep 14 09:07 lockf_raidctl
Dr--r--r-- 1 root root 0 Sep 10 21:26 name_service_door
-rw-r--r-- 1 root root 8 Sep 10 21:40 nfs4_domain
drwxr-xr-x 2 root root 179 Sep 10 21:40 pcmcia
Dr--r--r-- 1 root root 0 Sep 10 21:26 picld_door
Drw-r--r-- 1 root root 0 Sep 10 21:30 pmfd_doorglobal
-rw-r--r-- 1 root sys 58 Sep 10 21:30 psn
Dr-------- 1 root root 0 Sep 10 21:26 rcm_daemon_door
-rw-r--r-- 1 root root 0 Sep 10 21:26 rcm_daemon_lock
-rw------- 1 root root 1068 Sep 10 21:26 rcm_daemon_state
Drw-r--r-- 1 root root 0 Sep 10 21:40 rgmd_receptionist_doorglobal
drwxrwxrwt 2 root root 186 Sep 10 21:27 rpc_door
drwx------ 2 root root 182 Sep 10 21:27 smc898
-rw-r--r-- 1 root root 5 Sep 10 21:27 sshd.pid
drwx--x--x 3 root sys 176 Sep 10 21:31 sudo
drwxr-xr-x 3 root root 191 Sep 10 21:26 sysevent_channels
Drw-r--r-- 1 root root 0 Sep 10 21:30 sysevent_proxy_doorglobal
-rw-r--r-- 1 root root 5 Sep 10 21:27 syslog.pid
Drw-r--r-- 1 root root 0 Sep 10 21:27 syslog_door
-rw-r--r-- 1 root root 8192 Sep 10 21:26 tzsync
drwx------ 2 root root 2625 Sep 23 10:26 zones
Drw-r--r-- 1 root root 0 Sep 10 21:30 zoneup_doorglobal

Fixing the issue for the ticket. Check ssh processes for root user

root@solaris:/var/run # ps -ef | grep ssh | grep root
root 8047 1 0 Sep 21 ? 0:00 /usr/lib/ssh/sshd
root 17380 13924 0 00:07:02 ? 0:00 /usr/lib/ssh/sshd
root 5570 13878 0 08:08:40 ? 0:00 /usr/lib/ssh/sshd
root 13877 1 0 Sep 21 ? 0:00 /usr/lib/ssh/sshd
root 1003 13878 0 09:17:01 ? 0:00 /usr/lib/ssh/sshd
root 13903 1 0 Sep 21 ? 0:00 /usr/lib/ssh/sshd
root 60966 13918 0 00:03:07 ? 0:00 /usr/lib/ssh/sshd
root 48654 13878 0 10:13:22 ? 0:00 /usr/lib/ssh/sshd
root 13918 1 0 Sep 21 ? 0:00 /usr/lib/ssh/sshd
root 17389 13924 0 00:07:02 ? 0:00 /usr/lib/ssh/sshd
root 39554 13878 0 09:21:51 ? 0:00 /usr/lib/ssh/sshd
root 64681 1 0 11:25:02 ? 0:00 /usr/lib/ssh/sshd
root 11912 13878 0 09:29:14 ? 0:00 /usr/lib/ssh/sshd
root 56172 13878 0 11:54:55 ? 0:00 /usr/lib/ssh/sshd
root 17386 13924 0 00:07:02 ? 0:00 /usr/lib/ssh/sshd
root 34708 13878 0 08:51:07 ? 0:00 /usr/lib/ssh/sshd
root 60201 13878 0 09:27:36 ? 0:00 /usr/lib/ssh/sshd
root 55272 1 0 11:54:33 ? 0:00 /usr/lib/ssh/sshd
root 5850 13878 0 08:08:47 ? 0:00 /usr/lib/ssh/sshd
root 9865 44290 0 11:56:17 pts/4 0:00 grep ssh
root 13924 1 0 Sep 21 ? 0:00 /usr/lib/ssh/sshd
root 13878 1 0 Sep 21 ? 0:01 /usr/lib/ssh/sshd
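
In a listing like this, the sshd processes started directly under PID 1 are the candidates for the main daemon, while per-session processes have another sshd as parent. A minimal sketch that filters `ps -ef` output for those candidates; the function name `sshd_parents` is made up for illustration, and it assumes the usual `ps -ef` layout where PID and PPID are the second and third columns.

```shell
# List PIDs of sshd processes whose parent is PID 1.
# Reads `ps -ef` output on stdin (columns assumed: UID PID PPID ...).
sshd_parents() {
    awk '$3 == 1 && $0 ~ /\/sshd( |$)/ { print $2 }'
}

# Usage on a live system:
#   ps -ef | sshd_parents
```

When several PIDs come back, as on this server, you still have to decide which one is the listening daemon.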

Create the file /var/run/sshd.pid with the PID of one of the sshd processes whose parent is PID 1 (8047 in this case)

echo 8047 > /var/run/sshd.pid

root@solaris:/var/run # ls -l sshd.pid
-rw-r--r-- 1 root root 5 Sep 10 21:27 sshd.pid

sshdmon does not complain anymore

root@solaris:~ # /var/opt/OV/bin/instrumentation/UXMONbroker -check sshdmon
Fri Sep 23 11:58:15 2016 : INFO : UXMONsshdmon is running now, pid=18095
mv: /dev/null and /dev/null are identical
Fri Sep 23 11:58:15 2016 : INFO : UXMONsshdmon end, pid=18095
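
Since the alarm went away once /var/run/sshd.pid existed, the monitor apparently relies on that file. A small sketch for checking that a PID file points at a live process; `pidfile_alive` is a hypothetical helper name, and it should be run as root because `kill -0` fails on processes owned by other users.

```shell
# Succeeds if the PID stored in the given file belongs to a running process.
pidfile_alive() {
    pid=$(cat "$1" 2>/dev/null) || return 1
    case $pid in
        ''|*[!0-9]*) return 1 ;;   # empty or not a number
    esac
    kill -0 "$pid" 2>/dev/null     # signal 0 only tests for existence
}

# Usage:
#   pidfile_alive /var/run/sshd.pid && echo "sshd.pid points at a live process"
```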

UXMON:bond1.1504 is down - Network bonding interface alarms as down although it is active in the system

Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : critical
OM Server Time: 2016-09-27 18:35:55
Message : UXMON:bond1.1504 is down
Msg Group : OS
Application : bondmon
Object : bond
Event Type :
not_found

Instance Name :
not_found

Instruction : The ‘cat /sys/class/net/$bond/bonding/mii_status’ command shows the detail status

Please check /var/opt/OV/log/OpC/bond_mon.log for more details

The module bondmon is complaining about a network bonding interface down

root@linux:~ # /var/opt/OV/bin/instrumentation/UXMONbroker -check bondmon
Wed Sep 28 08:15:43 2016 : INFO : UXMONbondmon is running now, pid=31311
Wed Sep 28 08:15:43 2016 : Critical: bond1.1504 is down
mv: `/dev/null' and `/dev/null' are the same file
Wed Sep 28 08:15:43 2016 : INFO : UXMONbondmon end, pid=31311
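
The ticket instruction points at /sys/class/net/$bond/bonding/mii_status. A short sketch that prints that status for every bonding interface at once; the function name `bond_status` and its optional directory argument are assumptions added for illustration.

```shell
# Print "<interface> <mii_status>" for every bonding interface found under
# the given sysfs directory (defaults to /sys/class/net).
bond_status() {
    root=${1:-/sys/class/net}
    for f in "$root"/*/bonding/mii_status; do
        [ -r "$f" ] || continue            # no bonds: the glob stays literal
        dev=${f%/bonding/mii_status}
        printf '%s %s\n' "${dev##*/}" "$(cat "$f")"
    done
}

# Usage:
#   bond_status
```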

bond1.1504 was listed among the network bonding interfaces, but it had no interfaces available.

root@linux:~ # ls -l /proc/net/bonding
total 0
-r--r--r-- 1 root root 0 Sep 28 08:48 bond0
-r--r--r-- 1 root root 0 Sep 28 08:48 bond1
-r--r--r-- 1 root root 0 Sep 28 08:48 bond1.1504
-r--r--r-- 1 root root 0 Sep 28 08:48 bond2

Removed bond1.1504

root@linux:~ # echo "-bond1.1504" > /sys/class/net/bonding_masters

root@linux:~ # ls -l /proc/net/bonding
total 0
-r--r--r-- 1 root root 0 Sep 28 08:59 bond0
-r--r--r-- 1 root root 0 Sep 28 08:59 bond1
-r--r--r-- 1 root root 0 Sep 28 08:59 bond2

The configuration file for bond1.1504 was missing the parameter VLAN=yes, so I added it. This is the file before the change, followed by ifcfg-bond0.1504 for comparison

root@linux:~ # cat /etc/sysconfig/network-scripts/ifcfg-bond1.1504
DEVICE=bond1.1504
BOOTPROT=none
ONBOOT=yes
IPADDRES=10.32.28.175
NETMASK=255.255.255.0
BONDING_OPTS="miimon=1000 mode=active-backup"

root@linux:~ # cat /etc/sysconfig/network-scripts/ifcfg-bond0.1504
DEVICE=bond0.1504
BOOTPROT=none
ONBOOT=yes
IPADDR=10.32.17.87
NETMASK=255.255.254.0
BONDING_OPTS="miimon=1000 mode=active-backup"
VLAN=yes
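
Comparing a broken ifcfg file against a working one by eye is easy to get wrong. A rough sketch that lists the keys a reference file has and the other file lacks; `ifcfg_keys` and `missing_keys` are hypothetical helper names, and only plain KEY=value lines are considered.

```shell
# Print the KEY part of every KEY=value line in an ifcfg file, sorted.
ifcfg_keys() {
    sed -n 's/^\([A-Za-z_][A-Za-z0-9_]*\)=.*/\1/p' "$1" | sort -u
}

# Print keys that appear in the reference file ($1) but not in the target ($2).
missing_keys() {
    ifcfg_keys "$1" | while read -r key; do
        grep -q "^$key=" "$2" || echo "$key"
    done
}

# Usage:
#   cd /etc/sysconfig/network-scripts
#   missing_keys ifcfg-bond0.1504 ifcfg-bond1.1504
```

Keys are matched exactly, so a misspelled key in the target is reported as missing as well.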

Bring the network interface up

root@linux:~ # ifup ifcfg-bond1.1504

Then configure the IP address and netmask given in the configuration file

root@linux:~ # ifconfig bond1.1504 10.32.28.175 netmask 255.255.255.0

root@linux:~ # ifconfig bond1.1504
bond1.1504 Link encap:Ethernet HWaddr 6C:C2:17:30:88:88
inet addr:10.32.28.175 Bcast:10.32.28.255 Mask:255.255.255.0
inet6 addr: fe80::6ec2:17ff:fe30:8888/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:1034 errors:0 dropped:0 overruns:0 frame:0
TX packets:6 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:0
RX bytes:78494 (76.6 KiB) TX bytes:468 (468.0 b)

Running UXMONbroker with the module bondmon

root@linux:~ # /var/opt/OV/bin/instrumentation/UXMONbroker -check bondmon
Wed Sep 28 09:15:19 2016 : INFO : UXMONbondmon is running now, pid=25212
mv: `/dev/null' and `/dev/null' are the same file
Wed Sep 28 09:15:19 2016 : INFO : UXMONbondmon end, pid=25212

Enabling EPEL on CentOS 7

Extra Packages for Enterprise Linux (or EPEL) is a Fedora Special Interest Group that creates, maintains, and manages a high quality set of additional packages for Enterprise Linux, including, but not limited to, Red Hat Enterprise Linux (RHEL), CentOS and Scientific Linux (SL), Oracle Linux (OL).

To enable it on CentOS 7, simply install a package called epel-release

root@linux:~ # yum install -y epel-release
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
* base: centos.xpg.com.br
* extras: centos.xpg.com.br
* updates: mirror.globo.com
Resolving Dependencies
--> Running transaction check
---> Package epel-release.noarch 0:7-6 will be installed
--> Finished Dependency Resolution

Dependencies Resolved

================================================================================
Package Arch Version Repository Size
================================================================================
Installing:
epel-release noarch 7-6 extras 14 k

Transaction Summary
================================================================================
Install 1 Package

Total download size: 14 k
Installed size: 24 k
Downloading packages:
epel-release-7-6.noarch.rpm | 14 kB 00:00
Running transaction check
Running transaction test
Transaction test succeeded
Running transaction
Installing : epel-release-7-6.noarch 1/1
Verifying : epel-release-7-6.noarch 1/1

Installed:
epel-release.noarch 0:7-6

Complete!

Listing repositories

root@linux:~ # yum repolist
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
* base: centos.xpg.com.br
* epel: mirror.globo.com
* extras: centos.xpg.com.br
* updates: mirror.globo.com
repo id repo name status
base/7/x86_64 CentOS-7 - Base 9,007
epel/x86_64 Extra Packages for Enterprise Linux 7 - x86_64 10,600
extras/7/x86_64 CentOS-7 - Extras 390
updates/7/x86_64 CentOS-7 - Updates 2,471
repolist: 22,468
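
To check for the repository from a script instead of reading the list, something like the following could work. It is only a sketch: `repo_listed` is a made-up name, and it assumes the repo id is the first column of the `yum repolist` output, optionally followed by `/version/arch`.

```shell
# Read `yum repolist` output on stdin; succeed if the given repo id is listed.
repo_listed() {
    awk -v id="$1" '
        { split($1, p, "/"); sub(/^[!*]/, "", p[1]); if (p[1] == id) found = 1 }
        END { exit !found }
    '
}

# Usage:
#   yum repolist enabled | repo_listed epel && echo "EPEL is enabled"
```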

Samba share without permission. Directory showing as d---------

The server mounts a network share that is exported over CIFS

root@linux:~ # df -hP /arq/avf/ROT_EFC
Filesystem Size Used Avail Use% Mounted on
//172.20.1.2/Operacao_ROT_EFC$ 43G 40G 3.2G 93% /arq/avf/ROT_EFC

root@linux:~ # mount | grep ROT_EFC
//172.20.1.2/Operacao_ROT_EFC$ on /arq/avf/ROT_EFC type cifs (rw)

The entry for the filesystem in /etc/fstab

root@linux:~ # grep Operacao_ROT_EFC /etc/fstab
//172.20.1.2/Operacao_ROT_EFC$ /arq/avf/ROT_EFC cifs _netdev,user=s-ad-USER1468,pass=userpassword,uid=21376,gid=889,file_mode=0775,dir_mode=0775,domain=setaoffice,cifsacl

The directory has no permission bits set, and they cannot be changed from the Linux server brqsb1valeas890

root@linux:~ # ls -ld /arq/avf/ROT_EFVM /arq/avf/ROT_EFC
d--------- 7 user1468 admweb 0 Sep 15 15:53 /arq/avf/ROT_EFC

root@linux:~ # chmod 775 /arq/avf/ROT_EFC
chmod: changing permissions of `/arq/avf/ROT_EFC': Permission denied

No problem mounting manually

mount -t cifs //172.20.1.2/Operacao_ROT_EFC$ /arq/avf/ROT_EFC -o "username=s-ad-USER1468,domain=setaoffice,uid=21376,gid=889,file_mode=0775,dir_mode=0775"

I rewrote the entry in /etc/fstab. The old line must have contained a hidden character

root@linux:~ # umount /arq/avf/ROT_EFC
root@linux:~ # mount /arq/avf/ROT_EFC
root@linux:~ # ls -dl /arq/avf/ROT_EFC
drwxrwxr-x 7 user1468 admweb 0 Sep 15 15:53 /arq/avf/ROT_EFC
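
The hidden character was only a guess here and was never confirmed. If you want to look for one before retyping a line, a sketch like this prints every line holding a byte that is neither printable ASCII nor a tab (carriage returns, non-breaking spaces and so on). The function name `find_hidden_chars` is invented for the example, and the `-a` option assumes GNU or BSD grep.

```shell
# Print, with line numbers, each line containing a byte that is neither
# printable ASCII nor a tab.
find_hidden_chars() {
    tab=$(printf '\t')
    LC_ALL=C grep -an "[^[:print:]$tab]" "$1"
}

# Usage:
#   find_hidden_chars /etc/fstab
#   cat -A /etc/fstab    # GNU cat: makes control and non-ASCII bytes visible
```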

UXMON: File /var/log/cron age exceeds 3d threshold on linux.setaoffice.com

ATTENTION, RMC LEVEL 1 AGENT: This ticket will be automatically worked by the Automation Bus. Pls. ensure your Ticket List/View includes the “Assignee” column, monitor this ticket until the user “ABOPERATOR” is no longer assigned, BEFORE you start work on this ticket.
Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : warning
OM Server Time: 2016-09-09 14:51:09
Message : UXMON: File /var/log/cron age exceeds 3d threshold.
Msg Group : OS
Application : actmon
Object : LINUX
Event Type : NONE
Instance Name : NONE
Instruction : No

This is a SUSE Linux server

root@linux:~ # cat /etc/*release
SUSE Linux Enterprise Server 10 (x86_64)
VERSION = 10
PATCHLEVEL = 4
LSB_VERSION="core-2.0-noarch:core-3.0-noarch:core-2.0-x86_64:core-3.0-x86_64"

On SUSE Linux, cron messages are logged to /var/log/messages.

Comment out the /var/log/cron line in the configuration file

/var/opt/OV/conf/OpC/act_mon.cfg
[LINUX]
#/var/log/cron 3d WARNING 0000-2400 * TT_LINUX
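
The same change can be scripted when several servers need it. A minimal sketch, assuming a sed that supports `-i` with a backup suffix (GNU sed does); `comment_out_entry` is a hypothetical helper, and the path is used as a regular expression, so keep it free of special characters.

```shell
# Comment out every line that starts with the given path, keeping a backup
# of the original file with a .bak suffix.
comment_out_entry() {
    sed -i.bak "s|^$1[[:space:]]|#&|" "$2"
}

# Usage:
#   comment_out_entry /var/log/cron /var/opt/OV/conf/OpC/act_mon.cfg
```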

UXMON: Too many instances: ntpd

ATTENTION, RMC LEVEL 1 AGENT: This ticket will be automatically worked by the Automation Bus. Pls. ensure your Ticket List/View includes the “Assignee” column, monitor this ticket until the user “ABOPERATOR” is no longer assigned, BEFORE you start work on this ticket.
Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : warning
OM Server Time: 2016-09-11 02:03:20
Message : UXMON: Too many instances: ntpd . ARGS: -p /var/run/ntp/ntpd.pid -x -g -u ntp:ntp -i /var/lib/ntp -c /etc/ntp.conf
Msg Group : OS
Application : psmon
Object : ntpd
Event Type : NONE
Instance Name : NONE
Instruction : No

When two ntpd processes are running, review your /etc/ntp.conf file. Most likely it lists an NTP server that cannot be reached

root@linux:~ # ps -ef | grep ntp
root 24639 24476 0 10:22 pts/0 00:00:00 grep ntp
ntp 26152 1 0 Sep11 ? 00:00:06 /usr/sbin/ntpd -p /var/run/ntp/ntpd.pid -x -g -u ntp:ntp -i /var/lib/ntp -c /etc/ntp.conf
root 26277 26152 0 Sep11 ? 00:00:00 /usr/sbin/ntpd -p /var/run/ntp/ntpd.pid -x -g -u ntp:ntp -i /var/lib/ntp -c /etc/ntp.conf

Review your /etc/ntp.conf file and restart NTP

root@linux:~ # service ntp stop
Shutting down network time protocol daemon (NTPD) done

root@linux:~ # ps -ef | grep ntp
root 24778 24476 0 10:23 pts/0 00:00:00 grep ntp

root@linux:~ # service ntp start
Starting network time protocol daemon (NTPD) done

root@linux:~ # ps -ef | grep ntp
root 24819 1 0 10:23 ? 00:00:00 /usr/sbin/ntpd -p /var/run/ntp/ntpd.pid -x -g -u ntp:ntp -i /var/lib/ntp -c /etc/ntp.conf
root 24824 24476 0 10:23 pts/0 00:00:00 grep ntp
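
Counting by eye works, but `grep ntp` also matches itself. A small sketch that counts processes by executable name instead; `count_instances` is an illustrative name, and it expects one command line per input line, as printed by `ps -e -o args=`.

```shell
# Count input lines whose first word is the given program name,
# with or without a leading directory.
count_instances() {
    awk -v name="$1" '{ n = split($1, p, "/"); if (p[n] == name) c++ } END { print c + 0 }'
}

# Usage:
#   ps -e -o args= | count_instances ntpd
```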

UXMON: Power Supply Error

Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : major
OM Server Time: 2016-08-25 16:40:25
Message : UXMON: Power Supply Error
Msg Group : OS
Application : hwmon
Object : hardware
Event Type :
not_found

Instance Name :
not_found

Instruction : No

I received a ticket about a power supply problem on an HP ProLiant DL580 Gen9.

Check with hpasmcli

root@linux:~ # rpm -qf /sbin/hpasmcli
hp-health-10.20-1723.26.sles11

root@linux:~ # hpasmcli -s "show powersupply"

Power supply #1
Present : Yes
Redundant: Yes
Condition: Ok
Hotplug : Supported
Power : 235 Watts
Power supply #2
Present : Yes
Redundant: Yes
Condition: DEGRADED
Hotplug : Supported
Power supply #3
Present : Yes
Redundant: Yes
Condition: Ok
Hotplug : Supported
Power : 235 Watts
Power supply #4
Present : Yes
Redundant: Yes
Condition: Ok
Hotplug : Supported
Power : 205 Watts

Power supply #2 is in DEGRADED condition. Schedule a replacement with HP

UXMON: EXT3-fs: checktime reached, running e2fsck is recommended

Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : warning
OM Server Time: 2016-08-31 10:22:49
Message : UXMON: EXT3-fs: checktime reached, running e2fsck is recommended
Msg Group : OS
Application : dmsg_mon
Object : EXT3
Event Type :
not_found

Instance Name :
not_found

Instruction : No

Check /etc/fstab: the root filesystem should have 1 in the last column (the fsck pass number), and every other filesystem that is checked at boot should have 2

root@linux:~ # cat /etc/fstab
/dev/cciss/c0d0p1 / ext3 acl,user_xattr 1 1
/dev/cciss/c0d0p3 /boot ext3 acl,user_xattr 1 2
/dev/localVG/var /var ext3 acl,user_xattr 1 2
/dev/localVG/app.software /usr/software ext3 acl,user_xattr 1 2
/dev/localVG/dat.users /usr/users ext3 acl,user_xattr 1 2
/dev/localVG/dat.audit /audit ext3 acl,user_xattr 1 2
/dev/cciss/c0d0p2 swap swap defaults 0 0
proc /proc proc defaults 0 0
sysfs /sys sysfs noauto 0 0
debugfs /sys/kernel/debug debugfs noauto 0 0
usbfs /proc/bus/usb usbfs noauto 0 0
devpts /dev/pts devpts mode=0620,gid=5 0 0
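
For a long fstab, the pass numbers can be pulled out with awk. A sketch, with `fsck_pass` as a hypothetical name; it only looks at ext2/ext3/ext4 entries and treats a missing sixth field as 0, which is the fstab default.

```shell
# Print "<mount point> <fsck pass number>" for each ext2/3/4 entry
# in an fstab-style file (defaults to /etc/fstab).
fsck_pass() {
    awk '$1 !~ /^#/ && $3 ~ /^ext[234]$/ { print $2, ($6 == "" ? 0 : $6) }' "${1:-/etc/fstab}"
}

# Usage:
#   fsck_pass
```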

Create the file /forcefsck and reboot the server if possible

root@linux:~ # ls -l /forcefsck
/bin/ls: /forcefsck: No such file or directory

root@linux:~ # touch /forcefsck

root@linux:~ # shutdown -r now

Run dmesg -c twice to clear the dmesg buffer

root@linux:~ # dmesg -c
root@linux:~ # dmesg -c

UXMON: Volume UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 should be mounted on /srv. Please check your

ATTENTION, RMC LEVEL 1 AGENT: This ticket will be automatically worked by the Automation Bus. Pls. ensure your Ticket List/View includes the “Assignee” column, monitor this ticket until the user “ABOPERATOR” is no longer assigned, BEFORE you start work on this ticket.
Node : linux.setaoffice.com
Node Type : Intel/AMD x64(HTTPS)
Severity : warning
OM Server Time: 2016-07-22 13:31:16
Message : UXMON: Volume UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 should be mounted on /srv. Please check your vfstab fstab or filesystems file. Please also check: UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8 UUID=c7c47b25-30d8-42bc-8ca8-13f939b5c7b8
Msg Group : OS
Application : volmon
Object : LVM
Event Type : NONE
Instance Name : NONE
Instruction : No

This looks like an HPOM bug. The server runs SUSE Linux Enterprise Server 11 SP3 and /srv is a btrfs filesystem.

root@linux:~ # df -h /srv
Filesystem Size Used Avail Use% Mounted on
/dev/dm-12 509G 16G 492G 4% /srv

root@linux:~ # blkid | grep dm-12
root@linux:~ #

Running blkid directly against the multipath device makes it show up in the plain blkid listing afterwards

root@linux:~ # blkid /dev/dm-12
/dev/dm-12: UUID="c7c47b25-30d8-42bc-8ca8-13f939b5c7b8" UUID_SUB="ebe2d68d-0b4f-4586-bd40-6476a824f170" TYPE="btrfs"

root@linux:~ # blkid | grep dm-12
/dev/dm-12: UUID="c7c47b25-30d8-42bc-8ca8-13f939b5c7b8" UUID_SUB="ebe2d68d-0b4f-4586-bd40-6476a824f170" TYPE="btrfs"

Solaris Volume Manager – Delete replicas of the metadevice state database

In this Solaris server, one of the disks needs replacement

root@solaris # echo | format
Searching for disks…done

AVAILABLE DISK SELECTIONS:
0. c0t2d0 <drive not available>
/pci@1f,0/pci@1,1/scsi@2/sd@2,0
1. c0t3d0 <SUN72G cyl 14087 alt 2 hd 24 sec 424>
/pci@1f,0/pci@1,1/scsi@2/sd@3,0
Specify disk (enter its number): Specify disk (enter its number):

Checking replicas of the metadevice state database

root@solaris # metadb
flags first blk block count
M p 16 unknown /dev/dsk/c0t2d0s4
M p 8208 unknown /dev/dsk/c0t2d0s4
M p 16400 unknown /dev/dsk/c0t2d0s4
M p 16 unknown /dev/dsk/c0t2d0s5
M p 8208 unknown /dev/dsk/c0t2d0s5
M p 16400 unknown /dev/dsk/c0t2d0s5
a m p lu 16 8192 /dev/dsk/c0t3d0s4
a p l 8208 8192 /dev/dsk/c0t3d0s4
a p l 16400 8192 /dev/dsk/c0t3d0s4
a p l 16 8192 /dev/dsk/c0t3d0s5
a p l 8208 8192 /dev/dsk/c0t3d0s5
a p l 16400 8192 /dev/dsk/c0t3d0s5

Delete the metadevice state database replicas on the slices of the bad disk. First, slice 4

root@solaris # metadb -d /dev/dsk/c0t2d0s4
metadb: solaris: Bad address

root@solaris # metadb
flags first blk block count
M p 16 unknown /dev/dsk/c0t2d0s5
M p 8208 unknown /dev/dsk/c0t2d0s5
M p 16400 unknown /dev/dsk/c0t2d0s5
a m p lu 16 8192 /dev/dsk/c0t3d0s4
a p l 8208 8192 /dev/dsk/c0t3d0s4
a p l 16400 8192 /dev/dsk/c0t3d0s4
a p l 16 8192 /dev/dsk/c0t3d0s5
a p l 8208 8192 /dev/dsk/c0t3d0s5
a p l 16400 8192 /dev/dsk/c0t3d0s5

And then slice 5

root@solaris # metadb -d /dev/dsk/c0t2d0s5
metadb: solaris: Bad address

root@solaris # metadb
flags first blk block count
a m p lu 16 8192 /dev/dsk/c0t3d0s4
a p l 8208 8192 /dev/dsk/c0t3d0s4
a p l 16400 8192 /dev/dsk/c0t3d0s4
a p l 16 8192 /dev/dsk/c0t3d0s5
a p l 8208 8192 /dev/dsk/c0t3d0s5
a p l 16400 8192 /dev/dsk/c0t3d0s5
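
When more slices are affected, the list of bad replicas can be taken from the metadb output itself. A rough sketch that prints each slice whose block count shows as "unknown", as the failed disk did here; `bad_replica_slices` is an invented name, and the column layout is assumed to match the listings above.

```shell
# Read `metadb` output on stdin and print each slice that holds a replica
# whose block count is reported as "unknown".
bad_replica_slices() {
    awk 'NF >= 2 && $(NF-1) == "unknown" { print $NF }' | sort -u
}

# Usage (review the list before deleting anything):
#   metadb | bad_replica_slices
#   metadb | bad_replica_slices | while read slice; do metadb -d "$slice"; done
```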
