Tag: lscfg

Replacing a disk in AIX

IBM technician informed that disk on DQD 82MC-P3-D5 needs to be replaced.

Server is ESERVER P5 570 9117 with 2 LPARs

aix6
aix7

Listing disks

root@aix6:/ # lsdev -Cc disk
hdisk0 Available 08-08-00 SAS Disk Drive
hdisk1 Available 08-08-00 SAS Disk Drive
hdisk2 Available 08-08-00 SAS RAID 5 Disk Array
hdisk3 Available 0D-08-00 SAS RAID 5 Disk Array
hdisk4 Available 01-00-02 MPIO FC 2145
hdisk21 Available 01-00-02 MPIO FC 2145
hdisk22 Available 01-00-02 MPIO FC 2145
hdisk44 Available 01-00-02 MPIO FC 2145
hdisk55 Available 01-00-02 MPIO FC 2145
hdisk56 Available 01-00-02 MPIO FC 2145
hdisk57 Available 01-00-02 MPIO FC 2145

Trying to find disk with Hardware Location Code DQD 82MC-P3-D5

root@aix6:/ # lsdev -Cc disk | awk ‘{print “lscfg -vpl “$1’}
lscfg -vpl hdisk0
lscfg -vpl hdisk1
lscfg -vpl hdisk2
lscfg -vpl hdisk3
lscfg -vpl hdisk4
lscfg -vpl hdisk21
lscfg -vpl hdisk22
lscfg -vpl hdisk44
lscfg -vpl hdisk55
lscfg -vpl hdisk56
lscfg -vpl hdisk57

root@aix6:/ # lsdev -Cc disk | awk ‘{print “lscfg -vpl “$1” | grep \”Hardware Location Code\””‘} | sh
Hardware Location Code……U789D.001.DQD60VA-P3-D5
Hardware Location Code……U789D.001.DQD60VA-P3-D6

Found on LPAR aix7

root@aix7:/root # lsdev -Cc disk | awk ‘{print “lscfg -vpl “$1” | grep \”Hardware Location Code\””‘} | sh
Hardware Location Code……U789D.001.DQD82MC-P3-D1
Hardware Location Code……U789D.001.DQD82MC-P3-D2
Hardware Location Code……U789D.001.DQD82MC-P3-D3
Hardware Location Code……U789D.001.DQD82MC-P3-D4
Hardware Location Code……U789D.001.DQD82MC-P3-D5
Hardware Location Code……U789D.001.DQD82MC-P3-D6

root@aix7:/root # lscfg -vpl hdisk4
hdisk4 U789D.001.DQD82MC-P3-D5 SAS Disk Drive (450000 MB)

Manufacturer…………….IBM
Machine Type and Model……ST3450856SS
FRU Number………………44V4432
ROS Level and ID…………45363134
Serial Number……………3QQ0KZEP
EC Level………………..L35857
Part Number……………..44V4433
Device Specific.(Z0)……..000005329F001002
Device Specific.(Z1)……..1217E614
Device Specific.(Z2)……..0021
Device Specific.(Z3)……..000-0
Device Specific.(Z4)……..
Device Specific.(Z5)……..22
Device Specific.(Z6)……..L35857
Hardware Location Code……U789D.001.DQD82MC-P3-D5

PLATFORM SPECIFIC

Name: disk
Node: disk
Device Type: block

Before replacing the disk using diag, remove disk definition otherwise you will receive the error: The device is currently in use and cannot be removed

root@aix7:/root # rmdev -dl hdisk4
hdisk4 deleted

Run diag

root@aix7:/root # diag
DIAGNOSTIC OPERATING INSTRUCTIONS VERSION 7.1.3.15 801001

LICENSED MATERIAL and LICENSED INTERNAL CODE – PROPERTY OF IBM
(C) COPYRIGHTS BY IBM AND BY OTHERS 1982, 2014.
ALL RIGHTS RESERVED.

These programs contain diagnostics, service aids, and tasks for
the system. These procedures should be used whenever problems
with the system occur which have not been corrected by any
software application procedures available.

In general, the procedures will run automatically. However,
sometimes you will be required to select options, inform the
system when to continue, and do simple tasks.

Several keys are used to control the procedures:
– The Enter key continues the procedure or performs an action.
– The Backspace key allows keying errors to be corrected.
– The cursor keys are used to select an option.

Press the F3 key to exit or press Enter to continue.

FUNCTION SELECTION 801002

Move cursor to selection, then press Enter.

Diagnostic Routines
This selection will test the machine hardware. Wrap plugs and
other advanced functions will not be used.
Advanced Diagnostics Routines
This selection will test the machine hardware. Wrap plugs and
other advanced functions will be used.
Task Selection (Diagnostics, Advanced Diagnostics, Service Aids, etc.)
This selection will list the tasks supported by these procedures.
Once a task is selected, a resource menu may be presented showing
all resources supported by the task.
Resource Selection
This selection will list the resources in the system that are supported
by these procedures. Once a resource is selected, a task menu will
be presented showing all tasks that can be run on the resource(s).

TASKS SELECTION LIST 801004

From the list below, select a task by moving the cursor to
the task and pressing ‘Enter’.
To list the resources for the task highlighted, press ‘List’.

Run Diagnostics
Run Error Log Analysis
Run Exercisers
Display or Change Diagnostic Run Time Options

Add Resource to Resource List
Automatic Error Log Analysis and Notification
Backup and Restore Media
Certify Media
Change Hardware Vital Product Data
Configure Platform Processor Diagnostics
Delete Resource from Resource List
Disk Maintenance
Display Configuration and Resource List
Display Firmware Device Node Information
Display Hardware Error Report
Display Hardware Vital Product Data
Display Multipath I/O (MPIO) Device Configuration
Display Previous Diagnostic Results
Display Resource Attributes
Display Service Hints
Display Software Product Data
Display or Change Bootlist
Format Media
Gather System Information
Hot Plug Task
IBM Tape Drive Service Aids
This selection provides a set of tools to perform utility functions on
Tape and Medium Changer Devices supported by the Atape Device Driver.
Identify and Attention Indicators
Log Repair Action
Microcode Tasks
Periodic Diagnostics
RAID Array Manager
Update Disk Based Diagnostics

Hot Plug Task 801004

Move cursor to desired item and press Enter.

PCI Hot Plug Manager
SCSI and SCSI RAID Hot Plug Manager

SCSI and SCSI RAID HOT PLUG MANAGER 802481

The following operations can be performed on devices attached to
SCSI and SCSI RAID Hot Plug Enclosure devices.

Make selection, use Enter to continue.

List Hot Swap Enclosure Devices
This selection lists all scsi hot swap slots and their contents.
Identify a Device Attached to a SCSI Hot Swap Enclosure Device
This selection sets the Identify indication.
Attach a Device to an SCSI Hot Swap Enclosure Device
This selection sets the Add indication and prepares
the slot for insertion of a device.
Replace/Remove a Device Attached to an SCSI Hot Swap Enclosure Device
This selection sets the Remove indication and prepares
the device for removal.
Configure Added/Replaced Devices
This selection runs the configuration manager on the
parent adapter where devices have been added or replaced.

REMOVE OR REPLACE DEVICE ATTACHED TO A SCSI HOT SWAP ENCLOSURE DEVICE 802485

The following is a list of configured, unconfigured and populated
SCSI Hot Swap Enclosure device slots. Select a slot to remove or
replace the device attached to that slot.
ENSURE THAT NO OTHER HOST IS USING THE DEVICE BEFORE REMOVING IT.

Make selection, use Enter to continue.

U789D.001.DQD50B4-
ses0 P4-Y1
slot 1 P4-D1 cd0

U789D.001.DQD82MC-
ses1 P3-Y2
ses2 P3-Y1
slot 1 P3-D1 hdisk0
slot 2 P3-D2 hdisk1
slot 3 P3-D3 hdisk2
slot 4 P3-D4 hdisk3
slot 5 P3-D5 [populated]
slot 6 P3-D6 hdisk5

Tell the technician to replace the disk. After disk is replaced, press Enter

The LED should be in the Remove state for the selected device. You may now remove or replace the device. Use ‘Enter’ to indicate you are finished.

Instead of showing populated, it will show the disk name

REMOVE OR REPLACE DEVICE ATTACHED TO A SCSI HOT SWAP ENCLOSURE DEVICE 802485

The following is a list of configured, unconfigured and populated
SCSI Hot Swap Enclosure device slots. Select a slot to remove or
replace the device attached to that slot.
ENSURE THAT NO OTHER HOST IS USING THE DEVICE BEFORE REMOVING IT.

Make selection, use Enter to continue.

U789D.001.DQD50B4-
ses0 P4-Y1
slot 1 P4-D1 cd0

U789D.001.DQD82MC-
ses1 P3-Y2
ses2 P3-Y1
slot 1 P3-D1 hdisk0
slot 2 P3-D2 hdisk1
slot 3 P3-D3 hdisk2
slot 4 P3-D4 hdisk3
slot 5 P3-D5 hdisk4
slot 6 P3-D6 hdisk5

Source:
Replacing a disk drive with the AIX system or logical partition powered on
Replacing a SCSI disk drive

UXMON: AIX syslog alarm: TAPE DRIVE FAILURE with ID 5537AC5F

Node : aix.setaoffice.com
Node Type : IBM RS/6000 64 HTTPS
Severity : warning
OM Server Time: 2017-01-21 12:03:23
Message : UXMON: AIX syslog alarm: TAPE DRIVE FAILURE with ID 5537AC5F
Msg Group : OS
Application : syslog
Object : 5537AC5F
Event Type :
not_found

Instance Name :
not_found

Instruction : Has been detected an alarm in the AIX errpt module
The annotation of this case will show the template description of this ID
If you feel this event is useless you can filter out it using the uxmonsyslog.cfg, see this
same file and/or documentation for details

Checking device status

root@aix:/ # lsdev -Cc tape | grep rmt109
rmt109 Available 0I-00-02 IBM 3580 Ultrium Tape Drive (FCP)

Checking device information

root@aix:/ # lscfg -vpl rmt109
rmt109 U789D.001.DQD16LD-P1-C1-T1-W50050763004BE309-L0 IBM 3580 Ultrium Tape Drive (FCP)

Manufacturer…………….IBM
Machine Type and Model……ULT3580-TD5
Serial Number……………00078AE800
Device Specific.(FW)……..F990

PLATFORM SPECIFIC

Name: tape
Node: tape
Device Type: byte

In our environment rmt109 is part of a tape library. BUR team needs to check tape W50050763004BE309

Checking Disk Array status in AIX

This AIX server has a disk array connected.

root@aix:/ # lscfg -vl hdisk2
hdisk2           U0.1-P2-I3/Z3-A0  SCSI RAID 5 Disk Array

The array has 7 physical disks
root@aix:/ # lsdev -C | grep pdisk

pdisk0         Available 1A-08-01-0,0  Physical SCSI Disk Drive
pdisk1         Available 1A-08-01-1,0  Physical SCSI Disk Drive
pdisk2         Available 1A-08-01-2,0  Physical SCSI Disk Drive
pdisk3         Available 1A-08-01-3,0  Physical SCSI Disk Drive
pdisk4         Available 1A-08-01-4,0  Physical SCSI Disk Drive
pdisk5         Available 1A-08-01-5,0  Physical SCSI Disk Drive
pdisk6         Available 1A-08-01-6,0  Physical SCSI Disk Drive

Connected through a PCI-X SCSI RAID adapter
root@aix:/ # lscfg -vl sisioa0

sisioa0          U0.1-P2-I3  PCI-X Dual Channel U320 SCSI RAID Adapter
PCI-X Dual Channel Ultra320 SCSI RAID Adapter:
Part Number……………..97P3960
FRU Number………………97P3960
Serial Number……………YL10C4077074
Manufacture ID…………..000C
EC Level………………..0
ROM Level.(alterable)…….0709000c
Product Specific.(Z0)…….5703
Product Specific.(Z1)…….1
Device Specific.(YL)……..U0.1-P2-I3

When we checked the status we saw that the array was running in degraded state.
root@aix:/ #sisraidmgr -L -j1 -l sisioa0

————————————————————————
Name      Location      State       Description        Size
————————————————————————
sisioa0   1A-08         Available   PCI-X Dual Channel U320 SCSI RAID Adapter
scsi2    1A-08-00-07,0 NoLink      No remote adapter target
scsi3    1A-08-01-07,0 NoLink      No remote adapter target
hdisk2    1A-08-ff-0,0  Degraded RAID 5 Array     428.0GB
pdisk0   1A-08-01-0,0  Active      Array Member      71.4GB
pdisk1   1A-08-01-1,0  Active      Array Member      71.4GB
pdisk2   1A-08-01-2,0  Active      Array Member      71.4GB
pdisk3   1A-08-01-3,0  Active      Array Member      71.4GB
pdisk4   1A-08-01-4,0  Active      Array Member      71.4GB
pdisk5   1A-08-01-5,0  Active      Array Member      71.4GB
pdisk6   1A-08-01-6,0  Active      Array Member      71.4GB
hdisk0    1A-08-00-10,0 Available   16 Bit LVD SCSI  146.8GB
hdisk1    1A-08-00-11,0 Available   16 Bit LVD SCSI  146.8GB

AIX – Some tape commands

Getting information about the tape drive.

root@aix:/ # lscfg -vpl rmt0
rmt0             U787E.001.AAA2714-P2-C1-T2-L0-L0  4.0 GB 4mm Tape Drive
Manufacturer…………….ARCHIVE
Machine Type and Model……IBM4326NP/RP  !D
Device Specific.(Z1)……..5ALG
Serial Number……………DR0N6QG 0011
Load ID…………………0011
Part Number……………..21H5154
FRU Number………………59H3481
EC Level………………..E30312
Device Specific.(Z0)……..0180020283000018
Device Specific.(Z3)……..L1
PLATFORM SPECIFIC
Name:  st
Device Type:  byte

Getting information about the tape. It is using compression and a variable length block size.

root@aix:/ # tctl -f /dev/rmt0 status
rmt0 Available 06-08-01-0,0 4.0 GB 4mm Tape Drive
ttribute  value description                          user_settable
block_size 0     BLOCK size (0=variable length)       True
compress   yes   Use data COMPRESSION                 True
mode       yes   Use DEVICE BUFFERS during writes     True
ret_error  no    RETURN error on tape change or reset True

Rewinding tape drive

root@aix:/ # tctl -f /dev/rmt0 rewind

Reading tape and redirecting output to a file. In this case, the operating system showed that the tape itself is damaged.

root@aix:/ # tctl -f /dev/rmt0 read > /tmp/testtape
tctl: 0511-578 Cannot read 512 bytes: The media surface is damaged.