Checking the LUNS when clusterware arises

Version of the grid: 11.2.0.3.6

Platform: Oracle Enterprise Linux 6.2

In our 2 bow TIE, Node2 got expelled. Once started Node2 CRS did not start. I couldn't find anything significant in the alert.log, ocssd.log or crsd.log grid.

In node2, I was able to do fdisk-l on all LUNS in the diskgroup OCR_VOTE. After a few hours of headaches and escalations, we discovered that LUNS were not really accessible to the clusterware in Node2, although fdisk-l has been correctly showing the partition.

When the cluster is down, I wanted to check if the drive to vote was really accessible to the CRS (GI), but I couldn't (as shown below).

#. / crsctl start crs

CRS-4640: Oracle high availability Services is already active

CRS-4000: Start command has failed, or completed with errors.

#. / crsctl query css votedisk

Unable to communicate with the Cluster synchronization service daemon.

How can I check if the drive to vote is accessible to the RSC in a knot when the CRS is down?

There are 2 layers necessary for the start of the CRS.

Storage. IMO, multipath is required for managing the storage of cluster at the physical level. To check if the storage is available, use the multipath-l for a list of the devices command. I usually use multipath-l | grep to list the LUN, key word matches the LUN entry. For example

[root@xx-rac01 ~] # multipath-l | grep VRAID | Sorting
VNX-LUN0 (360060160abf02e00f8712272de99e111) dm-8 DGC, VRAID
VNX-LUN1 (360060160abf02e009050a27bde99e111) dm-3 DGC, VRAID
VNX-LUN2 (360060160abf02e009250a27bde99e111) dm-9 DGC, VRAID
VNX-LUN3 (360060160abf02e009450a27bde99e111) dm-4 DGC, VRAID
VNX-LUN4 (360060160abf02e009650a27bde99e111) dm-0 DGC, VRAID
VNX-LUN5 (360060160abf02e009850a27bde99e111) dm-5 DGC, VRAID
VNX-LUN6 (360060160abf02e009a50a27bde99e111) dm-1 DGC, VRAID
VNX-LUN7 (360060160abf02e009c50a27bde99e111) dm-6 DGC, VRAID
VNX-LUN8 (360060160abf02e009e50a27bde99e111) dm-2 DGC, VRAID
VNX-LUN9 (360060160abf02e00a050a27bde99e111) dm-7 DGC, VRAID

If the number of LUNS is bad and one or more LUNS are missing, I would check the logs to begin. You can also run a multichannel flush and rediscovering (and up to the fi level of verbosity errors are thrown).

If all LUNS are there, check the device permissions and make sure that the pile of s/w Oracle has access.

The other layer which needs to work is the interconnection. There are 2 basis points to check. The interface for local interconnection exist? This can be verified using ifconfig. And communicate this interface for interconnection with interfaces for interconnection of the other cluster node? This can be checked by using ping - or if Infiniband is used, via the ibhost and other orders of ib.

So if the CRS does not start - these 2 controls (storage and interconnection) would be my first port of call, as in my experience most of the time one of these 2 layers failed.

Tags: Database

Similar Questions

Maybe you are looking for

  • 0F00:133 C error code

    I have error codes 0F00:133 C, 5 d 00:0009, 4400:011 A can theses errors fixed

  • Administrator has restricted this computer to access the USB/1394 mass storage devices

    This message just started showing unexpectedly. Sometimes windows load. It is not and has never been a business machine, so I am the administrator. He has blocked my external hard drive up and now won't let me even to read a memory stick. In this cas

  • What brand of tuner TV should I buy?

    MSI TV@nywhere Plus card TV Tuner FOR WINDOWS 7 I am from pakistan and I want to buy a new tuner tv card for windows 7, so please tell me the brand of tv tuner card

  • HTTP POST will fail with the status code 400

    Hello I am trying to DISPLAY some data but my code still fails with code 400. I tried to change the URL and the method and adding different headers but I get the same answer. What I am doing wrong? My code is below. HttpConnection connection = null;

  • 2.0 Simulator of (for Android)

    Version 2.0 Beta of the Simulator seems to be available for developers of Android applications: https://bdsc.webapps.BlackBerry.com/developer/Android/tool Is this common Simulator throughout software development kits that can be used to check applica