In a SAS Metadata Server cluster, you might encounter a situation where a slave node is reported to be a part of the quorum, but it does not receive redirection requests from the master node.
This situation is most likely to occur when Enterprise Grid Orchestrator (EGO) is used to start the metadata server cluster nodes or if the metadata servers on each node are all started at approximately the same time. This problem occurs on start-up of the metadata servers when they are trying to connect to each other in order to determine which server will become the master and which server will be the slave.
The issue might also occur when a quorum loss occurs during normal run time. After the quorum loss, if two cluster members both connect to a third cluster member at almost the same time, this situation might occur where one slave joins the quorum without receiving future redirection requests from the master node.
In either situation, the following messages will be seen in the metadata server log of one of the nodes in the metadata server cluster:
This problem might be hard to detect at run time, because it requires a review of the metadata server logs. To determine whether you are encountering this problem, you must check the metadata server master node log to see whether redirect requests are going to all slaves or to only a subset of slave nodes.
When this problem occurs, the following message is logged to the metadata server master node log file (SASMeta_MetadataServer_date_host_pid.log) when the connection request is redirected to one of the slave nodes. In this case, the connection request is redirected to server "SASMeta - Metadata Server Node 3":
Under normal operating conditions, you should also see redirect requests go to the other slave node. Based on the log message above, the other slave node that receives redirection requests might be "SASMeta - Metadata Server Node 2" or "SASMeta - Metadata Server" depending on who the master is.
If you see that ALL of the redirection requests are going to only ONE slave node and not to the other, it is possible that you are encountering the problem described in this note.
To work around this problem, allow more time in between start-up of metadata server cluster nodes. This extra time prevents the condition where two nodes could encounter the error during start-up.
Click the Hot Fix tab in this note to access the hot fix for this issue.
Product Family | Product | System | Product Release | SAS Release | ||
Reported | Fixed* | Reported | Fixed* | |||
SAS System | SAS Metadata Server | Microsoft® Windows® for x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 |
Microsoft Windows 8 Enterprise 32-bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8 Enterprise x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8 Pro 32-bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8 Pro x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8.1 Enterprise 32-bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8.1 Enterprise x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8.1 Pro | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 8.1 Pro 32-bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows 10 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2008 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2008 R2 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2008 for x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2012 Datacenter | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2012 R2 Datacenter | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2012 R2 Std | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Microsoft Windows Server 2012 Std | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Enterprise 32 bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Enterprise x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Home Premium 32 bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Home Premium x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Professional 32 bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Professional x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Ultimate 32 bit | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Windows 7 Ultimate x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
64-bit Enabled AIX | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
64-bit Enabled Solaris | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
HP-UX IPF | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Linux for x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 | ||
Solaris for x64 | 9.4 | 9.4_M4 | 9.4 TS1M2 | 9.4 TS1M4 |
A fix for this issue for Base SAS 9.4_M3 is available at:
https://tshf.sas.com/techsup/download/hotfix/HF2/V01.html#57428A fix for this issue for Base SAS 9.4_M2 is available at:
https://tshf.sas.com/techsup/download/hotfix/HF2/R19.html#57428Type: | Problem Note |
Priority: | medium |
Date Modified: | 2016-05-16 16:28:48 |
Date Created: | 2016-01-14 12:10:59 |