Configuring vSphere alarms for high availability

Hello,

We have three ESXi hosts in a cluster and, recently, one of the hosts failed (I have opened a support case with VMware to get help investigating why). The virtual machines on that host were automatically moved to the other two hosts in the cluster, but we did not know this had happened until we saw some VMs showing high CPU ready in vCOps.

My boss has asked me to set things up so that we are alerted whenever the high availability feature is used, as was the case here. I have looked on Google and the VMware website but can't find anything specific. I have only worked with VMware products for about a month, though, so it could well be that I'm simply not searching for the right thing.

I would be grateful for any help. As I said, I am very new to VMware, so I hope I have given all the information that is needed; if not, please let me know.

Cheers,

Ben.

There are quite a few default alarm definitions you can use. One of these may be useful; you can find them in the 'Alarms' tab.

Name                                          Description
----                                          -----------
Insufficient vSphere HA failover resources    Default alarm to alert when there are insufficient cluster resources for vSphere HA to guarantee failover
vSphere HA failover in progress               Default alarm to alert when vSphere HA is in the process of failing over virtual machines
Cannot find vSphere HA master agent           Default alarm to alert when vCenter Server has been unable to connect to a vSphere HA master agent for a prolonged period
vSphere HA host status                        Default alarm to monitor the health of a host as reported by vSphere HA
vSphere HA virtual machine failover failed    Default alarm to alert when vSphere HA failed to fail over a virtual machine
vSphere HA virtual machine monitoring action  Default alarm to alert when vSphere HA reset a virtual machine
vSphere HA virtual machine monitoring error   Default alarm to alert when vSphere HA failed to reset a virtual machine
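
vCenter can send a notification email itself as an alarm action, so a script is usually not needed. If the alert has to be built elsewhere (for example by a monitoring script), the following is a minimal sketch of composing such a message with Python's standard library. The cluster name and the addresses are placeholders, and the function name is hypothetical; only the alarm name comes from the list above.

```python
from email.message import EmailMessage

# Alarm name as listed above; everything else here is a placeholder.
FAILOVER_ALARM = "vSphere HA failover in progress"


def build_failover_alert(cluster: str, sender: str, recipients: list[str]) -> EmailMessage:
    """Build (but do not send) a notification for an HA failover."""
    msg = EmailMessage()
    msg["Subject"] = f"[vSphere HA] {FAILOVER_ALARM}: {cluster}"
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    msg.set_content(
        f"The alarm '{FAILOVER_ALARM}' was triggered on cluster {cluster}.\n"
        "Check which host failed and where its virtual machines were restarted."
    )
    return msg


if __name__ == "__main__":
    alert = build_failover_alert("Cluster01", "vcenter@example.com", ["ops@example.com"])
    print(alert)
    # To actually send it, pass the message to smtplib.SMTP(...).send_message(alert).
```

Keeping message construction separate from sending makes the alert text easy to check without a mail server.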

Tags: VMware

Similar Questions

  • Configuration of high availability.

    Hello

    Please help me configure high availability for an existing Foglight environment. Please send me the steps and the prerequisites.

    How many servers can exist in a cluster?

    How much capacity do we need on the primary server and on the other servers in case of a failure?

    We currently have 1 federation master and 3 child FMS.

    version: 5.6.10

    Thank you

    Vicky

    Vicky,

    There are 2 very useful field guides that go through the requirements and the setup process.

    High Availability Field Guide - http://edocs.quest.com/foglight/5610/doc/wwhelp/wwhimpl/common/html/frameset.htm?context=field&file=HA-field/index.php&single=true

    Federation Field Guide - http://eDOCS.quest.com/Foglight/5610/doc/wwhelp/wwhimpl/common/HTML/frameset.htm?context=field&file=Federation-field/index.php&single=true

    Note the following known issue:

    http://eDOCS.quest.com/Foglight/5611/doc/wwhelp/wwhimpl/common/HTML/frameset.htm?context=field&file=HA-field/overview.1.php&single=true

    "A Federation Master running in high availability mode is not supported. Only Federated Children can be run in high availability mode."

    Golan

  • How to set up a single-instance database for high availability and disaster tolerance

    Hi Experts,

    I have a single database and instance that I need to configure for high availability and disaster tolerance.

    What DR options are available for synchronizing the database to a remote site with a delay of 5 to 10 minutes?

    The application connects to the local site, and the DR site should be the remote site.

    1. Oracle Fail Safe with SAN?

    2. What is the integrated failsafe solution on Linux (CentOS/OEL)?

    3. If the storage is on the SAN (for example), is it possible to set up a shell script that detects when the source database has been down for 5 minutes, mounts the SAN-stored database files on the remote machine, and changes the IP in the application so that it no longer connects to the source IP address?

    Thank you and best regards,

    IVW

    Hello

    Failure can occur at any level.

    1. Oracle Fail Safe with SAN?

    --> Note that if the storage itself fails, Fail Safe will do nothing to bring back your data.

    --> Fail Safe only ensures that, when the MS cluster moves the disks and services to a different node, the configured services start.

    2. What is the integrated failsafe solution on Linux (CentOS/OEL)?

    --> On Linux, you need to set up scripts to run, and you can check the options in the OS cluster software.

    3. If the storage is on the SAN (for example), is it possible to set up a shell script that detects when the source database has been down for 5 minutes, mounts the SAN-stored database files on the remote machine, and changes the IP in the application so that it no longer connects to the source IP address?

    --> You get this from an OS cluster...

    Points to note:

    ================

    --> If there is a sudden power failure, writes to your data blocks may be lost, which will leave your data files and redo inconsistent.

    --> What if the problem is with the disks themselves?

    --> What if the problem affects your entire data center (how will you mount the file system on a remote server)?

    Note that what we are discussing is HA, and you are trying to keep a server idle all the time (a server with more RAM and CPU).

    Why not consider RAC, which can also be set up as an extended cluster...

    And to avoid data loss and meet the RPO and RTO at all times (even if the whole DC goes down, the storage fails, or a server crashes), you may need to use Oracle Data Guard...

    Ask if you have any questions.

    Thank you

  • ODI 11g high availability features

    Hello

    Is there a link to any official documentation describing how ODI can be configured for high availability and, more specifically, how agents and repositories behave in case of failure?

    I have seen the notes on how to configure load balancing between two or more agents.

    However, I would like to know what happens if an agent fails:

    • Is there a monitoring process that traps the failure and tries to restart the agent?
    • In this case, will the ODI load balancer seamlessly direct traffic only to the available agents?
    • What happens if the agent fails in the middle of executing a scenario step?
    • Will the database session be lost and rolled back?
    • Does the scenario stop, or will the remaining available agents take over the running process successfully?

    Similar questions apply to the repository itself: if the master or work repository database fails, what happens? Assuming the repository is installed on a multi-node cluster environment, if only one node fails, will every running scenario fail (and need to be restarted), or is there a transparent failover to the remaining nodes in the cluster hosting the ODI repositories?

    I have a requirement that the ETL (ODI) system must be available 99.5% of the year, so resilience is a key factor and I need to understand the capabilities.

    I need to know whether high availability in ODI simply means that a process is restarted if a node or agent fails, or whether an existing process will continue to completion thanks to a transparent failover.
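
    To put the 99.5% figure in perspective, here is a minimal sketch of the arithmetic. It assumes a 365-day year and treats all downtime alike (planned or unplanned); the function name is only illustrative. Under those assumptions, 99.5% leaves roughly 43.8 hours of downtime per year.

```python
HOURS_PER_YEAR = 365 * 24  # 8760, ignoring leap years


def allowed_downtime_hours(availability_percent: float) -> float:
    """Hours per year a system may be down and still meet the target."""
    return (100.0 - availability_percent) / 100.0 * HOURS_PER_YEAR


if __name__ == "__main__":
    hours = allowed_downtime_hours(99.5)
    print(f"99.5% availability allows about {hours:.1f} hours of downtime per year")
```

    Whether a restart-based recovery fits inside that budget depends on how often agents fail and how long each restart and rerun takes.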

    Cheers,

    John

    Take a look at the link below:

    http://www.rittmanmead.com/2012/03/deploying-ODI-11g-agents-for-high-availability-and-load-balancing/

    I hope this gives you a clear picture of the topic.

    Apart from this, we use EM to monitor the status of the agent.

    Cheers!

    SH! going

  • vSphere HA 'Election' fails at 99% with 'operation timed out' on 1 of 2 hosts

    Hello

    We had a system with 1 ESXi 5.1 host with local disks.

    Now we are adding redundancy by adding an ESXi 5.5 U2 host and a vCenter 5.5 appliance.

    After installing everything and adding it to vCenter, we upgraded the ESXi 5.1 host to ESXi 5.5 U2. The SAN works correctly (vMotion works on its own NIC).

    Now, if I try to enable high availability, both servers install the HA agent and start "Election".

    All the datastores (4) on the SAN are selected for HA heartbeating; the isolation response is "Leave powered on" by default.

    One server always completes this process, and the other keeps "electing" until it reaches 100% and fails with the error "operation timed out" on the election.

    I have seen this problem on both servers, so I think the elected 'master' does not have the problem, only the 'slave'.

    I have checked and followed these articles, but none of them worked:

    VMware KB: Reconfiguring HA (FDM) on a cluster fails with the error: Operation timed out

    - The services were running.

    VMware KB: Configuring HA in VMware vCenter Server 5.x fails with the error: Operation timed out

    - All MTUs were set to 1500.

    VMware KB: VMware High Availability configuration fails with the error: unable to complete the configuration of the ag of HA...

    - The default gateway was not the same on the two hosts, but I fixed that. There are no routing changes. The HA setting is "Leave powered on". After the correction and disabling/re-enabling HA, the problem is still the same.

    VMware KB: Verifying and reinstalling the correct version of the VMware vCenter Server agents

    - I ran "Reinstall ESX host management agents and HA on ESXi" for the HA agent, and I verified that it was uninstalled and reinstalled when HA was re-enabled.

    cp /opt/vmware/uninstallers/VMware-fdm-uninstall.sh /tmp
    chmod +x /tmp/VMware-fdm-uninstall.sh
    /tmp/VMware-fdm-uninstall.sh

    I did this for both hosts. This fixed the election problem and I was able to run an HA test successfully. After this test, however, when I powered off the 2nd server (to test HA in the other direction), HA did not fail over to host 1 and everything stayed down. After pressing 'Reconfigure HA', the election problem appeared again on host 1.

    Here are a few log extracts:

    - vSphere HA availability state of this host changed to Election info 29/11/2014 22:03 192.27.224.138

    - vSphere HA agent is healthy info 29/11/2014 22:02:56 192.27.224.138

    - vSphere HA availability state of this host changed to Master info 29/11/2014 22:02:56 192.27.224.138

    - vSphere HA availability state of this host changed to Election info 29/11/2014 22:01:26 192.27.224.138

    - vSphere HA agent is healthy info 29/11/2014 22:01:22 192.27.224.138

    - vSphere HA availability state of this host changed to Master info 29/11/2014 22:01:22 192.27.224.138

    - vSphere HA availability state of this host changed to Election info 29/11/2014 22:03:02 192.27.224.139

    - Alarm "vSphere HA host status" on 192.27.224.139 changed from Green to Red info 29/11/2014 22:02:58 192.27.224.139

    - vSphere HA agent for this host has an error: vSphere HA agent may not be properly installed or configured warning 29/11/2014 22:02:58 192.27.224.139

    - vSphere HA availability state of this host changed to Initialization Error info 29/11/2014 22:02:58 192.27.224.139

    - vSphere HA availability state of this host changed to Election info 29/11/2014 22:00:52 192.27.224.139

    - Datastore DSMD3400DG2VD2 is selected for storage heartbeating monitored by the vSphere HA agent on this host info 29/11/2014 22:00:49 192.27.224.139

    - Datastore DSMD3400DG2VD1 is selected for storage heartbeating monitored by the vSphere HA agent on this host info 29/11/2014 22:00:49 192.27.224.139

    - Firewall configuration has changed. Operation 'enable' for rule set fdm succeeded. info 29/11/2014 22:00:45 192.27.224.139

    - vSphere HA availability state of this host changed to Uninitialized info 29/11/2014 22:00:40 Reconfigure vSphere HA host 192.27.224.139 root

    - vSphere HA agent on this host is disabled info 29/11/2014 22:00:40 Reconfigure vSphere HA host 192.27.224.139 root

    - Reconfigure vSphere HA host 192.27.224.139 Operation timed out. root HOSTSERVER01 29/11/2014 22:00:31 29/11/2014 22:00:31 29/11/2014 22:02:51

    - Configuring vSphere HA 192.27.224.139 Operation timed out. System HOSTSERVER01 29/11/2014 21:56:42 29/11/2014 21:56:42 29/11/2014 21:58:55
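
    Extracts like these are easier to follow when sorted per host. The following is a minimal sketch of a parser for that purpose, not an official log format: it assumes each event line ends with a severity, a DD/MM/YYYY date, a time with optional seconds, and the host IP, as in the extracts above. The function names are illustrative, and lines that do not fit this shape (such as the two task lines) are skipped.

```python
import re
from datetime import datetime

# Assumed line shape, based on the extracts above:
#   <message> <severity> DD/MM/YYYY HH:MM[:SS] <host IP>
EVENT_RE = re.compile(
    r"^-?\s*(?P<message>.+?)\s+(?P<severity>info|warning|error)\s+"
    r"(?P<date>\d{2}/\d{2}/\d{4})\s+(?P<time>\d{2}:\d{2}(?::\d{2})?)\s+"
    r"(?P<host>\d{1,3}(?:\.\d{1,3}){3})",
    re.IGNORECASE,
)


def parse_event(line: str):
    """Return (timestamp, host, severity, message), or None if the line does not match."""
    m = EVENT_RE.match(line.strip())
    if m is None:
        return None
    # Some entries omit the seconds; treat them as :00.
    time_text = m["time"] if m["time"].count(":") == 2 else m["time"] + ":00"
    stamp = datetime.strptime(f"{m['date']} {time_text}", "%d/%m/%Y %H:%M:%S")
    return stamp, m["host"], m["severity"].lower(), m["message"]


def timeline(lines, host):
    """Events for one host, oldest first."""
    events = [e for e in map(parse_event, lines) if e and e[1] == host]
    return sorted(events, key=lambda e: e[0])


if __name__ == "__main__":
    sample = [
        "-vSphere HA agent is healthy info 29/11/2014 22:02:56 192.27.224.138",
        "-vSphere HA agent is healthy info 29/11/2014 22:01:22 192.27.224.138",
    ]
    for stamp, host, severity, message in timeline(sample, "192.27.224.138"):
        print(stamp.isoformat(), host, severity, message)
```

    Reading the events for 192.27.224.139 in chronological order makes the sequence (disabled, uninitialized, election, initialization error) easier to see than in the newest-first listing.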

    Can someone please help me with this?

    Or are there extra things I can check or provide?

    I'm currently running out of options.

    Best regards

    Joris

    P.S. I had problems with cold migration during the SAN implementation. After setting everything up (vMotion, ESX upgrade), these problems disappeared.

    When searching for this error, I came across this article: VMware KB: VMware vCenter Server displays the error message: unable to connect to the host

    This cause could make sense, since the vCenter Server was changed and the IP addressing was changed during the implementation.

    However, in the vpxa.cfg file, <hostip> and <serverip> are correct (verified using https://<hostip>/home).

    I tried this again today with no problems at all.

    P.P.S. I have set up several of these systems from scratch in the past without problems (whereas this one is an 'upgrade').

    OK, so the problem is solved.

    I contacted Dell ProSupport (the OEM providing the license); they checked the logs (fdm.log) and found that the default IP gateway could not be reached.

    The default gateway is the default host isolation address used by HA.

    Because this is an isolated production system, the gateway had been configured only for future use.

    I have now changed the default gateway to the management address of the switch that both hosts are plugged into, which does respond to ping requests.

    This solved the whole problem.
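
    Since the root cause was an isolation address that did not answer pings, a quick reachability check before enabling HA can save time. The following is a rough sketch under stated assumptions: it relies on a Unix-like `ping` that accepts `-c <count>`, the address is a placeholder, and the function name is hypothetical. It is meant to be run from a machine on the management network, not as a statement of how the HA agent itself performs the check.

```python
import subprocess


def isolation_address_reachable(address: str, run=subprocess.run) -> bool:
    """Send one ICMP echo request and report whether a reply came back.

    Assumes a Unix-like `ping` that accepts `-c <count>`; `run` is injectable
    so the check can be exercised without touching the network.
    """
    result = run(
        ["ping", "-c", "1", address],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


if __name__ == "__main__":
    gateway = "192.0.2.1"  # placeholder; use the hosts' default gateway
    state = "reachable" if isolation_address_reachable(gateway) else "NOT reachable"
    print(f"Isolation address {gateway} is {state}")
```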

  • vSphere high availability with 2 hosts

    Hi all

    I am trying to achieve the following using the Essentials Plus kit. I have 2 identical machines with 2 TB of storage each. Both are running ESXi, and I want to configure high availability and load balancing. From what I have read, VMware needs shared storage, i.e. some kind of SAN, if I want to use high availability, but I don't have such a machine. What I want to achieve is the following: VMware mirrors the disks on both machines (network RAID 1) and, if a hardware failure occurs on one of them, the other machine starts all the affected VMs. In the same way, VMware should do load balancing. From what I have read on the internet, VMware had VSA, but this has been discontinued. VSAN is not an option, because I don't have 3 machines. One possible option would be, for example, StarWind, but I would prefer an option from VMware itself.

    So in short: is there a way to configure high availability and load balancing with 2 hosts and no shared storage? Preferably without third-party software.

    Kind regards

    Sebastian Wehkamp

    Technically, you can use DRBD and Linux (or FreeBSD and HAST if you're on the dark side of the Moon) to create a fault-tolerant replicated block device between a pair of virtual machines running on the hypervisor nodes. Throw in a failover NFSv4 mount point on top and you have a nice VMware VM datastore. Tons of software-defined storage vendors have exactly this design inside their virtual storage appliances, if you would rather not build it yourself.

    However, as StarWind Virtual SAN is FREE for a 2-node setup in the VMware scenario (Hyper-V licensing is different, if you care) and can be run on the free Hyper-V Server (no need to pay for Windows licenses), you MAY end up with a faster route by going with StarWind. It depends on what feature set you want (iSCSI? NFS? SMB3? Inline dedupe? Cache?) and which forum you prefer for public support questions.

    --

    Thank you for your response. I thought that, since the setups are exactly the same, the secondary machine would easily be able to start the lost VMs. For synchronization, I thought it would be something like DRBD: when the secondary host is unable to connect to the other one, it becomes primary. This works in the case of a hardware failure; in the case of a network failure, a split brain will occur that must be resolved manually. I will play with replication and reconsider the StarWind software.

  • vCO 5.5 high availability / load balancer configuration

    I am planning a new deployment of vCO 5.5 and would like to take advantage of the new high availability features, but the documentation seems to be a little lacking in the load balancer department. I hope someone knows the answers to the following questions so that I can zero in on possible load balancing configurations:

    Are calls to vCO instances stateful, or can I use a stateless round-robin setup for the LB configuration? Or do I need to support sticky sessions?

    Is there a health check endpoint that the load balancer can query to automatically take failed instances out of service in an active/active configuration? If there isn't a specific one, is there a good check that can be made, or a service I could create, to act as a health check?

    Has anyone already worked with a similar configuration and is willing to share their experience?

    Hello

    You can read the following article which explains how to configure Nginx as the load balancer. I hope it's useful.

    VMware KB: Configuring Nginx as load balancing software for VMware vCenter Orchestrator 5.5

    Kind regards

    Radostin

  • vSphere high availability with no shared storage? And general problems with a VMware partner

    Can HA function without shared storage?

    According to the vSphere Availability manual, it cannot. However, the global VMware partner who sold me on the VMware solution said that shared storage is not required for HA.

    This is the same guy who told me that I would not need to buy Windows Server licenses because everything was included in the package (vSphere Essentials Plus). Now I have no Windows license, no shared storage, and a customer who will not be happy that we did not include these costs in the quote for this project.

    Can HA function without shared storage?

    No... you need shared storage for HA.

    This is the same guy who told me that I would not need to buy Windows Server licenses because everything was included in the package (vSphere Essentials Plus). Now I have no Windows license, no shared storage, and a customer who will not be happy that we did not include these costs in the quote for this project.

    Maybe your partner was talking about the vCenter Server Appliance, which is included in vSphere Essentials Plus. You can use this appliance to manage your vSphere/ESXi environment without needing a Windows VM (with a Windows license) on which to install vCenter Server.

  • Advice on high availability of components in a vWorkspace design

    Hi all

    I would like to ask for some advice on designing vWorkspace components for high availability. Suppose the vWorkspace components are deployed on vSphere or SCVMM-managed hypervisors, so HA is in place in case of a host failure. In this situation, do we still need redundant (N+1 VMs) vWorkspace components?

    On another note, I understand that we can add multiple connection brokers for vWorkspace in the vWorkspace Management Console and that, based on KB 99163, it would just work. I'm not sure how the traffic would flow when the request comes from Web Access. As in, I guess the new connection broker would have to be 'defined' for Web Access to call it? Or is this done automatically? Would Web Access choose randomly which connection broker to go to?

    Thanks for any advice in advance

    Kind regards

    Cyril

    Hi Cyril,

    Great questions. As with any layered IT architecture, you must plan HA and redundancy at all points of failure, as required by your environment or your service level agreements (SLAs). For vWorkspace, the center of its universe is SQL, and you must plan for its failure and recovery accordingly. In some environments, full backups can meet the HA requirement. In others, full SQL clustering, mirroring, replication, or AlwaysOn configurations may be required. For our broker, we recommend an N+1 deployment in most HA scenarios. As you move to peripheral or enabling components, you must evaluate each component's need and the impact of its failure, as well as its cost, to determine the appropriate level of HA.

    Load balancing between multiple brokers is done automatically by logic in the client connectors. In the case of Web Access, when you configure the Web Access site in the Management Console, it includes the broker list in the Web Access configuration XML file. Like the client connectors, Web Access includes load-balancing logic that automatically distributes the client load across the available brokers.

    If you have any questions about specific components and their HA requirements or architecture, please add them to the discussion.

  • WLC 5508 high availability

    Hello

    Today I have two WLC 5508s (each licensed for 100 APs) on a single site.

    The WLCs work in high availability (active-standby).

    However, we have a new scenario with 2 sites: A and B (see attachment).

    I would like to know if it is possible to work as follows:

    WLC-A as the primary controller for site A, with WLC-B as backup for WLC-A.

    WLC-B as the primary controller for site B, with WLC-A as backup for WLC-B.

    For example:

    If WLC-A goes down, the site A access points are managed by WLC-B at site B, and vice versa.

    Is this possible?

    How can I configure the new scenario? Note that there is a site-to-site link between Site A and Site B.

    Another point:

    If I add 50 more APs at Site A, how does the licensing work?

    Should I buy a license for both WLCs?

    Thanks,

    > ...

    > ... is this possible?

    No. High availability, in controller terms, is meant to be exactly what it says: the backup controller is purely a standby and cannot play other roles.

    M.

  • High availability deployment of IPCC 4.5

    In a future help desk architecture implementation, voice service will be provided by CallManager 5.0, which will integrate with IPCC 4.5. IPCC 4.5 (required with CM 5.0) does not implement high availability. How can we ensure that the help desk continues to operate if IPCC goes down? One possibility might be to configure CM such that, if IPCC goes down, all calls to the help desk number are automatically and immediately routed to a hunt group (which includes all help desk extensions). Can this redirection be configured in CM? Is there a better option?

    Thanks in advance,

    SB

    This is your best bet. On the route points for your call center, just set Forward Busy, Forward No Answer, and Forward on Failure to the hunt pilot. That way, when the IPCC Express server is down, calls will be sent to your hunt pilot.

    Please rate helpful posts.

    adignan - berbee

  • Two WLC 5508 anchors in high availability

    Hello.

    Is it possible to use 2 WLC 5508s as anchors in an active scenario?

    For example, if one WLC goes out of service, does the other keep providing service to the anchor clients?

    At the moment we have just one WLC 5508 in anchor mode. What do I have to configure for high availability of the anchor?

    Thank you very much!

    You can have redundant WLCs as anchors, but if an anchor fails, the user must reconnect.

    There is an HA feature on the WLC, but it is mainly for foreign WLC redundancy, not anchor redundancy. With several guest anchors, the foreign WLCs balance the load between them. You will not be able to designate a primary or a backup.

    Sent from Cisco Technical Support iPhone App

  • NAC Manager high availability: peer CAM DEAD

    Hello

    I have two NAC Managers in high availability, and I used the eth1 interface on both sides as the heartbeat link.

    I performed the following steps for high availability:

    (1) Synchronized the time between the two CAMs.

    (2) Generated a temporary SSL certificate on each CAM and exported/imported it into the other.

    (3) Made one CAM the primary and the other the secondary.

    But after all this configuration, under Monitoring > Reports on both servers I can see that the primary CAM is up and the redundant CAM is down.

    Also, on the Failover tab, I can see Local CAM: OK [Active] and Peer CAM: DEAD.

    I have attached some screenshots so that you can see the same.

    Your help will be very appreciated.

    Thank you

    Try these steps and check that all steps were followed:

    http://www.Cisco.com/c/en/us/support/docs/security/NAC-appliance-clean-access/99945-NAC-cam-HA.html

  • Cisco fabric interconnect high availability

    Hi Experts,

    I would like to know:

    (1) What are the fabric interconnect cluster ports (L1, L2) used for in high availability?

    (2) Can we form a cluster from a 6100 fabric interconnect and a 6200 fabric interconnect? If so, how do they exchange state, since the two are completely different hardware?

    Thanks in advance,

    Jako Raj

    Hello

    (1) The L1 and L2 ports are used for management purposes only: basically to synchronize management data and to trigger a possible failover.

    (2) Different hardware platforms are supported in the same cluster only during a hardware upgrade process. What actually happens in the meantime is that the configuration is copied and the new FI is promoted to primary. Although it works as a cluster, it is an active-standby cluster, where a single FI acts as primary.

    Kind regards

  • ASA 5520 high availability

    I have two ASA 5520s. One has an IDS card and the other doesn't. This makes the high availability wizard fail. Can I configure high availability manually? I don't really need two ASA-SSM-20s. I just want to have one ASA in standby mode. Is this possible? Does anyone have a similar configuration?

    Thank you

    Alex Pfeil

    The hardware must match. If you want failover, either remove the SSM from the primary or add one to the secondary.

    Sent from Cisco Technical Support iPhone App
