Foglight alarm

Hello

Please advice on these alarms that we receive "service is unavailable" agent netmonitor

that more often than not is not true. When you browse our alarms, that's what we find

These screenshots show the alarms generated by the agent of LogFilter, which monitors the server.log to find errors.

This is different from a message like "service is unavailable" to an alarm of agent NetMonitor, that ping one set of devices and you alerts if a target device is not available, or if there is packet loss on the network.

Kind regards

Brian Wheeldon

Tags: Dell Tech

Similar Questions

  • Foglight alarms automatically erase themselves?

    Hello

    I'm new to Foglight and I have a general question.

    Foglight alarms disappear automatically themselves once a problem is solved?

    For example, if I get an alarm saying that the memory usage is high on a server and I'm going in and kill a process on this server to lower it, will get deleted automatically alarm or do I still go and delete it?

    Please let me know.

    Thank you

    Tony

    Multi-gravite rules generate alerts that will be automatically clear if the State goes away and the State of gravity goes back to normal. Also, the severities change automatically if an alarm has reached another gravity as an alarm becoming a fatal alarm.

    Simple rules such as the rules for the LogFilter agent will not automatically clear.

  • Foglight - alarm SQL

    All,

    Any idea on how I can determine how the SQL cartidge pulls this inforamtion? Network management team, State of DBA and Windows Guy there is no problem. Are there any SQL query or perfmon I should look?

    Foglight cartridge for SQL Server generated an alarm for example on the SRWSQL1 host. POSNETPROD.COM.

    Alarm:

    DBSS - SQL Packages error rate (Group message: network)

    Severity:

    Warning

    Message:

    The instance of SQL Server is 1,00 errors in packets per second.

    Created the:

    Kill Mar 27 22:12:13 UTC 2012

    The metric is a Global Variable, SQL Server, and can be captured in the database instance by running the following command in SQL Server:

    SELECT @PACKET_ERRORS

    The returned value is cumulative since the last SQL Server restart, so Foglight performs a delta to indicate errors in packets since the previous collection.  Here's an MSDN article that explains the variable.

    http://msdn.Microsoft.com/en-us/library/ms190343.aspx

    Data are collected by the collection of Global Variables in SQL Server that is integrated into the SQL Server cartridge.

    SQL Server Global Variables

    This collection provides SQL Server General settings, such as i/o, networking and database operating time.

    Type

    SQL Server.

    Collection sampling intervals

    Frequency Mode

    Interval (in seconds) collection

    In real time

    20

    Online

    60

    In offline mode

    300

    Parameters in the collection

    Display name

    Description

    Availability of instance

    The relative length of the instance of SQL Server runs during the current interval. If the instance is down, no data collection occurs. Thus, if this metric value is less than the length of the interval, the value of this measure reflects only the active portion of the interval.

    (100 * instance times) / (length of the interval)

    Instance unavailable

    The relative amount in which the instance of SQL Server has decreased during the current interval. If the instance is down, no data collection occurs.

    (100 * (Instance-length of time interval)).

    (Length of the interval)

    IO errors

    The number of i/o errors encountered by SQL Server.

    Packages pending

    Which instance rate packets are being received by SQL Server client applications.

    Instance of Out Packets

    The rate at which packets are sent SQL Server for client applications.

    Packages of instance errors

    The rate at which SQL Server errors of network packets.

  • Foglight - alarms have http instead of https

    All,

    Alarms have a hyper link in the body which is http. Our FMS works well on https. Is there a setting I'm missing somewhere?

    Thank you

    -Daniel

    Daniel

    Locate the CATALYST_URL registry variable and change it to be the correct link for example a https port number

    David Mendoza

    Foglight Consultant

  • In a tool such as Foglight events about APM-driven alerts

    Hi all

    Our company is currently having an internal debate whether Foglight can replace IBM Netcool Omnibus.  I had a conference call with Gartner last week to discuss the surveillance as a whole and one gentlemen on the phone said that "Foglight can replace Omnibus' which has left me a bit confused because I know that we have had these discussions with Dell, who said that they are not a replacement for Netcool Omnibus. Unfortunately, we ran out of time and I couldn't get their probe on this statement.

    We currently use the functionality of Netcool Omnibus in 5 ways that are:

    Event command line injection

    There is a binary called postzmsg which is drawn from the command line with parameters that contain the severity, summary, on call group and so on.

    Syslog monitoring
    Omnibus has a probe called syslog that we use for monitoring Oracle database log files and trigger events in Omnibus if specific events are found in the newspaper.  This is done in real time.

    SNMP traps
    Omnibus currently listening to breaks SNMP of some of our products like CA Autosys and cypress trees of the ASG.


    E-mail

    Omnibus was an electronic probe that monitors a specific e-mail (Google) for specifically formatted e-mail account and then turns them into an Omnibus event.  We use usually this method as a last resort because it is dependent on the e-mail.

    Automation of self-correction
    Omnibus contains what is called "Actions".  We can search for specific events, and then take corrective action with Automation in Omnibus to fix what caused the event. An example of this would be with our product CA Autosys.  Omnibus search specifically for events 'pulse' every 5 minutes. These heartbeats are sent directly by a job in Autosys which in turn proves Autosys is running.  If Omnibus does not see the heart beat, she considers as inactive Autosys and run scripts which then try to restart Autosys save.  Omnibus then reports if this attempt succeeded or failed and worsens the severity according to its conclusions.

    Please keep in mind that all of these methods are with an event-driven approach rather than a performance-based approach.

    I really need to have a clear picture on whether and how the Foglight can cope with the event focused on issues.

    I asked Dell technical support who said the follow-up:

    You may be able to use a script agent to meet your needs.  The script agent could return data to the FMS and you could have a rule that fires based on the data returned.You would be responsible to creating the script used by the agent.  There is a "Builiding Script Agents" section under the "Customizing Your Environment with Tooling | Building Script Agents" in the Foglight Administration and Configuration guide.
    

    Has anyone had experience with this? What was the result? Do you use today? Advantages vs disadvantages?

    If anyone has the source code example that it would be really appreciated! I am also very interested in hearing other thoughts which have Foglight and another tool such as Omnibus or other tools and how they use with Foglight or if they withdrew these tools after having long Foglight.

    Thanks to you all! Looking forward to your answers!

    Larry Roberts

    Larry,

    Foglight can be configured to monitor the applications where an out of the box cartridge (module) was not developed and delivered with the product. There are several ways to customize Foglight.

    Event command line injection

    Looks like you want to send alarms foglight via a command line program. It is feasible because there are available with Foglight APIs to create alarms.

    Syslog monitoring

    Foglight monitoring logs with the logfilter agent. You can set the strings that need to be searched (when a log file is updated by your applications), then specify the corresponding string (based on regular expressions) is a warning, critcal or a fatal alert.

    For monitoring of log files from the database, which is available with the cartridges of the database (DB2, Oracle, SQL Server, MySQL, Sybase) delivered with Foglight.

    SNMP traps

    Foglight can receive (out of the box) SNMP traps and convert them to Foglight alarms.


    E-mail

    I study further on this request. Basically, it takes an e-mail player that puts the emails in a log file that will read the Foglight LogFilter agent and search for error conditions.

    Automation of self-correction Foglight rules have actions that can be performed in the cases where the rule detects specific conditions as the processor high as a basic example and then run the scripts commands, scripts remotely, send SNMP traps, invoke a JavaEE trace, etc. A custom agent could be developed to run your heartbeat AutoSys and then search for an answer to a file or stdout and update a Foglight table that is being tested in a rule.

    David Mendoza

    Foglight Consultant

  • Foglight and integration Sitescope

    Have anyone tried incorporating foglight sitescope?

    I would do it I push foglight alarm for use in sitescope. Is there any documentation for it?

    Thank you

    Research online it looks like Sitescope has a trap SNMP monitor, you are trying to send SNMP traps from Foglight

    http://en.community.Dell.com/TechCenter/performance-monitoring/Foglight-administrators/w/Admins-wiki/5567.event-driven-rules-SNMP-forward

    The article talks about general SNMP forward, but I hope it helps.

    Golan

  • How interrogate foglight alarm_alarm and connect guests with her

    All,

    I know that this question has been asked in different ways.  I'm looking to directly query the database Foglight I can do an analysis through 4 other systems.  We are looking to integrate different sources into a data warehouse for later analysis in a real-time database close.  Foglight is able to give us a lot of points, but not all, and that's why we need query directly on the database.  Ultimately we are only looking for alarms, created time and the host just now.  Help or direction would be really appreciated.

    Kaleb

    Hi, Kaleb,.

    To answer your other point, Foglight alarms are associated with various items in different monitored domain models that may or may not be a host or associated with a host.

    For example, an alarm associated with a 'cluster' has no associated host.

    In an event focused on the rule, the object that triggered the alarm is the source of the event:

    sourceID = @event.get ('topologyObjectID')

    def source = server. TopologyService.getObject (sourceID)

    Once you have the source, you need to look through a variety of possible paths to find the associated host.

    This trial and error process is implemented in the following script, that works for the majority of the alarms:

    def tryPath (source, path) {}

    result = null

    try {}

    result = source.get (path)

    }

    catch (Exception e) {}

    Returns the result

    }

    def getHostName (objectID) {}

    def hostname = null

    def source = null

    try {}

    source = server. TopologyService.getObject (objectID)

    }

    catch (Exception e) {}

    e return

    }

    If (source! = null) {}

    host name = tryPath (source, ' monitoredHost/name "" ")

    If (hostname == null) {}

    host name = tryPath (source, "the name of the controller/monitoredHost /")

    }

    If (hostname == null) {}

    name of host = tryPath (source, 'parent, agent, monitoredHost, name')

    }

    If (hostname == null) {}

    host name = tryPath (source, ' / hostname ")

    }

    If (hostname == null) {}

    host name = tryPath (source, ' controller / / hostname ")

    }

    If (hostname == null) {}

    name of host = tryPath (source, "agent/hostName")

    }

    If (hostname == null) {}

    parents-tryPath (source, 'parents')

    If the host name (parents.size () > 0) = tryPath (parents [0], "/ hostname")

    }

    If (hostname == null) {}

    parents-tryPath (source, ' controller/parents')

    If the host name (parents.size () > 0) = tryPath (parents [0], "/ hostname")

    }

    }

    Returns the host name

    }

    sourceID = def @event.get ('topologyObjectID')

    def hostName = getHostName (sourceID)

    Returns the host name

    Kind regards

    Brian Wheeldon

    Published by Brian Wheeldon to include additional paths to the hosts required by a few rules.

  • Clearly the alarms console outside FMS

    Hello

    I heard of a way to clean up an alarm outside the FMS console.

    Therefore, you should put a document in xml format in a folder on the fms server.

    has someone at - he tested this or could explain the folder in which the XML should be placed and what must be inside the xml more detailed?

    I already have the cartridge of integration but the help pdf confuses me:)

    Thank you very much

    VECD

    Hello vero,.

    I don't know which section including documentation confuses you. Maybe you could post details?

    It is easy to draw a Foglight alarm from the command line using the AlarmService API.

    Here's a groovy script to clear an alarm:

    clearAlarm.groovy

    def alarmID = args [1]

    If (server. AlarmService.getAlarm (alarmID) == null) return "error: alarm"+ alarmID + "not found!". "

    Server. AlarmService.clearAlarm (alarmID)

    return "Off alarm."

    Here is an example of a command line to run this script (Windows and Unix):

    > %FGLHOME%\bin\fglcmd.bat - srv fmshost - fglusr usr - pwd fgluserpw1 - cmd script: run f clearAlarm.groovy 776d121b-e18f-487c-a0cb-b633614a88f2

    Off alarm.

    > $FGLHOME/bin/fglcmd.sh - srv fmshost - fglusr usr - pwd fgluserpw1 - cmd script: run f clearAlarm.groovy 4f905ffb-ba7c-4809-8940-99ad3468a3e7

    Off alarm.

    The "alarmID' passed as an argument to the script is the identifier of the alarm returned by alarm.getID ().

    Kind regards

    Brian Wheeldon

  • Query to find a required rule in the database

    Hi all

    I get a few empty alerts a unknown rule in my console and it's really hard to check each any every rule as much of his time. Could someone help me if there is no work around where we can write a query in the database and look for the rulename, or something like that.

    Please help me if possible...

    Kind regards

    Shiva G

    Hey Shiva

    In your original post, you mentioned that the alarm appears in your console.  You hear your dashboard Foglight alarms?  If so, how the alarm appears in the dashboard?

    Brian

  • ORACLE: Redo wait

    Hi all

    on 5.6.4 we receive foglight alarm:

    ORACLE: Wait for Redo: criticism

    The instance has spent the 29, 30 sec of its activity on again wait, which is significantly higher than the observed typical behavior for this period (

    until 24,69 sec).

    I have a few questions:

    -J' looked at Oracle alertlog and lgwr trace for alarm period but file I saw nothing of Oracle in them. How/where to check if Oracle warned on this subject?

    -Is a problem writing in newspapers of recovery or a problem of redo buffer?

    Thanks for the help.

    I do not think that oracle could write anything in the alert for writing redo log is slow.  Since it is a deviation from normal alarm, you can watch the activity at this time and also waiting for disk queuing (response of the disk write time) during this period.  It's maybe just a case where you have more activity than usual update/insert causing higher than usual waiting for recovery.  Or you can have disk contention where residence your recovery logs.

    Jeff

  • How to plan to clear all alarms in Foglight on a specific date and time?

    Here's what I want to accomplish informing a group of people that all alarms will be erased and then foglight removes all the weapons on a specific date and time

    ..

    1

    I'd like to be able to plan / create a rule or any other option available to use... to be able to send an email through foglight or notification to a specific group of people telling them that all messages will be deleted.

    2nd

    What I'm looking for must be able to clear all alarms in Foglight to a date and a specific time.

    I just noticed this has not had a response.

    I hope that you could see the information contained in the KB, some items may have been written there so I recommend for anyone who wants to use it to test on a test server or check with the support, if there are recent scripts.

    How to purge the alarms? How to program the purge of alarms? (40651)

    Clear the alarms then X days (74729)

    How to clear alarms command line? How to program the clearing of alarms? (51324)

    Is there a way to clean only certain rules specific alarms? (73003)

    How to accuse reception/Clear old messages older then X number of days automatically (72146)

    Regarding the sending of notifications on a schedule, I think the simplest if to set that plan and create a rule with a true value, which is only applicable to this annex (annex rule or rule that is valid on a precise timetable) and has the action to send the notification to users on the alarms about to be cleared/purged.

    Hope this helps

    Golan

  • Configure alarms for VMware Datastore latency in version FREE Foglight!

    Hi all

    I downloaded the version free foglight and that I imported the FVO in my test harness. I want that when a data store has over 50ms latency so I should launch an alert.

    I can't see the alarms on the console free foglight.

    Can you please how can I do to achieve the same thing.

    Thank you

    Vaibhav

    Hi vaibhav,

    The free version does not include a rule to check latency of data store. I checked with development and looks like a new rule has been added to the Foglight for virtualization Enterprise Edition 7.2 (to be released in the following months). The new rule is called: VMW Datastore total latency.

    For the moment, in my view, there is a latency of data store rule included in one of the established community of cartridges. These cartridges are not created by Dell and are not supported. Take a look at it and see if it meets your request.

    http://communities.quest.com/docs/doc-12956#comment-6001

    Note: If your Foglight does not have the ability to install cartridges, chances are that you are running the Standard Edition version. If you have problems, please indicate the full version number. You should find it in the topic of the article.

    Concerning

    Gaston.

  • Foglight for SQL server back alarm

    How foglight knows if a backup succeeded or failed wicker basket SQL server? Best I can tell backups are good, but all the sudden foglight is alarming every day on this.

    There is a metric/property for days since the last backup.  It is a component of SQL Server that we look at and then alarm based on what SQL Server is the last backup date/number of days.

  • Foglight Suggestions alarms

    All,

    How to add or change the suggestion given in the alarms diaglog box? Some I see are not useful, and I would like to add a bit of some alarms.

    Thank you!

    When you change a Foglight rule, you can change the text of the alarm for all gravity. Is that what you had in mind? If I am offbase thanks for posting a screencap and I will advise you further.

    Thank you

    Robert

  • Foglight for VMware - what alarm intercepts a low disk space?

    All,

    According to me, Miss me something here. Low disk space problems catches which alarms for virtual machines?

    Thank you

    -Daniel

    There are

    http://eDOCS.quest.com/Foglight/56/doc/cartridge/vFoglightCartridge/reference.59.3.php#582597

    Logical VMW Virtual Machine time lead estimated fill

    VMW Machine virtual logical drive use.

    Golan

Maybe you are looking for