Foglight alarm

Hello

Please advise on these "service is unavailable" alarms that we receive from the NetMonitor agent, which more often than not are not true. When we browse our alarms, this is what we find.

These screenshots show alarms generated by the LogFilter agent, which monitors server.log for errors.

This is different from a "service is unavailable" alarm from the NetMonitor agent, which pings a set of devices and alerts you if a target device is unavailable or if there is packet loss on the network.

Kind regards

Brian Wheeldon

Tags: Dell Tech

Similar Questions

Do Foglight alarms clear themselves automatically?

Hello

I'm new to Foglight and I have a general question.

Do Foglight alarms disappear automatically once a problem is solved?

For example, if I get an alarm saying that memory usage is high on a server and I go in and kill a process on that server to lower it, will the alarm be deleted automatically or do I still have to go and delete it?

Please let me know.

Thank you

Tony

Multi-severity rules generate alarms that clear automatically when the condition goes away and the severity state returns to normal. The severity also changes automatically when an alarm reaches another severity level, for example when an alarm escalates to fatal.

Simple rules, such as the rules for the LogFilter agent, do not clear automatically.
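To illustrate the difference, here is a rough Python sketch of how a multi-severity rule behaves as a metric changes. It is not Foglight code: the threshold values and the names `THRESHOLDS`, `evaluate` and `track_alarm` are assumptions made up for the example.

```python
# Hypothetical severity thresholds for a memory-usage rule (percent used).
# These numbers are illustrative only, not Foglight defaults.
THRESHOLDS = [("Fatal", 95.0), ("Critical", 90.0), ("Warning", 80.0)]


def evaluate(value):
    """Return the highest severity whose threshold is met, or None for normal."""
    for severity, limit in THRESHOLDS:
        if value >= limit:
            return severity
    return None


def track_alarm(samples):
    """Replay metric samples and record every change in the alarm state."""
    history = []
    current = None
    for value in samples:
        severity = evaluate(value)
        if severity != current:
            # A return to normal clears the alarm without manual action.
            history.append(severity if severity else "Cleared")
            current = severity
    return history


print(track_alarm([70, 85, 92, 97, 60]))
```

A simple rule has no "normal" state to return to, which is why its alarms stay open until someone clears them.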

Foglight - alarm SQL

All,

Any idea how I can determine how the SQL cartridge pulls this information? The network management team, the DBA and the Windows guy all say there is no problem. Is there any SQL query or perfmon counter I should look at?

The Foglight cartridge for SQL Server generated an alarm, for example on the host SRWSQL1.POSNETPROD.COM.

Alarm:	DBSS - SQL Packet Error Rate (Message group: Network)
Severity:	Warning
Message:	The SQL Server instance has 1.00 packet errors per second.
Created:	Tue Mar 27 22:12:13 UTC 2012

The metric is a SQL Server global variable and can be read from the database instance by running the following command in SQL Server:

SELECT @@PACKET_ERRORS

The returned value is cumulative since the last SQL Server restart, so Foglight computes a delta to report the packet errors since the previous collection. Here's an MSDN article that explains the variable.

http://msdn.Microsoft.com/en-us/library/ms190343.aspx
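As a rough illustration of that delta, the Python sketch below turns two readings of a cumulative counter into a per-second rate. It is not the cartridge's actual code; the function name `packet_error_rate` and the restart handling are assumptions.

```python
def packet_error_rate(prev_total, curr_total, interval_seconds):
    """Errors per second between two readings of a cumulative counter."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    delta = curr_total - prev_total
    if delta < 0:
        # Assumption: a counter that went backwards means the instance
        # restarted, so everything in the current reading is new.
        delta = curr_total
    return delta / interval_seconds


# Two readings taken 300 seconds apart (the offline collection interval).
print(packet_error_rate(1200, 1500, 300))
```

If the rate looks wrong, running the SELECT twice a known number of seconds apart and doing the same arithmetic by hand is a quick cross-check.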

The data is collected by the SQL Server Global Variables collection, which is built into the SQL Server cartridge.

SQL Server Global Variables

This collection provides general SQL Server data, such as I/O, network activity and instance uptime.

Type: SQL Server

Collection sampling intervals

Frequency mode	Collection interval (seconds)
Real-time	20
Online	60
Offline	300

Metrics in the collection

Display name	Description
Instance Availability	The relative amount of time the SQL Server instance was running during the current interval. If the instance is down, no data collection occurs. Thus, if the instance uptime is less than the length of the interval, this metric reflects only the active portion of the interval. Formula: (100 * instance uptime) / (length of the interval)
Instance Unavailability	The relative amount of time the SQL Server instance was down during the current interval. If the instance is down, no data collection occurs. Formula: 100 * (length of the interval - instance uptime) / (length of the interval)
IO Errors	The number of I/O errors encountered by SQL Server.
Instance In Packets	The rate at which packets are being received by SQL Server from client applications.
Instance Out Packets	The rate at which packets are being sent by SQL Server to client applications.
Instance Packet Errors	The rate at which SQL Server encounters network packet errors.
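For anyone who wants to check the two availability figures by hand, here is a small Python sketch of the formulas. It assumes the unavailability value is simply the complement of the availability value; the function names are made up for the example.

```python
def availability(uptime_seconds, interval_seconds):
    """Percentage of the interval during which the instance was running."""
    return 100.0 * uptime_seconds / interval_seconds


def unavailability(uptime_seconds, interval_seconds):
    """Percentage of the interval during which the instance was down."""
    return 100.0 * (interval_seconds - uptime_seconds) / interval_seconds


# Instance was up for 45 seconds of a 60-second online collection interval.
print(availability(45, 60), unavailability(45, 60))
```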

Foglight - alarms have http instead of https

All,

Alarms have a hyperlink in the body that uses http. Our FMS works well over https. Is there a setting I'm missing somewhere?

Thank you

-Daniel

Daniel,

Locate the CATALYST_URL registry variable and change it to the correct link, for example https with the matching port number.

David Mendoza

Foglight Consultant

Event-driven alerts in an APM tool such as Foglight

Hi all

Our company is currently having an internal debate about whether Foglight can replace IBM Netcool Omnibus. I had a conference call with Gartner last week to discuss monitoring as a whole, and one gentleman on the call said that "Foglight can replace Omnibus", which left me a bit confused because I know we have had these discussions with Dell, who said that Foglight is not a replacement for Netcool Omnibus. Unfortunately, we ran out of time and I couldn't probe them on this statement.

We currently use Netcool Omnibus in 5 ways:

Event command line injection

There is a binary called postzmsg that is invoked from the command line with parameters that contain the severity, summary, on-call group and so on.

Syslog monitoring
Omnibus has a probe called syslog that we use to monitor Oracle database log files and trigger events in Omnibus if specific entries are found in the log. This is done in real time.

SNMP traps
Omnibus currently listens for SNMP traps from some of our products, such as CA Autosys and ASG Cypress.

E-mail

Omnibus has an e-mail probe that monitors a specific e-mail account (Google) for specifically formatted e-mails and turns them into Omnibus events. We usually use this method as a last resort because it depends on e-mail.

Automated self-correction
Omnibus contains what are called "Actions". We can look for specific events and then take corrective action with automation in Omnibus to fix what caused the event. An example of this is our CA Autosys product. Omnibus looks specifically for "heartbeat" events every 5 minutes. These heartbeats are sent directly by a job in Autosys, which in turn proves Autosys is running. If Omnibus does not see the heartbeat, it considers Autosys down and runs scripts that try to restart Autosys. Omnibus then reports whether this attempt succeeded or failed and escalates the severity accordingly.

Please keep in mind that all of these methods take an event-driven approach rather than a performance-based approach.

I really need a clear picture of whether and how Foglight can cope with these event-focused use cases.

I asked Dell technical support, who said the following:
```
You may be able to use a script agent to meet your needs.  The script agent could return data to the FMS and you could have a rule that fires based on the data returned. You would be responsible for creating the script used by the agent.  There is a "Building Script Agents" section under "Customizing Your Environment with Tooling | Building Script Agents" in the Foglight Administration and Configuration guide.
```
Has anyone had experience with this? What was the result? Are you using it today? What are the advantages and disadvantages?

If anyone has example source code, it would be really appreciated! I am also very interested in hearing from others who run Foglight alongside another tool such as Omnibus, how they use it with Foglight, or whether they retired those tools after having Foglight for a while.

Thanks to you all! Looking forward to your answers!

Larry Roberts

Larry,

Foglight can be configured to monitor applications for which an out-of-the-box cartridge (module) has not been developed and delivered with the product. There are several ways to customize Foglight.

Event command line injection

It looks like you want to send alarms to Foglight via a command line program. This is feasible because Foglight provides APIs to create alarms.

Syslog monitoring

Foglight monitors logs with the LogFilter agent. You set the strings to search for (whenever a log file is updated by your applications), based on regular expressions, and then specify whether a matching string raises a warning, critical or fatal alert.

Database log file monitoring is available with the database cartridges (DB2, Oracle, SQL Server, MySQL, Sybase) delivered with Foglight.
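To give a feel for the pattern-to-severity idea, here is a minimal Python sketch. It is not LogFilter configuration syntax; the patterns, the `RULES` list and the function names are hypothetical and only mimic the behaviour described above.

```python
import re

# Hypothetical patterns, ordered from most to least severe.
RULES = [
    ("Fatal", re.compile(r"OutOfMemoryError|FATAL")),
    ("Critical", re.compile(r"\bERROR\b")),
    ("Warning", re.compile(r"\bWARN(ING)?\b")),
]


def classify(line):
    """Return the severity of the first matching pattern, or None."""
    for severity, pattern in RULES:
        if pattern.search(line):
            return severity
    return None


def scan(lines):
    """Return (severity, line) pairs for every line that matches a rule."""
    matches = []
    for line in lines:
        severity = classify(line)
        if severity is not None:
            matches.append((severity, line))
    return matches


sample = [
    "2012-03-27 22:12:13 INFO server started",
    "2012-03-27 22:12:14 WARN connection pool almost full",
    "2012-03-27 22:12:15 ERROR ORA-00600 internal error",
]
for severity, line in scan(sample):
    print(severity, "|", line)
```

Ordering the patterns from most to least severe means a line that matches several patterns is reported once, at the highest severity.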

SNMP traps

Foglight can receive SNMP traps (out of the box) and convert them to Foglight alarms.

E-mail

I will look into this request further. Basically, it needs an e-mail reader that writes the e-mails to a log file, which the Foglight LogFilter agent then reads and searches for error conditions.

Automated self-correction

Foglight rules have actions that run when the rule detects specific conditions (high CPU, as a basic example). The actions can run commands and scripts, run scripts remotely, send SNMP traps, invoke a JavaEE trace, etc. A custom agent could be developed to run your AutoSys heartbeat check, look for a response in a file or on stdout, and update a Foglight table that is tested by a rule.
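The heartbeat logic such an agent and rule would need can be sketched in a few lines of Python. This is only an outline under assumptions: the function name, the 60-second grace period and the severity mapping are invented for the example, and `restart` stands in for whatever script restarts AutoSys.

```python
def handle_heartbeat(last_seen, now, restart, interval_seconds=300, grace_seconds=60):
    """Decide the alarm state from the age of the last heartbeat.

    last_seen and now are timestamps in seconds; restart is a callable
    that attempts the corrective action and returns True on success.
    """
    if now - last_seen <= interval_seconds + grace_seconds:
        return "Normal"
    # Heartbeat missed: try the corrective action and escalate if it fails.
    return "Warning" if restart() else "Critical"


print(handle_heartbeat(last_seen=0, now=200, restart=lambda: True))
print(handle_heartbeat(last_seen=0, now=1000, restart=lambda: False))
```

The 300-second default matches the 5-minute heartbeat described in the question.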

David Mendoza

Foglight Consultant

Foglight and SiteScope integration

Has anyone tried integrating Foglight with SiteScope?

I would like to push Foglight alarms to SiteScope. Is there any documentation for this?

Thank you

From researching online, it looks like SiteScope has an SNMP trap monitor, so you could try sending SNMP traps from Foglight.

http://en.community.Dell.com/TechCenter/performance-monitoring/Foglight-administrators/w/Admins-wiki/5567.event-driven-rules-SNMP-forward

The article talks about general SNMP forward, but I hope it helps.

Golan

How to query the Foglight alarm_alarm table and join hosts to it

All,

I know that this question has been asked in different ways. I'm looking to query the Foglight database directly so that I can do analysis across 4 other systems. We are looking to integrate different sources into a data warehouse for later analysis in a near-real-time database. Foglight can give us many of the data points, but not all, which is why we need to query the database directly. For now we are only looking for alarms, created time and the host. Any help or direction would be really appreciated.

Kaleb

Hi Kaleb,

To answer your other point, Foglight alarms are associated with various objects in different monitored domain models, and these may or may not be a host or be associated with a host.

For example, an alarm associated with a "cluster" has no associated host.

In an event-driven rule, the object that triggered the alarm is the source of the event:

def sourceID = @event.get('topologyObjectID')

def source = server.TopologyService.getObject(sourceID)

Once you have the source, you need to look through a variety of possible paths to find the associated host.

This trial-and-error process is implemented in the following script, which works for the majority of alarms:

// Try one property path on the source object; return null if it does not exist.
def tryPath(source, path) {
    def result = null
    try {
        result = source.get(path)
    }
    catch (Exception e) {}
    return result
}

// Walk the known paths from an alarm source to its host name.
def getHostName(objectID) {
    def hostName = null
    def source = null
    try {
        source = server.TopologyService.getObject(objectID)
    }
    catch (Exception e) {
        return e
    }
    if (source != null) {
        hostName = tryPath(source, "monitoredHost/name")
        if (hostName == null) {
            hostName = tryPath(source, "controller/monitoredHost/name")
        }
        if (hostName == null) {
            hostName = tryPath(source, "parent/agent/monitoredHost/name")
        }
        if (hostName == null) {
            hostName = tryPath(source, "hostName")
        }
        if (hostName == null) {
            hostName = tryPath(source, "controller/hostName")
        }
        if (hostName == null) {
            hostName = tryPath(source, "agent/hostName")
        }
        if (hostName == null) {
            def parents = tryPath(source, "parents")
            // tryPath returns null when the path is missing, so check before calling size()
            if (parents != null && parents.size() > 0) hostName = tryPath(parents[0], "hostName")
        }
        if (hostName == null) {
            def parents = tryPath(source, "controller/parents")
            if (parents != null && parents.size() > 0) hostName = tryPath(parents[0], "hostName")
        }
    }
    return hostName
}

def sourceID = @event.get('topologyObjectID')
def hostName = getHostName(sourceID)
return hostName

Kind regards

Brian Wheeldon

Edited by Brian Wheeldon to include additional host paths required by a few rules.

Clearing alarms outside the FMS console

Hello

I heard there is a way to clear an alarm from outside the FMS console.

Apparently, you put a document in XML format in a folder on the FMS server.

Has anyone tested this, or could someone explain in more detail which folder the XML should be placed in and what must be inside the XML?

I already have the integration cartridge, but the help PDF confuses me :)

Thank you very much

VECD

Hello vero,

I don't know which section of the documentation confuses you. Maybe you could post details?

It is easy to clear a Foglight alarm from the command line using the AlarmService API.

Here's a Groovy script to clear an alarm:


clearAlarm.groovy

def alarmID = args[1]

if (server.AlarmService.getAlarm(alarmID) == null) return "Error: alarm " + alarmID + " not found!"

server.AlarmService.clearAlarm(alarmID)

return "Alarm cleared."

Here is an example of a command line to run this script (Windows and Unix):

> %FGLHOME%\bin\fglcmd.bat -srv fmshost -usr fglusr -pwd fgluserpw1 -cmd script:run -f clearAlarm.groovy 776d121b-e18f-487c-a0cb-b633614a88f2

Alarm cleared.

> $FGLHOME/bin/fglcmd.sh -srv fmshost -usr fglusr -pwd fgluserpw1 -cmd script:run -f clearAlarm.groovy 4f905ffb-ba7c-4809-8940-99ad3468a3e7

Alarm cleared.

The alarmID passed as an argument to the script is the alarm identifier returned by alarm.getID().

Kind regards

Brian Wheeldon

Query to find a required rule in the database

Hi all

I get a few blank alerts from an unknown rule in my console, and it's really hard to check each and every rule because it takes a lot of time. Could someone tell me whether there is a workaround where we can write a query against the database and look up the rule name, or something like that?

Please help me if possible...

Kind regards

Shiva G

Hey Shiva

In your original post, you mentioned that the alarm appears in your console. Do you mean your Foglight alarms dashboard? If so, how does the alarm appear in the dashboard?

Brian

ORACLE: Redo wait

Hi all

On 5.6.4 we receive this Foglight alarm:

ORACLE: Redo Wait: Critical

The instance has spent 29.30 sec of its activity on redo wait, which is significantly higher than the typical behavior observed for this period (up to 24.69 sec).

I have a few questions:

- I looked at the Oracle alert log and the LGWR trace file for the alarm period, but I saw nothing from Oracle in them. How/where can I check whether Oracle warned about this?

- Is it a problem writing to the redo logs or a redo buffer problem?

Thanks for the help.

I do not think Oracle would write anything to the alert log when redo log writes are slow. Since this is a deviation-from-normal alarm, you can look at the activity at that time and also at disk queuing (disk write response time) during that period. It may just be a case where more update/insert activity than usual caused higher than usual redo wait. Or you may have disk contention where your redo logs reside.

Jeff

How can I schedule clearing all alarms in Foglight at a specific date and time?

Here is what I want to accomplish: inform a group of people that all alarms will be cleared, and then have Foglight clear all the alarms at a specific date and time.

1.

I'd like to be able to schedule or create a rule (or use any other available option) that sends an e-mail or notification through Foglight to a specific group of people, telling them that all alarms will be cleared.

2.

I also need to be able to clear all alarms in Foglight at a specific date and time.

I just noticed this has not had a response.

I hope you can find the information you need in the KB articles below. Some of them may have been written a while ago, so I recommend that anyone who wants to use them test on a test server first, or check with support whether more recent scripts exist.

How to purge alarms? How to schedule the purging of alarms? (40651)

Clear alarms older than X days (74729)

How to clear alarms from the command line? How to schedule the clearing of alarms? (51324)

Is there a way to clear only the alarms of specific rules? (73003)

How to acknowledge/clear old alarms older than X days automatically (72146)

Regarding sending notifications on a schedule, I think the simplest approach is to define that schedule and create a rule whose condition is always true, which is scoped to that schedule (a scheduled rule, valid only during a specific time window) and whose action sends the notification to users about the alarms about to be cleared/purged.

Hope this helps

Golan

Configure alarms for VMware datastore latency in the FREE version of Foglight!

Hi all

I downloaded the free version of Foglight and imported the FVO into my test environment. I want an alert to be raised when a datastore has more than 50 ms latency.

I can't see the alarms in the free Foglight console.

Can you please tell me how I can achieve this?

Thank you

Vaibhav

Hi Vaibhav,

The free version does not include a rule to check datastore latency. I checked with development, and it looks like a new rule has been added to Foglight for Virtualization Enterprise Edition 7.2 (to be released in the coming months). The new rule is called: VMW Datastore total latency.

For the moment, I believe there is a datastore latency rule included in one of the community-built cartridges. These cartridges are not created by Dell and are not supported. Take a look at it and see if it meets your needs.

http://communities.quest.com/docs/doc-12956#comment-6001

Note: If your Foglight does not have the ability to install cartridges, chances are that you are running the Standard Edition. If you have problems, please post the full version number. You should find it on the About page.

Regards,

Gaston

Foglight for SQL Server backup alarm

How does Foglight know whether a backup succeeded or failed in SQL Server? As best I can tell the backups are good, but all of a sudden Foglight is alarming on this every day.

There is a metric/property for days since the last backup. It is a SQL Server value that we read, and we then alarm based on what SQL Server reports as the last backup date/number of days.

Foglight alarm suggestions

All,

How do I add or change the suggestion given in the alarm dialog box? Some that I see are not useful, and I would like to add a bit to some alarms.

Thank you!

When you edit a Foglight rule, you can change the alarm text for each severity. Is that what you had in mind? If I am off base, please post a screen capture and I will advise you further.

Thank you

Robert

Foglight for VMware - which alarm catches low disk space?

All,

I think I'm missing something here. Which alarms catch low disk space problems for virtual machines?

Thank you

-Daniel

There are:

http://eDOCS.quest.com/Foglight/56/doc/cartridge/vFoglightCartridge/reference.59.3.php#582597

VMW Virtual Machine Logical Drive Estimated Fill Time

VMW Virtual Machine Logical Drive Utilization

Golan
