Foglight alarm
Hello
Please advise on these alarms that we receive: "service is unavailable" from the NetMonitor agent, which more often than not is not true. When we browse our alarms, this is what we find:
These screenshots show alarms generated by the LogFilter agent, which monitors server.log for errors.
This is different from a "service is unavailable" alarm from the NetMonitor agent, which pings a set of devices and alerts you if a target device is not available or if there is packet loss on the network.
Kind regards
Brian Wheeldon
Tags: Dell Tech
Similar Questions
-
Do Foglight alarms clear themselves automatically?
Hello
I'm new to Foglight and I have a general question.
Do Foglight alarms disappear automatically once a problem is solved?
For example, if I get an alarm saying that memory usage is high on a server and I go in and kill a process on that server to lower it, will the alarm be deleted automatically, or do I still have to go and delete it?
Please let me know.
Thank you
Tony
Multi-severity rules generate alarms that clear automatically when the condition goes away and the severity state returns to normal. The severity also changes automatically when an alarm reaches another severity level, for example when an alarm escalates to fatal.
Simple rules, such as the rules for the LogFilter agent, do not clear automatically.
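To make the multi-severity behaviour above concrete, here is a minimal Python sketch of an alarm whose severity follows a metric and which clears once the metric is back to normal. The threshold values and the function names (`evaluate`, `track`) are hypothetical and chosen only for illustration; this is not Foglight's rule engine, just one way to picture it.

```python
# Hypothetical thresholds, highest severity first; real rules define their own conditions.
SEVERITIES = [("fatal", 95.0), ("critical", 90.0), ("warning", 80.0)]


def evaluate(value):
    """Return the highest severity whose threshold the value meets, or None for normal."""
    for name, threshold in SEVERITIES:
        if value >= threshold:
            return name
    return None


def track(samples):
    """Replay metric samples and list every alarm state transition."""
    state = None
    events = []
    for value in samples:
        new_state = evaluate(value)
        if new_state == state:
            continue  # no transition, the alarm stays as it is
        if new_state is None:
            events.append("cleared")
        elif state is None:
            events.append("fired " + new_state)
        else:
            events.append("changed " + state + " -> " + new_state)
        state = new_state
    return events
```

For example, `track([70, 85, 96, 75])` reports an alarm that fires as a warning, changes to fatal, and then clears without anyone deleting it. A simple rule, by contrast, has no "normal" state to return to, so its alarms stay until cleared by hand or by script.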
-
All,
Any idea how I can determine how the SQL cartridge pulls this information? The network management team, the DBA, and the Windows guy all say there is no problem. Is there a SQL query or perfmon counter I should look at?
The Foglight cartridge for SQL Server generated an alarm, for example on the host SRWSQL1.POSNETPROD.COM.
Alarm: DBSS - SQL Packet Error Rate (Message group: Network)
Severity: Warning
Message: The SQL Server instance is experiencing 1.00 packet errors per second.
Created: Tue Mar 27 22:12:13 UTC 2012
The metric is a SQL Server global variable and can be retrieved from the database instance by running the following command in SQL Server:
SELECT @@PACKET_ERRORS
The returned value is cumulative since SQL Server was last started, so Foglight computes a delta to report packet errors since the previous collection. Here's an MSDN article that explains the variable.
http://msdn.Microsoft.com/en-us/library/ms190343.aspx
The data is collected by the SQL Server Global Variables collection, which is built into the SQL Server cartridge.
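The delta calculation described above can be sketched in a few lines of Python. This is only an illustration of turning two readings of a cumulative counter into a per-second rate; the function name and the way a counter reset is handled (treating the new reading as the count since the restart) are assumptions, not the cartridge's documented behaviour.

```python
def packet_error_rate(prev_count, curr_count, interval_seconds):
    """Convert two cumulative counter readings into errors per second.

    prev_count and curr_count are successive values of a counter that only
    grows until the server restarts; interval_seconds is the time between them.
    """
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    delta = curr_count - prev_count
    if delta < 0:
        # Assumption: a smaller reading means the server restarted and the
        # counter started again from zero, so count only the new errors.
        delta = curr_count
    return delta / interval_seconds
```

With a 60-second collection interval, readings of 100 and then 160 give 1.0 errors per second, which matches the scale of the alarm message quoted earlier. If the DBA and network teams see no problem, comparing two manual readings of the counter taken a known time apart is a quick way to check whether the counter is really increasing.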
SQL Server Global Variables
This collection provides general SQL Server data, such as I/O, network, and database uptime.
Type: SQL Server
Collection sampling intervals
Frequency mode: collection interval (in seconds)
Real-time: 20
Online: 60
Offline: 300
Parameters in the collection
Display name: description
Instance Availability: The relative amount of time the SQL Server instance was running during the current interval. If the instance is down, no data collection occurs. Thus, if this metric's value is less than the length of the interval, the value reflects only the active portion of the interval. Calculated as (100 * instance up time) / (length of the interval).
Instance Unavailable: The relative amount of time the SQL Server instance was down during the current interval. If the instance is down, no data collection occurs. Calculated as (100 * (length of the interval - instance up time)) / (length of the interval).
IO Errors: The number of I/O errors encountered by SQL Server.
Instance In Packets: The rate at which packets are received by SQL Server from client applications.
Instance Out Packets: The rate at which packets are sent by SQL Server to client applications.
Instance Packet Errors: The rate at which SQL Server encounters network packet errors.
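The two availability formulas in the parameter list above are plain percentages of the collection interval. A small Python sketch follows, assuming the unavailability figure is the complement of the availability figure (my reading of the garbled formula); the function names are made up for the example.

```python
def availability_percent(up_seconds, interval_seconds):
    """(100 * instance up time) / (length of the interval)."""
    return 100.0 * up_seconds / interval_seconds


def unavailability_percent(up_seconds, interval_seconds):
    """(100 * (length of the interval - instance up time)) / (length of the interval)."""
    return 100.0 * (interval_seconds - up_seconds) / interval_seconds
```

For an online-mode interval of 60 seconds in which the instance was up for 45 seconds, this gives 75% availability and 25% unavailability, and the two values always add up to 100.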
-
Foglight - alarms have http instead of https
All,
Alarms have a hyperlink in the body that uses http. Our FMS runs on https. Is there a setting I'm missing somewhere?
Thank you
-Daniel
Daniel
Locate the CATALYST_URL registry variable and change it to the correct link, for example an https URL with the https port number.
David Mendoza
Foglight Consultant
-
Event-driven alerts in an APM tool such as Foglight
Hi all
Our company is currently having an internal debate about whether Foglight can replace IBM Netcool Omnibus. I had a conference call with Gartner last week to discuss monitoring as a whole, and one gentleman on the phone said that "Foglight can replace Omnibus", which left me a bit confused because I know we have had these discussions with Dell, who said that Foglight is not a replacement for Netcool Omnibus. Unfortunately, we ran out of time and I couldn't probe them on this statement.
We currently use the functionality of Netcool Omnibus in 5 ways:
Event command line injection
There is a binary called postzmsg that is called from the command line with parameters that contain the severity, summary, on-call group, and so on.
Syslog monitoring
Omnibus has a probe called syslog that we use to monitor Oracle database log files and trigger events in Omnibus if specific entries are found in the log. This is done in real time.
SNMP traps
Omnibus currently listens for SNMP traps from some of our products, such as CA Autosys and ASG Cypress.
E-mail
Omnibus has an e-mail probe that monitors a specific e-mail account (Google) for specifically formatted e-mails and turns them into Omnibus events. We usually use this method as a last resort because it depends on e-mail.
Automation of self-correction
Omnibus contains what are called "Actions". We can search for specific events and then take corrective action with automation in Omnibus to fix what caused the event. An example of this is our product CA Autosys. Omnibus looks specifically for "heartbeat" events every 5 minutes. These heartbeats are sent directly by a job in Autosys, which in turn proves that Autosys is running. If Omnibus does not see the heartbeat, it considers Autosys inactive and runs scripts that try to restart Autosys. Omnibus then reports whether this attempt succeeded or failed and escalates the severity according to the result.
Please keep in mind that all of these methods take an event-driven approach rather than a performance-based approach.
I really need a clear picture of whether and how Foglight can handle event-driven monitoring.
I asked Dell technical support, who said the following:
You may be able to use a script agent to meet your needs. The script agent could return data to the FMS and you could have a rule that fires based on the data returned. You would be responsible for creating the script used by the agent. There is a "Building Script Agents" section under "Customizing Your Environment with Tooling | Building Script Agents" in the Foglight Administration and Configuration Guide.
Has anyone had experience with this? What was the result? Do you use it today? What are the advantages and disadvantages?
If anyone has example source code, it would be really appreciated! I am also very interested in hearing from others who have Foglight and another tool such as Omnibus, how they use that tool with Foglight, or whether they retired those tools after having Foglight for a long time.
Thanks to you all! Looking forward to your answers!
Larry Roberts
Larry,
Foglight can be configured to monitor applications for which an out-of-the-box cartridge (module) has not been developed and delivered with the product. There are several ways to customize Foglight.
Event command line injection
It sounds like you want to send alarms to Foglight via a command-line program. This is feasible because Foglight provides APIs to create alarms.
Syslog monitoring
Foglight monitors logs with the LogFilter agent. You can define the strings to search for (whenever a log file is updated by your applications), then specify whether a matching string (based on regular expressions) is a warning, critical, or fatal alert.
Monitoring of database log files is available with the database cartridges (DB2, Oracle, SQL Server, MySQL, Sybase) delivered with Foglight.
SNMP traps
Foglight can receive SNMP traps (out of the box) and convert them to Foglight alarms.
E-mail
I will look into this request further. Basically, it needs an e-mail reader that writes the e-mails to a log file, which the Foglight LogFilter agent then reads and searches for error conditions.
Automation of self-correction
Foglight rules have actions that can be performed when the rule detects specific conditions (high CPU, as a basic example), such as running commands or scripts, running scripts remotely, sending SNMP traps, invoking a JavaEE trace, etc. A custom agent could be developed to run your AutoSys heartbeat check, look for a response in a file or on stdout, and update a Foglight table that is evaluated by a rule.
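As a rough picture of the log monitoring idea described under "Syslog monitoring" (regular expressions mapped to warning, critical, or fatal), here is a minimal Python sketch. The patterns, the severity assignments, and the function names are hypothetical examples only; the LogFilter agent is configured through its own agent properties, not through code like this.

```python
import re

# Hypothetical pattern-to-severity table, most severe first.
PATTERNS = [
    (re.compile(r"ORA-00600|OutOfMemoryError"), "fatal"),
    (re.compile(r"\bERROR\b"), "critical"),
    (re.compile(r"\bWARN(ING)?\b"), "warning"),
]


def classify(line):
    """Return the severity of the first pattern that matches the line, or None."""
    for pattern, severity in PATTERNS:
        if pattern.search(line):
            return severity
    return None


def scan(lines):
    """Return (severity, line) pairs for every line that matches a pattern."""
    matches = []
    for line in lines:
        severity = classify(line)
        if severity is not None:
            matches.append((severity, line))
    return matches
```

The same shape would cover the e-mail idea: if a mail reader appends each message to a file, the lines of that file can be classified in exactly the same way.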
David Mendoza
Foglight Consultant
-
Foglight and SiteScope integration
Has anyone tried integrating Foglight with SiteScope?
I would like to push Foglight alarms for use in SiteScope. Is there any documentation for it?
Thank you
From researching online, it looks like SiteScope has an SNMP trap monitor, so you could try sending SNMP traps from Foglight.
The article talks about general SNMP forwarding, but I hope it helps.
Golan
-
How to query the Foglight alarm_alarm table and join hosts with it
All,
I know that this question has been asked in different ways. I'm looking to query the Foglight database directly so that I can do an analysis across 4 other systems. We are looking to integrate different sources into a data warehouse for later analysis in a near-real-time database. Foglight is able to give us a lot of data points, but not all, and that's why we need to query the database directly. Ultimately we are only looking for alarms, created time, and the host for now. Any help or direction would be really appreciated.
Kaleb
Hi Kaleb,
To answer your other point, Foglight alarms are associated with various objects in different monitored domain models, which may or may not be a host or be associated with a host.
For example, an alarm associated with a 'cluster' has no associated host.
In an event-driven rule, the object that triggered the alarm is the source of the event:
sourceID = @event.get('topologyObjectID')
def source = server.TopologyService.getObject(sourceID)
Once you have the source, you need to look through a variety of possible paths to find the associated host.
This trial-and-error process is implemented in the following script, which works for the majority of alarms:
// Try to read a property path from a topology object; return null if it does not exist.
def tryPath(source, path) {
    def result = null
    try {
        result = source.get(path)
    }
    catch (Exception e) {}
    return result
}

// Walk the known paths in order until one of them yields a host name.
def getHostName(objectID) {
    def hostName = null
    def source = null
    def parents = null
    try {
        source = server.TopologyService.getObject(objectID)
    }
    catch (Exception e) {
        return e
    }
    if (source != null) {
        hostName = tryPath(source, "monitoredHost/name")
        if (hostName == null) {
            hostName = tryPath(source, "controller/monitoredHost/name")
        }
        if (hostName == null) {
            hostName = tryPath(source, "parent/agent/monitoredHost/name")
        }
        if (hostName == null) {
            hostName = tryPath(source, "hostName")
        }
        if (hostName == null) {
            hostName = tryPath(source, "controller/hostName")
        }
        if (hostName == null) {
            hostName = tryPath(source, "agent/hostName")
        }
        if (hostName == null) {
            parents = tryPath(source, "parents")
            // Guard against a missing 'parents' property before calling size().
            if (parents != null && parents.size() > 0) hostName = tryPath(parents[0], "hostName")
        }
        if (hostName == null) {
            parents = tryPath(source, "controller/parents")
            if (parents != null && parents.size() > 0) hostName = tryPath(parents[0], "hostName")
        }
    }
    return hostName
}

def sourceID = @event.get('topologyObjectID')
def hostName = getHostName(sourceID)
return hostName
Kind regards
Brian Wheeldon
Edited by Brian Wheeldon to include additional host paths required by a few rules.
-
Clearing alarms outside the FMS console
Hello
I heard of a way to clear an alarm outside the FMS console.
Apparently, you have to put a document in XML format in a folder on the FMS server.
Has anyone tested this, or could someone explain in more detail which folder the XML should be placed in and what must be inside the XML?
I already have the integration cartridge, but the help PDF confuses me :)
Thank you very much
VECD
Hello vero,
I don't know which section of the documentation confuses you. Maybe you could post details?
It is easy to clear a Foglight alarm from the command line using the AlarmService API.
Here's a Groovy script to clear an alarm:
clearAlarm.groovy:
def alarmID = args[1]
if (server.AlarmService.getAlarm(alarmID) == null) return "Error: alarm " + alarmID + " not found!"
server.AlarmService.clearAlarm(alarmID)
return "Alarm cleared."
Here are example command lines to run this script (Windows and Unix):
> %FGLHOME%\bin\fglcmd.bat -srv fmshost -usr fglusr -pwd fgluserpw1 -cmd script:run -f clearAlarm.groovy 776d121b-e18f-487c-a0cb-b633614a88f2
Alarm cleared.
> $FGLHOME/bin/fglcmd.sh -srv fmshost -usr fglusr -pwd fgluserpw1 -cmd script:run -f clearAlarm.groovy 4f905ffb-ba7c-4809-8940-99ad3468a3e7
Alarm cleared.
The alarmID passed as an argument to the script is the alarm identifier returned by alarm.getID().
Kind regards
Brian Wheeldon
-
Query to find a required rule in the database
Hi all
I get a few blank alerts from an unknown rule in my console, and it's really hard to check each and every rule because it takes a lot of time. Could someone tell me whether there is a workaround where we can write a database query and look up the rule name, or something like that?
Please help me if possible...
Kind regards
Shiva G
Hey Shiva
In your original post, you mentioned that the alarm appears in your console. Do you mean the Foglight alarms dashboard? If so, how does the alarm appear in the dashboard?
Brian
-
Hi all
On 5.6.4 we receive this Foglight alarm:
ORACLE: Wait for Redo: Critical
The instance has spent 29.30 sec of its activity on redo wait, which is significantly higher than the typical behavior observed for this period (up to 24.69 sec).
I have a few questions:
- I looked at the Oracle alert log and the LGWR trace file for the alarm period, but I saw nothing from Oracle in them. How/where can I check whether Oracle warned about this?
- Is this a problem writing to the redo logs or a redo buffer problem?
Thanks for the help.
I do not think Oracle would write anything to the alert log when redo log writes are slow. Since this is a deviation-from-normal alarm, you can look at the activity at that time and also at disk queuing (disk write response time) during the period. It may just be a case of more update/insert activity than usual causing higher-than-usual redo wait. Or you may have disk contention where your redo logs reside.
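A "deviation from normal" alarm of this kind compares the current value with what is typical for the same period. The thread does not say how Foglight derives its typical range, so the following Python sketch uses a deliberately simple stand-in (mean plus two standard deviations of past samples); the function names, the factor `k`, and the sample history are all assumptions for illustration.

```python
from statistics import mean, pstdev


def typical_upper_bound(history, k=2.0):
    """Upper edge of 'typical' behaviour: mean plus k population standard deviations."""
    return mean(history) + k * pstdev(history)


def deviates(current, history, k=2.0):
    """True if the current value is above the typical upper bound."""
    return current > typical_upper_bound(history, k)
```

With an invented history of `[20, 22, 21, 23, 19]` seconds of redo wait, the bound is about 23.8 seconds, so a reading of 29.30 would be flagged while 22.0 would not. The practical point matches the reply above: such an alarm says "more than usual", not "Oracle reported an error", so nothing has to appear in the alert log.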
Jeff
-
How to schedule clearing all alarms in Foglight at a specific date and time?
Here's what I want to accomplish: inform a group of people that all alarms will be cleared, and then have Foglight clear all the alarms at a specific date and time.
1. I'd like to schedule or create a rule (or use any other available option) to send an email or notification through Foglight to a specific group of people, telling them that all alarms will be cleared.
2. I also need to be able to clear all alarms in Foglight at a specific date and time.
I just noticed this has not had a response.
I hope you can view the information in the KB. Some of the articles may have been written a while ago, so I recommend that anyone who wants to use them test on a test server or check with support for more recent scripts.
How to purge alarms? How to schedule the purging of alarms? (40651)
Clear alarms older than X days (74729)
How to clear alarms from the command line? How to schedule the clearing of alarms? (51324)
Is there a way to clear only alarms from specific rules? (73003)
How to acknowledge/clear old alarms older than X number of days automatically (72146)
Regarding sending notifications on a schedule, I think the simplest approach is to define that schedule and create a rule with a condition of true that is only active on that schedule (a scheduled rule, that is, a rule that is valid only during a specific schedule) and whose action sends the notification to users about the alarms about to be cleared/purged.
Hope this helps
Golan
-
Configure alarms for VMware datastore latency in the FREE version of Foglight!
Hi all
I downloaded the free version of Foglight and imported the FVO into my test environment. I want an alert to be raised when a datastore has over 50 ms latency.
I can't see the alarms on the free Foglight console.
Can you please tell me how I can achieve this?
Thank you
Vaibhav
Hi vaibhav,
The free version does not include a rule to check datastore latency. I checked with development, and it looks like a new rule has been added to Foglight for Virtualization Enterprise Edition 7.2 (to be released in the coming months). The new rule is called: VMW Datastore total latency.
For the moment, I believe there is a datastore latency rule included in one of the community-built cartridges. These cartridges are not created by Dell and are not supported. Take a look at it and see if it meets your needs.
http://communities.quest.com/docs/doc-12956#comment-6001
Note: If your Foglight does not have the ability to install cartridges, chances are that you are running the Standard Edition. If you have problems, please provide the full version number. You should find it under About.
Regards
Gaston.
-
Foglight for SQL Server backup alarm
How does Foglight know whether a backup succeeded or failed for SQL Server? As best I can tell the backups are good, but all of a sudden Foglight is alarming on this every day.
There is a metric/property for days since the last backup. It is a value from SQL Server that we look at, and we then alarm based on what SQL Server reports as the last backup date/number of days.
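The "days since the last backup" check described above can be pictured with a short Python sketch. The thresholds (one day for a warning, three for critical) and the function names are hypothetical; the actual rule thresholds are whatever is configured in your Foglight rule. SQL Server itself records backup history in msdb (for example the backup_finish_date column of msdb.dbo.backupset), which is a reasonable place to compare against, though the thread does not say that this is what the cartridge reads.

```python
from datetime import datetime


def days_since_last_backup(last_backup, now):
    """Elapsed time between the last backup and 'now', in (fractional) days."""
    return (now - last_backup).total_seconds() / 86400.0


def backup_severity(days, warning_days=1.0, critical_days=3.0):
    """Map the backup age to a severity; thresholds here are made-up defaults."""
    if days >= critical_days:
        return "critical"
    if days >= warning_days:
        return "warning"
    return None
```

If backups look healthy but the alarm fires daily, it is worth checking that the last backup date SQL Server reports for each database is what you expect, since a backup taken by a tool that does not register in SQL Server's history would leave this value stale.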
-
All,
How do I add to or change the suggestion given in the alarm dialog box? Some that I see are not useful, and I would like to add a bit to some alarms.
Thank you!
When you change a Foglight rule, you can change the alarm text for every severity. Is that what you had in mind? If I am off base, please post a screen capture and I will advise you further.
Thank you
Robert
-
Foglight for VMware - which alarm catches low disk space?
All,
I think I'm missing something here. Which alarms catch low disk space problems for virtual machines?
Thank you
-Daniel
There are two:
http://eDOCS.quest.com/Foglight/56/doc/cartridge/vFoglightCartridge/reference.59.3.php#582597
VMW Virtual Machine Logical Drive Estimated Fill Time
VMW Virtual Machine Logical Drive Utilization
Golan