pcooley
Joined: 07 Nov 2008 Posts: 4
Posted: Fri Feb 05, 2010 8:43 am Post subject: Monitoring Time Sync Probe false positives
I have the Versiera agent installed on an Ubuntu Server:
Linux lorax 2.6.31-17-server #54-Ubuntu SMP Thu Dec 10 18:06:56 UTC 2009 x86_64 GNU/Linux
I recently recovered this server after it had been offline for a month. It has a Time Sync Probe to make sure its clock doesn't drift too far.
My symptom is that the alert goes off once every 10 to 20 minutes. My investigation suggests that NTP never actually needs to adjust the time by 174 seconds. Below is a set of email alerts I received within one hour. Over a day this can amount to 50 or more emails.
My instinct (of course, I could be out in left field): are there three load-balanced servers that each have a different time, so that the reported offset depends on which one I hit?
---
Versiera Service to pcooley
4:43 AM (41 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])
The condition of the alert has changed to normal.
Template: Linux Server (Lorax)
Probe: Time Sync
Condition: normal
Time Sync Offset: -34 seconds
---
Versiera Service to pcooley
4:53 AM (30 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])
The condition of the alert has changed to degraded.
Template: Linux Server (Lorax)
Probe: Time Sync
Condition: degraded
Time Sync Offset: -173 seconds
---
Versiera Service to pcooley
5:03 AM (20 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])
The condition of the alert has changed to normal.
Template: Linux Server (Lorax)
Probe: Time Sync
Condition: normal
Time Sync Offset: -15 seconds
---
Versiera Service to pcooley
5:23 AM (0 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])
The condition of the alert has changed to degraded.
Template: Linux Server (Lorax)
Probe: Time Sync
Condition: degraded
Time Sync Offset: -174 seconds
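One way to sanity-check the "different reference clocks" idea is to group the reported offsets and see whether they fall into distinct bands instead of drifting steadily. The following is only a minimal Python sketch: the four offsets are copied from the emails above, while `group_offsets` and the 60-second `gap` threshold are hypothetical choices for illustration and are not part of Versiera.

```python
# Offsets in seconds, copied from the four alert emails above.
offsets = [-34, -173, -15, -174]


def group_offsets(values, gap=60):
    """Group values into clusters of nearby readings.

    Values are sorted first; a new cluster starts whenever the jump from
    the previous value exceeds `gap` seconds (an assumed threshold).
    """
    clusters = []
    for value in sorted(values):
        if clusters and value - clusters[-1][-1] <= gap:
            clusters[-1].append(value)
        else:
            clusters.append([value])
    return clusters


clusters = group_offsets(offsets)
for cluster in clusters:
    print(f"{len(cluster)} samples between {cluster[0]} and {cluster[-1]} seconds")
# prints:
# 2 samples between -174 and -173 seconds
# 2 samples between -34 and -15 seconds
```

Two tight groups that alternate between samples, instead of a steady trend, would be consistent with the probe being compared against reference clocks that disagree with each other. Four samples are far too few to prove it, so treat this as a hint only.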
versiera_admin Site Admin
Joined: 14 Mar 2006 Posts: 107
Posted: Fri Feb 05, 2010 2:43 pm
We have investigated the issue and found that there was a problem with NTP on some of the Versiera servers. The NTP issue has been corrected.
Please let us know if this has resulted in addressing the incorrect alert being sent to you.
Versiera Support
pcooley
Joined: 07 Nov 2008 Posts: 4
Posted: Fri Feb 05, 2010 4:34 pm
Confirmed. This fixed it for me. All the Time Sync errors have disappeared.
As a side effect, it appears that the other server up/down false positives have all gone away at the same time. (My other hosts were often reporting that they were going up and down.)
versiera_admin wrote:
We have investigated the issue and found that there was a problem with NTP on some of the Versiera servers. The NTP issue has been corrected.
Please let us know if this has resulted in addressing the incorrect alert being sent to you.
Versiera Support