Versiera Support Forum Index Versiera Support
NetCraft Communications Inc.
 
 FAQFAQ   SearchSearch   UsergroupsUsergroups 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

Monitoring Time Sync Probe false positives

 
Post new topic   Reply to topic    Versiera Support Forum Index -> Bugs or Issues
View previous topic :: View next topic  
Author Message
pcooley



Joined: 07 Nov 2008
Posts: 4

PostPosted: Fri Feb 05, 2010 8:43 am    Post subject: Monitoring Time Sync Probe false positives Reply with quote

I have the versiera agent installed on an Ubuntu Server:
Linux lorax 2.6.31-17-server #54-Ubuntu SMP Thu Dec 10 18:06:56 UTC 2009 x86_64 GNU/Linux

I've recently recovered this server after its been off line for a month. It has a Time Sync Probe to make sure it doesn't drift too far.


My symptom is that the alert appears to go off once every 10 to 20 minutes. My investigation suggests that NTP isn't needing to actually adjust the time by 174 seconds (ever). Below are a set of email alerts I received in an hour time frame. Over a day this can amount to 50 or more emails.

My Instinct -- (of course I could be in left field)? There are three load balanced servers that each have a different time. Depending on which one I hit there is a different time on the server?

---

Versiera Service to pcooley
show details 4:43 AM (41 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])

The condition of the alert has changed to normal.

Template: Linux Server (Lorax)
Probe: Time Sync
Condition: normal
Time Sync Offset: -34 seconds
---
Versiera Service to pcooley
show details 4:53 AM (30 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])

The condition of the alert has changed to degraded.

Template: Linux Server (Lorax)
Probe: Time Sync
Condition: degraded
Time Sync Offset: -173 seconds
---

Versiera Service to pcooley
show details 5:03 AM (20 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])

The condition of the alert has changed to normal.

Template: Linux Server (Lorax)
Probe: Time Sync
Condition: normal
Time Sync Offset: -15 seconds

---

Versiera Service to pcooley
show details 5:23 AM (0 minutes ago)
Alert 376 changed (Host lorax [192.168.0.202])

The condition of the alert has changed to degraded.

Template: Linux Server (Lorax)
Probe: Time Sync
Condition: degraded
Time Sync Offset: -174 seconds
Back to top
View user's profile Send private message
versiera_admin
Site Admin


Joined: 14 Mar 2006
Posts: 107

PostPosted: Fri Feb 05, 2010 2:43 pm    Post subject: Reply with quote

We have investigated the issue and found that there was a problem with NTP on some of the Versiera servers. The NTP issue has been corrected.

Please let us know if this has resulted in addressing the incorrect alert being sent to you.

Versiera Support
Back to top
View user's profile Send private message Send e-mail
pcooley



Joined: 07 Nov 2008
Posts: 4

PostPosted: Fri Feb 05, 2010 4:34 pm    Post subject: Reply with quote

Confirmed. This fixed it for me. All the Time Sync errors have disappeared.

As a side-effect it appears other server up/down false positives have all went away at the same time. *my other hosts were often reporting that they were going up and down*


versiera_admin wrote:
We have investigated the issue and found that there was a problem with NTP on some of the Versiera servers. The NTP issue has been corrected.

Please let us know if this has resulted in addressing the incorrect alert being sent to you.

Versiera Support
Back to top
View user's profile Send private message
Display posts from previous:   
Post new topic   Reply to topic    Versiera Support Forum Index -> Bugs or Issues All times are GMT - 5 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group