Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Turn alarms on and off on a schedule #41

Open
AnthonySteele opened this issue Feb 17, 2017 · 1 comment
Open

Turn alarms on and off on a schedule #41

AnthonySteele opened this issue Feb 17, 2017 · 1 comment

Comments

@AnthonySteele
Copy link
Contributor

AnthonySteele commented Feb 17, 2017

It is frequently requested that we add the feature that the alerting group or an item in it contains schedule when the alarm is active, or stop alerting during "quiet hours".

A run would have to look at this schedule and decide to Disable or re-enable relevant Cloudwatch alarms.

Discussion of pros and cons goes here.

I'm not confident that turning alarms off is a good idea. Would prefer to make an alarm that works 24/7 (i.e. goes of for the right reasons but no false alarms). We have looked into several over-sensitive alarms and usually we identify a current, new or requested AWS feature that we need to fix them - e.g.

@AnthonySteele AnthonySteele changed the title Disable alarms on a schedule Trun alarms on and off on a schedule Feb 17, 2017
@AnthonySteele AnthonySteele changed the title Trun alarms on and off on a schedule Turn alarms on and off on a schedule Feb 17, 2017
@tomhaigh
Copy link
Contributor

tomhaigh commented Feb 28, 2017

Another option would be to send these alarms via a lambda (and a secondary sns topic). At the moment we have this:

Cloudwatch -> Alarm SNS topic -> Target (e.g. Pagerduty URL)

For alarms which don't need to be active all the time we could have this:

Cloudwatch -> New intermediate SNS topic -> Lambda -> Alarm SNS topic -> Target (e.g. Pagerduty URL)

The lambda would need to know about the alarm schedule and drop any messages during the times the alarm is scheduled to be off. At other times it would just pass it on to the usual SNS topic that we use to notify the Email/URL targets.

One way that the lambda could read the alarm schedule is that AWS watchman could publish it to an s3 bucket, and the lambda could read it from there - would just need to decide on a filename format that could be read out of the SNS message when the alarm goes off.

The benefit of this approach is that alarms do not need to be modified on a schedule, which has risk (e.g. if the alarm reactivation fails). Obviously the lambda would have other risks though,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants