Incident Handling
This document describes the incident handling process and supersedes all prior documents on the process.
Deviating from the process
You may deviate from the process at any moment. A deviation should be communicated to the dev-team, preferably in the Zulip topic about the applicable incident, ideally as soon as possible after deciding to deviate.
Checklist
This checklist is a shorter, imperative version of the longer procedure below. You're encouraged to read the full procedure at least once to improve your understanding of the core material.
Critical Incidents
Critical incidents must be resolved within 16 hours.
- Acknowledge trigger in Zabbix.
- Check whether the incident is still ongoing.
- Determine whether clients are potentially affected; if so:
- notify the affected clients (Slack preferred)
- share the message sent to the client in the incident Zulip thread
- Document all actions taken in the Zulip topic.
- Create a plan of action.
- Execute the plan and document results in the Zulip topic.
- If unresolved, create a new plan.
- When resolved:
- Verify trigger is no longer firing.
- Decide when to notify the clients you informed of the incident that it has been resolved, and communicate this decision internally.
- Mark Zulip topic as resolved if no other incidents for the host.
- Check for related triggers and resolve them.
Common issues that have occurred previously and could occur again:
- SSH down: Check MaxStartups throttling, apply custom SSH config
- No backup: Verify backup process is running, check the devteam email
- HTTPS down on Sunday: This can be due to GitLab updates
Non-Critical Incidents
Non-critical incidents must be acknowledged within 9 hours and resolved within 1 week.
- Acknowledge in Zabbix thread
- Check metrics sheet for existing milestone
- If a milestone exists:
- Add Lynx project ID to Zulip topic
- Add 🔁 emoji if ID already reported
- If no milestone exists:
- Add to metrics sheet
- Create Lynx project (priority 99, then 20 after estimation)
- Create Kimai activity
- Document IDs in Zulip topic
Informational Incidents
Informational incidents must be acknowledged within 72 hours.
- Acknowledge in Zabbix
- Verify issue
- Take action if needed
External Reports
- Acknowledge receipt
- Classify report as critical, non-critical or informational.
- Create a Zulip topic in SRE # Critical, SRE ## Non-critical or SRE ### Informational (depending on classification) and add sufficient details.
- Proceed with checklist above for the type of incident.
Full procedure
General Rules
- When an incident is in progress and person A is handling it, all incidents in that area are handled by person A rather than the FR, until person A's working day ends. Person A should communicate clearly to the FR when their day is over.
- FR always has the last word on what solution to apply for resolving an incident.
Critical incidents
Critical incidents are resolved within 16 hours.
As first responder you take on the responsibility of seeing an incident resolved. This does not mean that you are the person required to do all the work. You can attempt to involve others to help you (often referred to as escalating the incident), but since others are not on-call, they are not obliged to help you, especially outside of normal working hours. Involving multiple people can quickly become necessary if multiple critical incidents with different causes occur simultaneously. In that case, the First Responder usually takes on more of an information-management role and steers those that are brought on into resolving the issues.
(Example: if a server crashes, several critical triggers can fire, but the underlying cause can quickly be determined to be a single issue, the crashed server, so you wouldn't need to call in people to manage each incident. But a client's service being down in one cluster while in a different cluster a different VM no longer boots is likely two different issues, so you'd want to call in help to resolve both incidents in time.)
Process
The general process is made up of the following steps. Each step has additional information on how to handle/execute it in the sections below.
- Take responsibility for seeing the incident resolved
- Determine if incident is still ongoing
- If ongoing: Communicate to affected clients that the issue is being investigated
- Communicate plan/next steps (even if that is gathering information)
- Communicate findings/results of executed plan, go back to previous step if not resolved
- Resolve incident + cleanup
While working on an incident it is expected that all communication is done in the incident's thread. This means all information about a problem can be found in a clear and predictable place. Sometimes an incident can be resolved by work done in another incident. In that case, it is required to post a link to that thread in the incident's thread with the comment that the resolution is done in that thread.
Acknowledge the incident on Zabbix
The first step is to take responsibility for seeing the incident resolved by acknowledging the incident on Zabbix. Simply acknowledging the trigger suffices. It is however entirely possible that multiple critical incidents are firing at the same time. This can be a coincidence, or can be because of a shared cause of failure. For example, a server crashing will cause that server's VMs to reboot, and the router having a connectivity issue will lead to most other VMs having connectivity issues as well. If there are multiple critical incidents, it is advised to quickly observe what's ongoing (Zabbix is the best source of firing triggers for this) and pick the incident that is likely the root cause to handle first.
- Acknowledging an incident on Zabbix stops Zabbix from calling the First Responder to notify them of the ongoing incident, and stops Zabbix from posting reminders on Zulip.
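Acknowledging is normally done in the Zabbix UI, but for illustration, the same action via the Zabbix JSON-RPC API might look roughly like this (the URL, event ID, and token are placeholders; on newer Zabbix versions the token goes in an `Authorization: Bearer` header instead of the `auth` field):

```
# Acknowledge event 12345 and attach a message.
# action 6 = acknowledge (2) + add message (4) in Zabbix's action bitmask.
curl -s -X POST https://zabbix.example.org/api_jsonrpc.php \
  -H 'Content-Type: application/json' \
  -d '{"jsonrpc": "2.0", "method": "event.acknowledge",
       "params": {"eventids": "12345", "action": 6,
                  "message": "FR: taking responsibility for this incident"},
       "auth": "<api-token>", "id": 1}'
```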
Determine if incident is still ongoing
The next step is to check if the reported problem is still ongoing. Depending on the observations made here, the process to follow and the steps needed to resolve the incident can change. There are three options:
- The trigger resolved itself and the problem cannot be observed. Example: HTTPS is down for a site, but the FR can access the site through HTTPS without incident.
- The trigger resolved itself and the problem can still be observed.
- The trigger is still firing but the problem cannot be observed: our triggers might not be perfect, so it could be that something else is causing it to fire. A simple example would be that Zabbix reports that the DNS for a site can't be resolved, but in reality there's a bug in the script we wrote to check DNS resolution, and the DNS resolves fine. Final note: keep in mind that 'it works on my machine' does not necessarily mean it works for most other people, so depending on the trigger you need to evaluate whether your tests suffice.
In order to make sure you are actually trying to observe the same thing the trigger is looking for, make sure to check the trigger definition and the current data of the associated item(s). Some triggers might fire if one of multiple conditions is met (such as a trigger that monitors the ping response time firing if the value exceeds a certain threshold, or if no data was observed for a certain period of time).
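For example, a hypothetical ping trigger in current Zabbix expression syntax (the host and item key are invented for illustration) could be `last(/web-01/icmppingsec)>0.5 or nodata(/web-01/icmppingsec,5m)=1`: it fires both on a slow ping response and on five minutes without data, and only the item history tells you which of the two conditions you are actually looking at.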
Make sure to report your findings in the incident's thread. It's advised to post a screenshot of the relevant item(s) and your own observations. (Continuing the ping example, you would post a screenshot of the relevant values, state your conclusion why the trigger is firing, and your own observations/pings)
Communicate to affected clients
If the incident is still ongoing and the service is down, we need to communicate to affected clients that we are aware of the problem and that we are investigating it. Critical incidents usually mean the service is down, something the clients can notice or are affected by, so we want to be transparent that something is going on. There are some additional notes to this though:
- If an incident has already resolved itself and the problem is no longer observable, we don't communicate anything. Doing so might only cause confusion, and since the client has not reported any issues, they have not had a noticeable problem with it themselves.
- Although a critical incident generally means that the client's service is down or experiencing reduced service, not all critical incidents are of that nature. Some are more administrative, or are only an issue for Delft Solutions itself. As of writing I don't have an exhaustive list, but here are those I can think of:
- SSH service is down: We don't have any clients that SSH into their services, so it's generally not a client-facing problem. SSH is mostly used for SRE maintenance and for publishing new builds. The SRE maintenance is an internal problem, so no need to communicate to the client. Publishing is done to Kaboom and the two SM VMs, so SSH being down there prevents new builds from being published.
- No backup for x days: Clients don't notice if a backup is running late, so no need to communicate with clients. Just make sure the backup gets completed.
- SSL certificate is expiring in < 24 hours: This depends a bit on how soon the incident is handled, but if it is handled quickly, the certificate never actually expires and there has been no disruption to the client's service, so there is no need to communicate about it.
- Determining which clients are affected can be done by looking at the host's DNS in the trigger, and/or looking up the VM in Proxmox and checking the VM's tags for client names (see the sketch after this list). If this issue is causing multiple other critical triggers to fire, you will have to check which clients are affected by those incidents as well.
- Communicating to DS about ongoing incidents is usually assumed to have been done automatically by the fact that the incident was reported on Zulip.
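As a sketch of the client lookup mentioned above, on the Proxmox node hosting the VM (the VM ID is a placeholder):

```
# Print the VM's configuration and filter for its tags (which carry client names):
qm config 100 | grep -i '^tags'
```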
As always, report the decisions taken and actions performed in the incident thread. (e.g.: I've sent a message in the Slack to let Kaboom know that we are aware of problem x, and that we are investigating it.)
Communicate plan/next steps + Communicate findings/results of executed plan
This is the main part of handling an incident. There are several actions you can take in these steps, but at the base they consist of sharing your next steps, performing them, and reporting the results. The reason all of this needs to be reported is to ensure that all known information about a problem is logged. That makes it easier to onboard someone else into the issue, provides a reference if a similar issue is encountered later, and is even useful during the incident itself in case an older configuration needs to be referenced after you changed it. The objective of these steps is determining what is actually wrong and how to resolve it. Depending on the observations made earlier on whether the incident is still ongoing and is (still) observable, your investigation can go in different directions. (e.g. find the underlying cause for a trigger, or determine why the trigger is firing while it likely shouldn't, and then how to resolve that underlying cause or how to update the trigger to work better)
There are three main types of steps defined, but you are not limited to these:
- Hypothesis: If you have an idea what could be causing it, state your hypothesis; your next step is then to prove that hypothesis. For example, for an incident 'SSH service is down on X' your hypothesis could be that this is due to 'MaxStartups' throttling, which can be proven by grep'ing journalctl for that and comparing the start and end times of the throttling with the timestamps of the item reporting the status of the SSH service.
- Information gathering: Sometimes it just helps to collect some facts about the situation. What information is relevant depends on the trigger, but some examples are: the syslog/journalctl of the host from around the time of the incident (it can contain a reference to an underlying problem in various levels of explicitness), or the ping responses from several hosts on the route to a host or a traceroute (this helps with networking issues); see the sketch after this list. The gathered information is usually intended to help you come up with a hypothesis on what's wrong.
- Investigative: The most rigorous of the processes. The full process is originally described in Drive - Final Countdown - General Investigative Process. To summarize: when you don't know why something is failing, and/or don't have any decent hypotheses to follow up on, you can follow this process to systematically find the problem.
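As a sketch of the information-gathering commands mentioned above (the hostname and time window are placeholders):

```
# Journal entries of priority warning and above around the incident window:
journalctl --since "2025-10-17 05:00" --until "2025-10-17 07:00" -p warning

# Basic reachability and latency towards the host:
ping -c 5 host.example.org

# Where along the route packets are delayed or dropped:
traceroute host.example.org
```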
Regarding the resolution to an incident: The resolution to any incident is usually one of two things:
- Fix the underlying problem.
- Fix the trigger itself.
Fixing the trigger is relatively straightforward, but make sure to document in the thread what you changed in which trigger. Fixing the underlying problem can be more complex. A trade-off sometimes needs to be made between resolving technical debt and simply patching the current system to resolve the issue. We usually look for a resolution that ensures the problem won't re-occur soon, or makes it unexpected/unlikely for the problem to re-occur. Taking into account the time frame available to resolve the incident, you can make some trade-offs. An example: normal backups of VMs are failing because the Proxmox Backup Server is down/unreachable, and it is determined that this cannot be resolved at that moment. We can temporarily set up automatic backups to local storage to resolve the immediate problem and keep our SLOs, versus setting up a new Proxmox Backup Server at a different location. Since we don't have much time to resolve the problem, the resolution would be to set up the automatic backups to local storage, and set up a new Proxmox Backup Server later as a separate issue.
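As a minimal sketch of the stopgap in the example above (the VM ID, storage name, and options are assumptions, not a prescribed command):

```
# One-off backup of VM 100 to local storage while the Proxmox Backup Server is unreachable:
vzdump 100 --storage local --mode snapshot --compress zstd
```

For an actual incident you would schedule this (e.g. as a temporary backup job) rather than run it once.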
Some known issues and their resolutions
- SSH service is down: The internet is a vile place. There's constant port scanning and there are hacking attempts against any machine connected to the internet (mostly over IPv4). Due to this, SSH has a throttling functionality built in to prevent a system from being DDoS'ed by the volume of malicious SSH requests. This throttling can cause the Zabbix server to be denied an SSH connection, and several such failures fire this trigger. This hypothesis can be proven with `journalctl -u ssh | grep 'MaxStartups throttling'` (you probably want to select a relevant time period with `--since "2 hours ago"` or something similar to prevent having to process a month of logging); see the sketch after this list. You can then compare the throttling start and end times with the timestamps of the item data itself. The resolution for the issue is to add our custom SSH configuration (Custom SSH Configuration).
- No backup for 3 days: Our S3 backup is very slow. There isn't much of an underlying issue to prove here; what needs to be done is check that the backup process is ongoing. The Zabbix latest data can be used to verify that backups are running by checking that that day's backups were done for the smaller buckets. The devteam email can be checked for whether the backup process could not start on a given day due to it already running (it takes 24+ hours, and cron attempts to start it each day).
- git.* HTTPS is down: Mostly on Sundays, GitLab gets automatically updated, which incurs some downtime as the service is restarted. This is usually short enough not to be reported to Zulip as per our settings, but sometimes it's longer. If the service does not stay down, the incident can simply be resolved.
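As a sketch for the SSH item above (the time window is an example; the `MaxStartups` values shown are OpenSSH's defaults, not our custom configuration):

```
# Look for throttling events in the SSH daemon's journal:
journalctl -u ssh --since "2 hours ago" | grep 'MaxStartups throttling'

# The relevant sshd_config directive has the form start:rate:full, e.g. the default:
#   MaxStartups 10:30:100
# i.e. start refusing 30% of new unauthenticated connections once 10 are open,
# and refuse all of them at 100.
```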
Resolve incident + cleanup
When you've executed and verified the resolution in the previous steps, you can proceed to resolve the incident in our Zabbix-Zulip integration. Resolving an incident is done as follows:
- Verify that the trigger is no longer firing. An incident will be immediately re-opened if the trigger is still firing, and the incident cannot be considered resolved while it is. If the trigger is still firing but you're sure you've resolved the problem, you might need to force the item the trigger depends on to update. This can be done by finding the item in the host's configuration on Zabbix and selecting 'Execute Now'; after a short period this forces Zabbix to re-execute the item (see the sketch after this list). You can check the timestamps in the item's latest data to see if it was updated.
- Close the incident by marking the topic as resolved, when there are no other triggers firing for the host.
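If forcing the update from the UI is inconvenient, the same 'check now' task can, for illustration, be created via the API (the item ID and token are placeholders; task type 6 is 'Check now'):

```
# Ask Zabbix to re-execute item 23456 immediately:
curl -s -X POST https://zabbix.example.org/api_jsonrpc.php \
  -H 'Content-Type: application/json' \
  -d '{"jsonrpc": "2.0", "method": "task.create",
       "params": [{"type": 6, "request": {"itemid": "23456"}}],
       "auth": "<api-token>", "id": 1}'
```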
Unfortunately, some problems cause multiple critical and non-critical triggers to fire. This means we have to check Zabbix and Zulip for other fired triggers and ongoing incidents. The goal is to identify critical and non-critical incidents that were caused by the incident/underlying issue you just resolved.
- First, these incidents need to be acknowledged on Zabbix, and in the acknowledgement message you mention the incident/problem that caused this.
- Next, check the incidents tracked by the Zulip integration using the `?ongoing` command. Resolve incidents that were (re-)opened by this incident by executing the following steps. If the first two fail (the problem still persists, or the trigger is still firing), the incident needs to be considered its own issue and the relevant process needs to be followed (critical or non-critical depending on criticality).
- Ensuring the mentioned problem is no longer observable
- Verifying the trigger has resolved (you might need to force an update with `Execute Now`).
- Posting a link to the main incident you resolved with the comment that the underlying problem was resolved in that topic.
- Closing the incident by marking the topic as resolved, when there are no other triggers firing for the host.
When you are done, there should be no critical triggers firing in Zabbix, or open in the Zabbix-Zulip integration, that no one has taken responsibility for, or that you have taken responsibility for but are not actively handling.
Non-Critical incidents
- Non-critical incidents are acknowledged within 9 hours and resolved within one week.
Acknowledging
Fully acknowledging a non-critical incident requires the following tasks to have been completed:
- Acknowledging the incident on Zabbix, which means you take responsibility for completing the steps listed below.
The next steps don't have to be done immediately, as they have dependencies, but they should be started and scheduled for completion the next work day.
Check if there's already an uncompleted milestone for this host with this issue in the metrics sheet. If a milestone is already present:
- Report in the topic the Lynx project ID for resolving this issue.
- If the ID has already been reported in the topic, we don't want to report it again and again; instead, add the 🔁 emoji (:repeat:) under the Zabbix bot alert
If a milestone is NOT already present:
- Add the non-critical incident as a milestone in the metrics sheet, following the naming convention
- Start date is the date of the incident
- DoD states what needs to be true for the non-critical incident to be considered resolved
- Add the non-critical incident to Lynx as a project
- Follow the naming convention below for the title & project ID
- Tasks need to be added
- The final task needs to have the SLO deadline set as 'constraint'
- Project priority is set to 99 while not estimated yet. After the estimation is done, the priority should be set to 20
- The tasks are estimated for SP
- The Lynx project ID is reported in the non-critical incident's topic on Zulip, and logged in the metrics sheet
- A Kimai activity is created for the non-critical incident, following the naming convention
Naming convention
- Kimai activity name needs to follow the pattern: '<YYYY-MM> <problem_title>'. For <problem_title>, incorporate the trigger title and hostname for clarity.
- Milestone name needs to follow the pattern: 'Delft Solutions Hosting Incident response work <kimai_activity_name>'
- Lynx project name needs to follow the pattern: 'Delft Solutions Hosting Incident response work <kimai_activity_name>'
- Lynx project ID needs to follow the pattern: 'SRE<YYMM><XXX>', where <XXX> is some three-letter shorthand that relates to the problem/host (see the example below)
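For illustration, a hypothetical incident 'SSH service is down on build-host' first reported in October 2025 might yield (all names invented):
- Kimai activity: '2025-10 SSH service is down on build-host'
- Milestone: 'Delft Solutions Hosting Incident response work 2025-10 SSH service is down on build-host'
- Lynx project: 'Delft Solutions Hosting Incident response work 2025-10 SSH service is down on build-host'
- Lynx project ID: 'SRE2510SSH'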
Informational incidents
- Informational incidents are acknowledged within 72 hours
Checklist
- Acknowledge on Zabbix
- Sanity check the event, post result in thread
- If action needed, perform action
If an incident is reported by other means than the Zabbix-Zulip integration
Besides the automated Zabbix-Zulip integration, incidents can also be reported through emails from cron jobs, direct emails from customers, or topics in SRE General (such as alerts about Zulip updates or issues raised by colleagues), etc.
- Acknowledge receipt.
- Classify the incident as critical, non-critical, or informational.
- Create a topic in the relevant SRE channel, stating the problem and that you are responsible for resolving it.
- Proceed to treat the incident according to the criticality you just classified it as. (So for a critical incident, you now start the critical incident handling process.)
Handover
See Handover