Incident Handling: Difference between revisions

From Delft Solutions
Jump to navigation Jump to search
(Created page with "* Critical incidents are resolved within 16 hours. * Non-critical incidents are diagnosed within 9 hours and resolved within one week.")
 
No edit summary
Line 1: Line 1:
== Critical incidents ==
* Critical incidents are resolved within 16 hours.
* Critical incidents are resolved within 16 hours.
* Non-critical incidents are diagnosed within 9 hours and resolved within one week.
 
Checklist
# Acknowledge on Zabbix and state who is responsible for resolving this in the description
# Determine affected clients
# Communicate to affected clients that the issue is being investigated
# Communicate plan/next steps (even if that is gathering information)
# Communicate findings/results of executed plan, go back to previous step if not resolved
# Resolve incident
 
== Non-Critical incidents ==
* Non-critical incidents are acknowledged within 9 hours and resolved within one week.
 
Checklist
# Acknowledge on Zabbix and state who is responsible for resolving this in the description
# Communicate plan/next steps (even if that is gathering information)
# Communicate findings/results of executed plan, go back to previous step if not resolved
# Resolve incident
 
== Informational incidents ==
* Informational incidents are acknowledged within 72 hours
 
Checklist
# Acknowledge on Zabbix
# Sanity check the event, post result in thread

Revision as of 06:12, 4 August 2023

Critical incidents

  • Critical incidents are resolved within 16 hours.

Checklist

  1. Acknowledge on Zabbix and state who is responsible for resolving this in the description
  2. Determine affected clients
  3. Communicate to affected clients that the issue is being investigated
  4. Communicate plan/next steps (even if that is gathering information)
  5. Communicate findings/results of executed plan, go back to previous step if not resolved
  6. Resolve incident

Non-Critical incidents

  • Non-critical incidents are acknowledged within 9 hours and resolved within one week.

Checklist

  1. Acknowledge on Zabbix and state who is responsible for resolving this in the description
  2. Communicate plan/next steps (even if that is gathering information)
  3. Communicate findings/results of executed plan, go back to previous step if not resolved
  4. Resolve incident

Informational incidents

  • Informational incidents are acknowledged within 72 hours

Checklist

  1. Acknowledge on Zabbix
  2. Sanity check the event, post result in thread