Incident Handling: Difference between revisions

Jump to navigation Jump to search
no edit summary
No edit summary
No edit summary
Line 16: Line 16:
# If unresolved, create new plan
# If unresolved, create new plan
# When resolved:
# When resolved:
** Verify trigger is no longer firing
## Verify trigger is no longer firing
** Mark Zulip topic as resolved if no other incidents for host
## Mark Zulip topic as resolved if no other incidents for host
** Check for related triggers and resolve them
## Check for related triggers and resolve them


==== Common Issues ====
==== Common issues ====
* SSH down: Check MaxStartups throttling, apply custom SSH config
* SSH down: Check MaxStartups throttling, apply custom SSH config
* No backup: Verify backup process is running, check devteam email
* No backup: Verify backup process is running, check devteam email
Line 31: Line 31:
# Check metrics sheet for existing milestone
# Check metrics sheet for existing milestone
# If milestone exists:
# If milestone exists:
** Add Lynx project ID to Zulip topic
## Add Lynx project ID to Zulip topic
** Add 🔁 emoji if ID already reported
## Add 🔁 emoji if ID already reported
# If no milestone:
# If no milestone:
** Add to metrics sheet
## Add to metrics sheet
** Create Lynx project (priority 99, then 20 after estimation)
## Create Lynx project (priority 99, then 20 after estimation)
** Create Kimai activity
## Create Kimai activity
** Document IDs in Zulip topic
## Document IDs in Zulip topic


=== Informational Incidents (72hr acknowledge) ===
=== Informational Incidents ===
Informational incidents must be acknowledged within 72 hours.


# Acknowledge in Zabbix
# Acknowledge in Zabbix
116

edits

Navigation menu