For details, see ” Points where DigitalStacks is chosen “.
In [Mail, SNMP, syslog], only mail does not require API linkage and can be set from the management screen. API transmission is required to send syslog and SNMP alerts. API integration is not required
via Integration (eg, Zabbix for SNMP, Splunk for syslog, etc.) that works with PagerDuty .
When setting up a schedule with a multi-person system (the same time zone is handled by multiple people), there are two setting methods below. ・ Users corresponding to the same time are set as different schedules. (Multiple users cannot be set in the same time zone within the same schedule.) ・ Define different schedules in the same escalation rule of the escalation policy.
When creating a new Team, you can also select an Escalation Policy, but the Escalation Policy displayed as a candidate at this time will be the one that does not belong to any Team. An Escalation Policy belongs to only one Team, but one Team can have multiple Escalation Policies. The creation order is as follows: Team creation (no Escalation Policy setting) → Escalation Policy creation. Specify Team at that time and create Escalation Policy. In that case, do not specify Team → Create Team. At that time, you can specify either Escalation Policy.
PagerDuty’s basic roles include Account Owner, Admin, User, Limited User, and Stakeholder (aka Oserver). Stakeholder Role is available only for standard or enterprise plans. If your account includes the Team Responder Role, this role will be retired in the future.
The limits of the number of users and the number of escalation policies that can be set in Team are as follows. ■ Maximum number of users in Team: 500 ■ Maximum number of escalation policies in Team: 100
In the event of an incident, the on-call responder performs an Ack operation on the PagerDuty screen. If you set the timeout value of Ack, if the confirmation is not made within the timeout period, the incident will be triggered again. The timeout value of Ack can be set from the Settings of the service. The default is set to 30 minutes.
Use the override function. Scheduling using the override feature creates an override layer on an existing schedule. The scheduled override layer is displayed below the schedule layer as the lowest layer, and the schedule set by the override function has the highest priority.
You can stay “Verified” for 1, 4, 8, 24, or other time intervals. As an incident responder, you can snooze an incident while troubleshooting a problem. If you have a problem that you can wait for the next day, you can keep it awake again.
PagerDuty allows you to specify comprehensive escalation rules. When an incident is triggered, PagerDuty will first attempt to contact the Level 1 on-call engineer to report the incident. PagerDuty will automatically escalate a warning to Level 2 engineers if the person does not respond within the user-specified escalation timeout.
PagerDuty is an alarm aggregation and dispatch service for system administrators and support teams. Collect alerts from monitoring tools, view all monitoring alarms globally, and alert personnel if there is a problem.
You should consider using PagerDuty if: You need to add the ability to send SMS and phone calls to existing monitoring tools. You need to add on-call scheduling, escalation, and incident tracking to your existing monitoring tools. You need a single place to view the overall health of your system, regardless of the number of monitoring tools you use.
There are means of push notification to phone, SMS, eMail, smartphone (iPhone / Android). In PagerDuty, it is possible to set the notification method for each person in charge.
It is an extension of PagerDuty that can be connected to Slack, HipChat and other tools as an incident management workflow.
Please refer to the URL below for PagerDuty’s pricing system. https://pagerduty.digitalstacks.net/pricelist
The assignment of the person in charge can be flexibly defined such as day and time in the escalation policy.
This is the mechanism for assigning a person in charge after an alert notification, and defines primary receiving, secondary receiving, etc.
Alerts are notifications that are entered into PagerDuty from various monitoring tools. Incidents are notifications from services that are aggregated and defined in PagerDuty.
From the PagerDuty Web Portal screen, you can reassign by Incident Unit by “Reassign”.
You can set a maintenance window for each service and exclude the period from monitoring.
It becomes “Triggered” at the time of notification, “Ackknoledged” when the person in charge responds, and “Resolved” when it is resolved.
From the PagerDuty Web Portal screen, multiple related incidents can be grouped together to streamline processing between teams and centrally manage important information.