When we went looking for an alerting solution, we wanted something that would make life easier for our folks on-call, as well as something that would help us clean up and make sense of the nigh-impassable flood of alert emails we were seeing constantly from our monitoring. Allowing folks to use their own devices also meant that there was no confusion as to how to reach out to someone, and it's something they always have on them. We were immediately captured by the Twitter-like timeline, so that every person (on-call or not) could view the incoming stream from the entire environment, and the rest of the features common in this space seemed very well-implemented. Since then, they've added tons of new features (the Transmogrifier and Control Calls each were game-changers) that have only made the service better. My director has identified this as the single best product we've implemented in the past two years, based on its impact to our team's efficiency and happiness.
- Communication-centric design - great for team efforts - Very responsive support and dev team - Easy to set up, easy to use - Lots of integrations available - Contact details in the app makes reaching out to teammates easier in a crisis
- Some advanced/niche scheduling options are not currently possible - Data on old alerts can be difficult or impossible to obtain, depending on age - Reporting is a work in progress
It's wonderful affirmation for us when we hear that VictorOps has been a positive driver in team efficiency and happiness! We're also continuing to advance our development efforts (our engineering team is also growing!), so keep an eye out for enhancements on reporting and scheduling!
Splunk's purchase of VictorOps gives the opportunity for major improvement and excellent customer support.
Provides all of the necessary functionality for managing multiple teams of on-call professionals across an organization. App and Slack integration provide a feature-rich experience. override existing schedules, acknowledge or snooze alerts, see detailed history, and run reports.
Some features are buried in menus, and the calendar is a little hard to use at times. Some options require clicking a specific toggle icon instead of just the text.
Likelihood to recommend: 10/10
This is a really good tool to integrate into your tech stack if you need an alert filter, which you probably do. The down side is that it's very basic and routing alerts takes a degree of skill deploying regex or matching conditions. The cost point is very amicable for a decent filtering tool, but the tool itself has a lot of room for polish which could be a detriment to less technical departments.
The software does a good job of the basics. It seemed easy enough to give out to the whole team and we seem to get the needed functionality out of it. The reports are straightforward and easy to identify the problem areas. Integrations are via API so any noise generator that can make an HTTP POST in theory could be integrated.
5,000 total alerts per month maximum which may sound like a lot, but a growing enterprise company may find that total to be limiting. The UI is very basic, possibly by design or my limited permissions but in a lot of ways it feels hollow and lacks any real character if you have to use it daily. The mobile application works well, and again keep things organized, but underwhelms visually and doesn't captivate the user in any way.
Likelihood to recommend: 5/10
Prior to Bluestem Technologies migration to VictorOps, only a few of our squads were using Pagerduty for on-call incident management. Our major goals and justifications for switching to VictorOps was onboarding our entire technology team onto one on-call solution as well increasing our major incident responsiveness. VictorOps was chosen over PagerDuty for it’s people-centric focus, feature capabilities and cost.
The transmogrifier has been an extremely useful tool for many of our teams. We've added playbook annotations to most of our alerts which has been a boon for our 24 by 7 support team. The Datadog integration has also shown VictorOps's savviness and people-centric approach to building a product. A Datadog VictorOps incident will send all of the pertinent information and graphs to an on-call users phone. The VictorOps mobile app can also be used as a platform to seamlessly chat with other users. We've also been encouraged by some of VictorOp's new features they have started to roll out. Their Statuspage.io integration has made many in our organization excited about the ability to have a one-stop-shop dashboard to communicate down-time incidents to a diverse group of users within our company. Their brand-new Control Calling feature (a feature that creates a conference call and pages users to join, penned "the new bat signal" by VictorOps) is also an intriguing feature that several of our teams have started to explore. Possibly our favorite aspect of working with VictorOps over PagerDuty is how responsive their team has been to both our support and development requests.
VictorOps is still lacking some of PagerDuty's clerical and reporting functionality; for example, we no longer have the ability to easily reference a specific incident in the past.
Thanks for such a positive review! We see people make the switch from PagerDuty on a regular basis, not just because of our feature set, but because we also put our users first. It's great that you noticed that too. It's also good to note that if you capture each incident as a post-mortem report, you'll always be able to go back and reference it. However if you need a more specific way to reference past incidents, I would definitely suggest you drop our support team a line. They are constantly looking for ways to improve the product, and would love to hear your ideas.
Using VictorOps is like adding another member to your team. By removing the cumbersome problem of over-active alerts, which can cause some serious team fatigue which we experienced prior to using victor ops, you have the ability to add a number of important options relative to the alerts you have created. With the Transmogrifier you can filter through irrelevant alerts with great ease, delivering only the critical alerts to your team. This is of course a process that involves tuning and can take some time. We use Nagios as one of our monitoring tools and the integration with victor ops was seamless. Allowing us to quickly adapt to the ever changing environment that makes up our infrastructure.
I am really looking forward to our increased use of the Transmogrifier and its ability to attach documentation to alerts on the fly. I can hardly wait until the next alert that rolls in, including a specific and detailed path to solving the problem that is currently alerting. With the Transmogrifier, this feature is only a few clicks away and I am speaking to our team on this on a regular basis in order to improve the team’s efficiency bringing newer folks up to speed faster. The mobile app interface is outstanding and is constantly improved based upon user feedback.
The only Con that I have discovered is the time it takes to weed through the alerts that you setup. The robustness of what alerts can be programmed in to be registered is an ongoing process as new systems get added.
Wow - "I can hardly wait until the next alert rolls in." Thankfully, this means we're doing exactly what we set out to do: making our users' lives better. Thanks for such a wonderful review!
Technology-enabled organizations can’t afford to wait until standard business hours to discover and address vital incidents. They need resolutions right away. VictorOps is a DevOps alerting, routing, and real-time incident management solution that decreases the time it takes resolve problems. The platform brings together IT, development, engineering, and systems administration departments with intelligent tools for efficiently resolving incidents from initial alert through to post-mortem analysis.
Designed to empower teams with the situational awareness to address incidents quickly and with complete confidence, VictorOps filters the “noise” of alerts to improve the lives of on-call technicians. The app provides support for both agile and DevOps development processes, allowing companies of all sizes to keep their focus on continuous innovation.
IT and DevOps teams are under more pressure now than ever before, as they play a key role in enabling companies to deliver on agile’s goal of providing software to customers. VictorOps is a real-time incident management platform that provides these teams with notifications when incidents occur. Most importantly, it gives teams the information and tools they need to resolve these problems in an efficient manner.
In addition to sending smarter alerts, VictorOps serves as a virtual environment where teams can prepare for, react to, and recover from technical incidents. Organizations benefit when teams are able to rapidly mobilize, as this allows for accelerated resolutions and decreased downtime. VictorOps improves the lives of on-call technicians by intelligently routing alerts to the right people, at the right time; thereby filtering the noise of unnecessary alarms. Streamlined communication, along with tools for collaboration, integration, automation, and measurement between software developers and operations teams, comes together within an elegant interface.
VictorOps offers intelligent, rule-based alert delivery and escalation. When an alert is triggered, VictorOps uses the rules that an organization has created to determine what happens next. By default, critical alerts are turned into incidents, which starts off the unique “escalation policy” defined by the individual organization.
Depending on the rules you have set up, specific incidents may or may not require actual human intervention from someone in your organization. You may opt to have incidents created as “warnings,” which is a lesser alert. You can also leave certain rules “disabled” if you want to receive alerts for staging using a simple cURL command.
Rich alerts are annotated alerts that have been setup to include links to specific technical documents that could be needed to resolve the challenge at hand. Using the Alert Rules feature, VictorOps users can elect to annotate alerts containing certain keywords with specific documentation, URL, note, or image link.
Use VictorOps’ Transmogrifier tool to find links to helpful documentation, graphs, and images in the context of your alerts. When an on-call team member responds to an alert, he or she will immediately find the resources needed to fix the problem at hand. This saves time and frustration during stressful periods.
The VictorOps platform offers a number of ways for users to chat and communicate efficiently during periods of crisis. For example, @Mentions can be used to contact specific team members directly via push notifications. All communication that takes place in this manner is recorded and logged to the main timeline, which means it is visible to other users in the organization and it can be used for reporting.
VictorOps also gives you a way to participate in private chats. Private chats are not recorded or logged on the team timeline, and they cannot be used for reporting. Rather, they consist of one-on-one communication between users. You can send a private message to a colleague using the text bar at the bottom of the user view. No @-symbols are required when you send a private message.
Post-mortem (retrospective) reports are the single best way to learn from incidents. The Post-Mortem Report automates the process of retrospection to determine what, exactly, led to the resolution of an issue. Teams can effectively take snapshots of individual incidents, which is useful in the event that a similar situation occurs in the future.
Because all activities are tied to specific incidents—including all alerts, notifications, chats, and paging steps—you’re better able to learn from the past when you use VictorOps. You also have the ability to edit out pieces of the timeline that aren’t relevant or annotate the timeline with interesting points from notes that weren’t otherwise included. VictorOps’ reporting feature makes it easier to generate uptime reports, as well.
Out-of-the-box integrations are available for a continually growing list of 60+ applications, including integrations specific to the needs of the enterprise. The VictorOps API is also available for custom integrations.
VictorOps offers both Standard and Enterprise packages. Standard packages range from $29 to $34 per user, per month, billed annually. Enterprise packages range from $49 to $59 per user, per month, billed annually.
Discounts are available for education providers. Custom add-ons are also available.