AIOPS
How to get started with AIOps It can be stressful for IT Ops to manage, and AIOps is one solution coming to help IT improve system reliability and customer satisfaction while reducing some of the manual work. BY ISSAC SACOLICK, ON BEHALF OF IVANTI “Houston, we have a problem.” This is exactly what people in IT Operations think whenever a series of monitoring alerts go off simultaneously. Within five minutes, they receive the invite to the bridge call and start to read out what each monitor is reporting. The team reviews incidents raised in Cherwell, network alerts from Nagios, system alerts in LogicMonitor, log files in Splunk, and Jenkins deployments to identify potential causes and decide a course of action. Fifteen minutes into the call and business leaders join to get status and remind everyone of the expected service levels on business-critical applications. Business leaders have higher expectations on system reliability and performance, especially on customerfacing applications and critical workflows. It can be stressful for IT Ops to manage, and AIOps is one solution coming to help IT improve system reliability and customer satisfaction while reducing some of the manual work.
6
WWW.DIGITALISATIONWORLD.COM
l
ISSUE II 2021
l
COPYRIGHT DIGITALISATION WORLD
What is AIOps? Proactive IT leaders look to apply AIOps capabilities to reduce complexities, enhance employee experiences, and improve service levels.
AIOps refers to applying AI and machine learning capabilities to support IT operations. A musthave outcome of AIOps helps IT correlate multiple monitoring alerts into a single, time-sequenced incident that’s easier to review and faster to resolve. It might show that a Continuous Integration (CI)/ Continuous Delivery (CD) deployment triggered database failures followed by application errors and group them into a single incident. An incident manager seeing this sequence can quickly deduce the root cause, consult the development team on the recent changes, and determine the required steps to restore service.
AIOps in incident management and processing data from multiple monitoring tools is one use case of platform intelligence. Applying AI and machine