AI for IT Operations (AIOps) on Cloud Platforms: Reviews, Opportunities and Challenges
Cheng, Qian; Sahoo, Doyen; Saha, Amrita; Yang, Wenzhuo; Liu, Chenghao; Woo, Gerald; Singh, Manpreet; Saverese, Silvio; Hoi, Steven C. H.
arXiv.org Artificial Intelligence
Artificial Intelligence for IT operations (AIOps) aims to combine the power of AI with the big data generated by IT Operations processes, particularly in cloud infrastructures, to provide actionable insights with the primary goal of maximizing availability. There is a wide variety of problems to address, and multiple use cases, where AI capabilities can be leveraged to enhance operational efficiency. Here we provide a review of the AIOps vision, trends, challenges, and opportunities, focusing specifically on the underlying AI techniques. We discuss in depth the key types of data emitted by IT Operations activities, the scale and challenges in analyzing them, and where they can be helpful. We categorize the key AIOps tasks as incident detection, failure prediction, root cause analysis, and automated actions. We discuss the problem formulation for each task and then present a taxonomy of techniques to solve these problems. We also identify relatively underexplored topics, especially those that could significantly benefit from advances in the AI literature. Finally, we provide insights into the trends in this field and the key investment opportunities.
Apr-10-2023
- Country:
- Europe (0.92)
- North America > United States
- California (0.67)
- Genre:
- Overview (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.46)
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (1.00)
- Natural Language (1.00)
- Representation & Reasoning > Rule-Based Reasoning (0.92)
- Cloud Computing (1.00)
- Communications > Networks (1.00)
- Data Science > Data Mining
- Anomaly Detection (0.76)
- Big Data (1.00)
- Information Management > Search (0.92)