devops
CoE-Ops: Collaboration of LLM-based Experts for AIOps Question-Answering
Zhao, Jinkun, Wang, Yuanshuai, Zhang, Xingjian, Chen, Ruibo, Liao, Xingchuang, Wang, Junle, Huang, Lei, Zhang, Kui, Wu, Wenjun
With the rapid evolution of artificial intelligence, AIOps has emerged as a prominent paradigm in DevOps. Lots of work has been proposed to improve the performance of different AIOps phases. However, constrained by domain-specific knowledge, a single model can only handle the operation requirement of a specific task,such as log parser,root cause analysis. Meanwhile, combining multiple models can achieve more efficient results, which have been proved in both previous ensemble learning and the recent LLM training domain. Inspired by these works,to address the similar challenges in AIOPS, this paper first proposes a collaboration-of-expert framework(CoE-Ops) incorporating a general-purpose large language model task classifier. A retrieval-augmented generation mechanism is introduced to improve the framework's capability in handling both Question-Answering tasks with high-level(Code,build,Test,etc.) and low-level(fault analysis,anomaly detection,etc.). Finally, the proposed method is implemented in the AIOps domain, and extensive experiments are conducted on the DevOps-EVAL dataset. Experimental results demonstrate that CoE-Ops achieves a 72% improvement in routing accuracy for high-level AIOps tasks compared to existing CoE methods, delivers up to 8% accuracy enhancement over single AIOps models in DevOps problem resolution, and outperforms larger-scale Mixture-of-Experts (MoE) models by up to 14% in accuracy.
- North America > United States > Massachusetts (0.04)
- Asia > China > Beijing > Beijing (0.04)
The DevSafeOps Dilemma: A Systematic Literature Review on Rapidity in Safe Autonomous Driving Development and Operation
Nouri, Ali, Cabrero-Daniel, Beatriz, Törner, Fredrik, Berger, Christian
Developing autonomous driving (AD) systems is challenging due to the complexity of the systems and the need to assure their safe and reliable operation. The widely adopted approach of DevOps seems promising to support the continuous technological progress in AI and the demand for fast reaction to incidents, which necessitate continuous development, deployment, and monitoring. We present a systematic literature review meant to identify, analyse, and synthesise a broad range of existing literature related to usage of DevOps in autonomous driving development. Our results provide a structured overview of challenges and solutions, arising from applying DevOps to safety-related AI-enabled functions. Our results indicate that there are still several open topics to be addressed to enable safe DevOps for the development of safe AD.
- Europe > Austria > Vienna (0.14)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (9 more...)
- Research Report > New Finding (1.00)
- Research Report > Promising Solution (0.93)
- Transportation > Ground > Road (1.00)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- (3 more...)
Machine Learning Operations: A Mapping Study
Chakraborty, Abhijit, Das, Suddhasvatta, Gary, Kevin
Machine learning and AI have been recently embraced by many companies. Machine Learning Operations, (MLOps), refers to the use of continuous software engineering processes, such as DevOps, in the deployment of machine learning models to production. Nevertheless, not all machine learning initiatives successfully transition to the production stage owing to the multitude of intricate factors involved. This article discusses the issues that exist in several components of the MLOps pipeline, namely the data manipulation pipeline, model building pipeline, and deployment pipeline. A systematic mapping study is performed to identify the challenges that arise in the MLOps system categorized by different focus areas. Using this data, realistic and applicable recommendations are offered for tools or solutions that can be used for their implementation. The main value of this work is it maps distinctive challenges in MLOps along with the recommended solutions outlined in our study. These guidelines are not specific to any particular tool and are applicable to both research and industrial settings.
- North America > United States > Arizona (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Oceania > Australia > Queensland (0.04)
- (3 more...)
AI for DevSecOps: A Landscape and Future Opportunities
Fu, Michael, Pasuksmit, Jirat, Tantithamthavorn, Chakkrit
DevOps has emerged as one of the most rapidly evolving software development paradigms. With the growing concerns surrounding security in software systems, the DevSecOps paradigm has gained prominence, urging practitioners to incorporate security practices seamlessly into the DevOps workflow. However, integrating security into the DevOps workflow can impact agility and impede delivery speed. Recently, the advancement of artificial intelligence (AI) has revolutionized automation in various software domains, including software security. AI-driven security approaches, particularly those leveraging machine learning or deep learning, hold promise in automating security workflows. They reduce manual efforts, which can be integrated into DevOps to ensure uninterrupted delivery speed and align with the DevSecOps paradigm simultaneously. This paper seeks to contribute to the critical intersection of AI and DevSecOps by presenting a comprehensive landscape of AI-driven security techniques applicable to DevOps and identifying avenues for enhancing security, trust, and efficiency in software development processes. We analyzed 99 research papers spanning from 2017 to 2023. Specifically, we address two key research questions (RQs). In RQ1, we identified 12 security tasks associated with the DevOps process and reviewed existing AI-driven security approaches. In RQ2, we discovered 15 challenges encountered by existing AI-driven security approaches and derived future research opportunities. Drawing insights from our findings, we discussed the state-of-the-art AI-driven security approaches, highlighted challenges in existing research, and proposed avenues for future opportunities.
- Asia (0.04)
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > Colorado (0.04)
- (3 more...)
- Workflow (1.00)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Overview (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (1.00)
- Education (1.00)
MLOps: A Review
Wazir, Samar, Kashyap, Gautam Siddharth, Saxena, Parag
Recently, Machine Learning (ML) has become a widely accepted method for significant progress that is rapidly evolving. Since it employs computational methods to teach machines and produce acceptable answers. The significance of the Machine Learning Operations (MLOps) methods, which can provide acceptable answers for such problems, is examined in this study. To assist in the creation of software that is simple to use, the authors research MLOps methods. To choose the best tool structure for certain projects, the authors also assess the features and operability of various MLOps methods. A total of 22 papers were assessed that attempted to apply the MLOps idea. Finally, the authors admit the scarcity of fully effective MLOps methods based on which advancements can self-regulate by limiting human engagement.
- North America > United States > North Carolina > Mecklenburg County > Charlotte (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Switzerland (0.04)
- (2 more...)
MLOps: In-depth Guide to Benefits, Examples & Tools for 2023
Building machine learning models and applying them to business processes requires collaboration between data scientists, data engineers, designers, business professionals, and IT professionals. Efficient collaboration and orchestration is especially critical for businesses that want to adopt AI and ML at scale, which leads to a three-fold increase in ROI over companies in the AI proof-of-concept stage. Inspired by DevOps practices for software development, MLOps brings diverse teams in an organization together to speed up the development and deployment of machine learning models. In this article, we'll provide an in-depth guide to MLOps, how it helps streamline end-to-end ML processes, and some case studies from companies who have adopted it. MLOps (Machine Learning Operations) is a set of practices to standardize and streamline the process of construction and deployment of machine learning systems.
AI-powered anomaly detection in log data for improved troubleshooting in devops
In summary, implementing a solution for AI-powered anomaly detection in log data for improved troubleshooting in DevOps requires a well-structured plan, a good understanding of the use case, and a good knowledge of the different AI-based anomaly detection techniques. With proper planning, implementation, and maintenance, AI-powered anomaly detection can be a valuable asset for any DevOps team.
MLOps and ML Data pipeline: Key Takeaways
If you have ever worked with a Machine Learning (ML) model in a production environment, you might have heard of MLOps. The term explains the concept of optimizing the ML lifecycle by bridging the gap between design, model development, and operation processes. As more teams attempt to create AI solutions for actual use cases, MLOps is now more than just a theoretical idea; it is a hotly debated area of machine learning that is becoming increasingly important. If done correctly, it speeds up the development and deployment of ML solutions for teams all over the world. MLOps is frequently referred to as DevOps for Machine Learning while reading about the word.
The Pipeline for the Continuous Development of Artificial Intelligence Models -- Current State of Research and Practice
Steidl, Monika, Felderer, Michael, Ramler, Rudolf
Companies struggle to continuously develop and deploy AI models to complex production systems due to AI characteristics while assuring quality. To ease the development process, continuous pipelines for AI have become an active research area where consolidated and in-depth analysis regarding the terminology, triggers, tasks, and challenges is required. This paper includes a Multivocal Literature Review where we consolidated 151 relevant formal and informal sources. In addition, nine-semi structured interviews with participants from academia and industry verified and extended the obtained information. Based on these sources, this paper provides and compares terminologies for DevOps and CI/CD for AI, MLOps, (end-to-end) lifecycle management, and CD4ML. Furthermore, the paper provides an aggregated list of potential triggers for reiterating the pipeline, such as alert systems or schedules. In addition, this work uses a taxonomy creation strategy to present a consolidated pipeline comprising tasks regarding the continuous development of AI. This pipeline consists of four stages: Data Handling, Model Learning, Software Development and System Operations. Moreover, we map challenges regarding pipeline implementation, adaption, and usage for the continuous development of AI to these four stages.
- North America > United States > New York > New York County > New York City (0.28)
- North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
- North America > United States > California > Santa Clara County > Santa Clara (0.04)
- (17 more...)
- Personal > Interview (0.48)
- Research Report > Experimental Study (0.46)
- Information Technology > Services (1.00)
- Education (1.00)
- Energy (0.92)
- Information Technology > Security & Privacy (0.67)
MLOps Best Practices for Machine Learning Model Development, Deployment, and Maintenance
MLOps, or DevOps for machine learning, is a practice that aims to bring the collaboration and automation practices of DevOps to the development and deployment of machine learning models. It aims to improve the speed and reliability of model development and deployment, as well as to make the process more reproducible and maintainable. By following these steps and using tools and practices from the DevOps philosophy, organizations can improve the speed and reliability of their machine learning model development and deployment processes, and better manage the complexity and scale of their machine learning systems. By testing and validating the model continuously throughout the development process, data scientists can ensure that the model is accurate, reliable, and effective when it is deployed to production.