Kubeflow
Towards Conversational AI for Human-Machine Collaborative MLOps
George Fatouros, Georgios Makridis, George Kousiouris, John Soldatos, Anargyros Tsadimas, Dimosthenis Kyriazis
This paper presents a Large Language Model (LLM) based conversational agent system designed to enhance human-machine collaboration in Machine Learning Operations (MLOps). We introduce the Swarm Agent, an extensible architecture that integrates specialized agents to create and manage ML workflows through natural language interactions. The system leverages a hierarchical, modular design incorporating a KubeFlow Pipelines (KFP) Agent for ML pipeline orchestration, a MinIO Agent for data management, and a Retrieval-Augmented Generation (RAG) Agent for domain-specific knowledge integration. Through iterative reasoning loops and context-aware processing, the system enables users with varying technical backgrounds to discover, execute, and monitor ML pipelines; manage datasets and artifacts; and access relevant documentation, all via intuitive conversational interfaces. Our approach addresses the accessibility gap in complex MLOps platforms like Kubeflow, making advanced ML tools broadly accessible while maintaining the flexibility to extend to other platforms. The paper describes the architecture and implementation details, and demonstrates how this conversational MLOps assistant reduces complexity and lowers barriers to entry for users across diverse technical skill levels.
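The hierarchical routing idea described above can be sketched in a few lines: a coordinator inspects a user request and dispatches it to a specialized agent (pipelines, data, or documentation). All names and the keyword routing below are illustrative stand-ins for the paper's LLM-driven intent classification, not its actual implementation.

```python
# Minimal sketch of Swarm-style dispatch: a coordinator routes each request
# to one of several specialized agents. Keyword matching stands in for the
# LLM's intent classification; agent names are hypothetical.
from dataclasses import dataclass
from typing import Callable, Dict

@dataclass
class Agent:
    name: str
    handle: Callable[[str], str]

def kfp_agent(request: str) -> str:
    return f"[KFP Agent] orchestrating pipeline for: {request}"

def minio_agent(request: str) -> str:
    return f"[MinIO Agent] managing artifact for: {request}"

def rag_agent(request: str) -> str:
    return f"[RAG Agent] retrieving docs for: {request}"

ROUTES: Dict[str, Agent] = {
    "pipeline": Agent("kfp", kfp_agent),
    "dataset": Agent("minio", minio_agent),
    "docs": Agent("rag", rag_agent),
}

def route(request: str) -> str:
    """Dispatch a request to the first agent whose keyword matches."""
    for keyword, agent in ROUTES.items():
        if keyword in request.lower():
            return agent.handle(request)
    return "[Coordinator] please clarify your request"

print(route("Run the training pipeline on the latest data"))
```

A real system would replace the keyword table with an LLM call and loop the agent's answer back into the conversation context, as the paper's iterative reasoning loop does.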
Building A Machine Learning Platform With Kubeflow And Ray On Google Kubernetes Engine - cyberpogo
To start building an ML Platform, you should support the basic ML user journey of notebook prototyping to scaled training to online serving. If your organization has multiple teams, you may additionally need to support administrative requirements of multi-user support with identity-based authentication and authorization. Two popular OSS projects – Kubeflow and Ray – together can support these needs. Kubeflow provides the multi-user environment and interactive notebook management. Ray orchestrates distributed computing workloads across the entire ML lifecycle, including training and serving.
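On the Ray side, the usual way to run such workloads on Kubernetes is a `RayCluster` custom resource managed by the KubeRay operator. The manifest below is a hedged sketch only; field names and the API version vary by KubeRay release, so check the KubeRay documentation for your version.

```yaml
# Illustrative KubeRay RayCluster manifest: one head node plus two workers.
# Values (image tag, replica count, names) are placeholders.
apiVersion: ray.io/v1
kind: RayCluster
metadata:
  name: ml-platform-cluster
spec:
  headGroupSpec:
    rayStartParams:
      dashboard-host: "0.0.0.0"
    template:
      spec:
        containers:
          - name: ray-head
            image: rayproject/ray:2.9.0
  workerGroupSpecs:
    - groupName: workers
      replicas: 2
      rayStartParams: {}
      template:
        spec:
          containers:
            - name: ray-worker
              image: rayproject/ray:2.9.0
```

In the platform described above, Kubeflow would handle the per-user namespaces and notebooks, while a cluster like this one runs the distributed training and serving jobs.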
Two Towers Model: A Custom Pipeline in Vertex AI Using Kubeflow
MLOps is composed of Continuous Integration (CI -- code, unit testing, merge code), Continuous Delivery (CD -- build, test, release) and Continuous Training (CT -- train, monitor, measure, retrain, serve). Consider the following situation: you develop a solution that offers product search to users. There are new users every minute and new products every day. In this situation we will have an index of embeddings containing all the products, and user queries will be submitted to this index as numerical vectors to retrieve the best results. This index is deployed in a container inside a Vertex AI endpoint.
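The retrieval step can be sketched with plain cosine similarity: products live in an embedding index, and a query vector is matched against it. A real deployment would use an approximate-nearest-neighbor index served from a Vertex AI endpoint; the tiny in-memory index and vectors below are purely illustrative.

```python
# Toy embedding search: score every product vector against the query
# and return the top-k product ids. Standard library only.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

# Tiny "index": product id -> embedding vector (hypothetical values).
index = {
    "sneakers": [0.9, 0.1, 0.0],
    "sandals": [0.7, 0.3, 0.0],
    "laptop": [0.0, 0.1, 0.9],
}

def search(query_vec, k=2):
    scored = sorted(index.items(),
                    key=lambda kv: cosine(query_vec, kv[1]),
                    reverse=True)
    return [pid for pid, _ in scored[:k]]

print(search([1.0, 0.0, 0.0]))  # nearest products to the query vector
```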
New at Civo Navigate: Making Machine Learning Set up Faster - The New Stack
Of the time it takes to set up a machine learning project, 60% is actually spent performing infrastructure engineering tasks. That compares to 20% doing data engineering, Civo Chief Innovation Officer Josh Mesout, who has launched 300 machine learning (ML) models in the past two and a half years, said at the Civo Navigate conference here on Tuesday. Civo hopes to simplify machine learning infrastructure with a new managed service offering, Kubeflow as a Service, which it says will improve the developer experience and reduce the time and resources required to gain insights from machine learning algorithms. The Kubernetes cloud provider is betting that developers don't want to deal with the infrastructure piece of the ML puzzle. So its new offering will run the infrastructure for ML as a managed service, while supporting open source tools and frameworks. It believes this will make ML more accessible to smaller organizations, which it said are often priced out of ML due to economies of scale.
Understand Workflow Management with Kubeflow
Kubeflow is an open-source platform that makes it easy to deploy and manage machine learning (ML) workflows on Kubernetes, a popular open-source system for automating containerized applications' deployment, scaling, and management. Kubeflow can help you run machine learning tasks on your computer by making it easy to set up and manage a cluster of computers to work together on the task. It acts like a "traffic cop" for your computer work, ensuring all the different steps of a task are done in the right order and that all the computers are working together correctly. This way, you can focus on the task at hand, such as making predictions or finding patterns in your data, and let Kubeflow handle the underlying infrastructure. Imagine you have a big toy box with many different toys inside. Kubeflow is like the toy box organizer.
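The "traffic cop" role above, running each step only after its dependencies finish, is essentially topological ordering of a workflow DAG. The step names and the scheduler here are a conceptual sketch, not Kubeflow's API:

```python
# Sequence workflow steps so each runs only after its dependencies,
# the way a pipeline orchestrator schedules a DAG. Python 3.9+ stdlib.
from graphlib import TopologicalSorter

# step -> set of steps it depends on (hypothetical ML workflow)
workflow = {
    "ingest": set(),
    "preprocess": {"ingest"},
    "train": {"preprocess"},
    "evaluate": {"train"},
}

order = list(TopologicalSorter(workflow).static_order())
print(order)  # a valid execution order for the steps
```

In Kubeflow the equivalent DAG is built from a pipeline definition, and each step runs in its own container rather than in-process.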
Machine Learning with Kubeflow on Amazon EKS with Amazon EFS
Training Machine Learning models involves multiple steps, and it becomes more complex and time-consuming when the training data set runs to hundreds of gigabytes. Data Scientists run through a large number of experiments and research, which includes testing and training many models. Kubeflow provides various ML capabilities to accelerate the training process and run simple, portable, and scalable Machine Learning workloads on Kubernetes. Model parallelism is a distributed training method in which the deep learning model is partitioned across multiple devices, within or across instances. When Data Scientists adopt model parallelism, there is also a need to share the large dataset across the Machine Learning models.
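The partitioning idea behind model parallelism can be illustrated with a contiguous split of a model's layers across devices, each partition then reading the same shared dataset (the role EFS plays in this setup). Layer names and the splitting helper below are purely conceptual, not a real training framework's API:

```python
# Conceptual model-parallel placement: split a model's layers into
# contiguous shards, one per device.

def partition_layers(layers, num_devices):
    """Contiguous split of layers across devices (last shard may be smaller)."""
    per_device = -(-len(layers) // num_devices)  # ceiling division
    return [layers[i:i + per_device] for i in range(0, len(layers), per_device)]

layers = ["embed", "block1", "block2", "block3", "head"]
placement = partition_layers(layers, num_devices=2)
print(placement)  # two contiguous shards of the model
```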
Kubeflow -- Your Toolkit for MLOps
In MLOps, different platforms operate within the data science environment, each holding its ground through the services it offers -- one of which is Kubeflow. Before examining the specialties of Kubeflow and its importance, it is necessary to know what MLOps is and why we need it. MLOps, also referred to as Machine Learning Operations, combines Data Science, software engineering, and DevOps practices. Whenever a data scientist builds a model that runs seamlessly and provides high-performance output, it needs to be deployed for real-time inference. Calling in a DevOps engineer for this task is sometimes helpful, because a DevOps engineer has hands-on experience in software engineering and development operations, but monitoring a model and dataset in real time can be complicated even then.
Accelerating ETL on KubeFlow with RAPIDS
In the machine learning and MLOps world, GPUs are widely used to speed up model training and inference, but what about the other stages of the workflow like ETL pipelines or hyperparameter optimization? Within the RAPIDS data science framework, ETL tools are designed to have a familiar look and feel to data scientists working in Python. Do you currently use Pandas, NumPy, Scikit-learn, or other parts of the PyData stack within your KubeFlow workflows? If so, you can use RAPIDS to accelerate those parts of your workflow by leveraging the GPUs likely already available in your cluster. In this post, I demonstrate how to drop RAPIDS into a KubeFlow environment.
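The "drop-in" claim above rests on cuDF mirroring much of the pandas API, so ETL code can often switch backends by changing a single import. The fallback pattern below (an illustration, not RAPIDS documentation) lets the same snippet run on machines without a GPU:

```python
# Backend-agnostic ETL sketch: use cuDF (GPU) when available, else pandas.
# The groupby/sum code is identical either way.
try:
    import cudf as xd  # GPU-accelerated DataFrame library from RAPIDS
except ImportError:
    import pandas as xd  # CPU fallback with a matching API surface

df = xd.DataFrame({"category": ["a", "b", "a"], "value": [1, 2, 3]})
totals = df.groupby("category")["value"].sum()
print(totals)
```

Hyperparameter sweeps and other pipeline stages can be accelerated the same way, stage by stage, without rewriting the surrounding KubeFlow workflow.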
Kubernetes ML optimizer, Kubeflow, improves data preprocessing with v1.6
More often than not, when organizations deploy applications across hybrid and multicloud environments, they use the open-source Kubernetes container orchestration system. Kubernetes itself helps to schedule and manage distributed virtual compute resources and isn't optimized by default for any one particular type of workload; that's where projects like Kubeflow come into play. For organizations looking to run machine learning (ML) in the cloud, a group of companies including Google, Red Hat and Cisco helped to found the Kubeflow open-source project in 2017.
How to Package and Distribute Machine Learning Models with MLFlow - KDnuggets
One of the fundamental activities during each stage of the ML model life cycle development is collaboration. Taking an ML model from its conception to deployment requires participation and interaction between different roles involved in constructing the model. In addition, the nature of ML model development involves experimentation, tracking of artifacts and metrics, model versions, etc., which demands an effective organization for the correct maintenance of the ML model life cycle. Fortunately, there are tools for developing and maintaining a model's life cycle, such as MLflow. In this article, we will break down MLflow, its main components, and its characteristics.
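The bookkeeping a tracker like MLflow automates, recording parameters, metrics, and artifacts per run so collaborators can compare model versions, can be sketched as a toy in-memory tracker. This is emphatically not MLflow's API; class and method names below are invented for illustration:

```python
# Toy experiment tracker: each run accumulates params, metrics, and
# artifact paths, the kind of record MLflow keeps for you automatically.
import uuid

class RunTracker:
    def __init__(self):
        self.runs = {}

    def start_run(self):
        run_id = uuid.uuid4().hex[:8]
        self.runs[run_id] = {"params": {}, "metrics": {}, "artifacts": []}
        return run_id

    def log_param(self, run_id, key, value):
        self.runs[run_id]["params"][key] = value

    def log_metric(self, run_id, key, value):
        self.runs[run_id]["metrics"][key] = value

    def log_artifact(self, run_id, path):
        self.runs[run_id]["artifacts"].append(path)

tracker = RunTracker()
run = tracker.start_run()
tracker.log_param(run, "learning_rate", 0.01)
tracker.log_metric(run, "accuracy", 0.93)
tracker.log_artifact(run, "model.pkl")
print(tracker.runs[run])
```

MLflow adds what this toy omits: persistent storage, a UI for comparing runs, model packaging formats, and a registry for versioned deployment.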