AITopics | bauplan

bauplan

Trustworthy AI in the Agentic Lakehouse: from Concurrency to Governance

Tagliabue, Jacopo, Bianchi, Federico, Greco, Ciro

arXiv.org Artificial Intelligence, Nov-21-2025

Even as AI capabilities improve, most enterprises do not consider agents trustworthy enough to work on production data. In this paper, we argue that the path to trustworthy agentic workflows begins with solving the infrastructure problem first: traditional lakehouses are not suited for agent access patterns, but if we design one around transactions, governance follows. In particular, we draw an operational analogy to MVCC in databases and show why a direct transplant fails in a decoupled, multi-language setting. We then propose an agent-first design, Bauplan, that reimplements data and compute isolation in the lakehouse. We conclude by sharing a reference implementation of a self-healing pipeline in Bauplan, which seamlessly couples agent reasoning with all the desired guarantees for correctness and trust.

artificial intelligence, lakehouse, pipeline

2511.16402

Country: North America > United States > South Carolina > Charleston County (0.14)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence (0.89)

AI for Distributed Systems Design: Scalable Cloud Optimization Through Repeated LLMs Sampling And Simulators

Tagliabue, Jacopo

arXiv.org Artificial Intelligence, Oct-23-2025

We explore AI-driven distributed-systems policy design by combining stochastic code generation from large language models (LLMs) with deterministic verification in a domain-specific simulator. Using a Function-as-a-Service runtime (Bauplan) and its open-source simulator (Eudoxia) as a case study, we frame scheduler design as an iterative generate-and-verify loop: an LLM proposes a Python policy, the simulator evaluates it on standardized traces, and structured feedback steers subsequent generations. This setup preserves interpretability while enabling targeted search over a large design space. We detail the system architecture and report preliminary results on throughput improvements across multiple models. Beyond early gains, we discuss the limits of the current setup and outline next steps; in particular, we conjecture that AI will be crucial for scaling this methodology by helping to bootstrap new simulators.
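The generate-and-verify loop described in this abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: `stub_proposer` stands in for the LLM and simply samples from three hand-written policies, `toy_simulator` is not Eudoxia, and mean completion time (lower is better) replaces the throughput metric the paper reports.

```python
import random
from typing import Callable, List, Optional, Tuple

# Hypothetical policy type: takes job durations, returns them in execution order.
Policy = Callable[[List[int]], List[int]]


def toy_simulator(policy: Policy, trace: List[int]) -> float:
    """Deterministic verifier: mean completion time when jobs run one after another."""
    order = policy(list(trace))
    if sorted(order) != sorted(trace):
        raise ValueError("policy must return a permutation of the trace")
    elapsed, total = 0, 0
    for duration in order:
        elapsed += duration
        total += elapsed
    return total / len(order)


def stub_proposer(rng: random.Random, feedback: str) -> Policy:
    """Stand-in for the LLM: samples a candidate policy at random.

    A real proposer would condition on `feedback`; this stub ignores it.
    """
    candidates: List[Policy] = [
        lambda jobs: jobs,                        # first come, first served
        lambda jobs: sorted(jobs),                # shortest job first
        lambda jobs: sorted(jobs, reverse=True),  # longest job first
    ]
    return rng.choice(candidates)


def generate_and_verify(
    trace: List[int], rounds: int, seed: int = 0
) -> Tuple[Optional[Policy], float]:
    """Propose, simulate, keep the best candidate, and pass feedback to the next round."""
    rng = random.Random(seed)
    best_policy, best_score = None, float("inf")
    feedback = "no previous attempt"
    for _ in range(rounds):
        candidate = stub_proposer(rng, feedback)
        try:
            score = toy_simulator(candidate, trace)
        except ValueError as err:
            feedback = f"rejected: {err}"
            continue
        feedback = f"mean completion time {score:.2f}; best so far {min(score, best_score):.2f}"
        if score < best_score:
            best_policy, best_score = candidate, score
    return best_policy, best_score
```

Calling `generate_and_verify([5, 1, 3], rounds=20)` returns the best sampled policy together with its simulated score. Because every candidate is checked by the deterministic simulator, an invalid proposal can only cost a round and never becomes the selected policy.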

large language model, natural language, simulator

2510.18897

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Safe, Untrusted, "Proof-Carrying" AI Agents: toward the agentic lakehouse

Tagliabue, Jacopo, Greco, Ciro

arXiv.org Artificial Intelligence, Oct-13-2025

Starting from this prototype, we conclude by outlining practical next steps for a full agentic lakehouse. The paper is organized as follows. After reviewing agent-friendly abstractions (Section II), we address key safety objections for high-stakes scenarios (Section III). Once safety is established, we describe a ReAct [12] loop built on these abstractions (Section IV). We put forward our working prototype as a feasibility demonstration of safe-by-design data agents, not as a full-fledged experimental benchmark. We believe that sharing working code is of great value to the community, especially in times of quickly shifting mental models. However, it is important to remember that our fundamental insights - programmability and safety - can be replicated independently of the chosen APIs. For these reasons, we believe our paper to be valuable to a wide range of practitioners: on one hand, those looking for a new mental map of this uncharted territory; on the other, those looking to be inspired by tinkering with existing implementations and inspecting systems working at scale.

data mining, large language model, natural language

2510.09567

Genre: Research Report (0.42)

Technology:

Information Technology > Data Science > Data Mining (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.65)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.49)

Bauplan: zero-copy, scale-up FaaS for data pipelines

Tagliabue, Jacopo, Caraza-Harter, Tyler, Greco, Ciro

arXiv.org Artificial Intelligence, Oct-22-2024

Chaining functions for longer workloads is a key use case for FaaS platforms in data applications. However, modern data pipelines differ significantly from typical serverless use cases (e.g., webhooks and microservices); this makes it difficult to retrofit existing pipeline frameworks due to structural constraints. In this paper, we describe these limitations in detail and introduce bauplan, a novel FaaS programming model and serverless runtime designed for data practitioners. bauplan enables users to declaratively define functional Directed Acyclic Graphs (DAGs) along with their runtime environments, which are then efficiently executed on cloud-based workers. We show that bauplan achieves both better performance and a …

In this light, data workloads seem to be a natural fit for Function-as-a-Service (FaaS) platforms designed to efficiently handle bursty, functional, and event-driven tasks. Unfortunately, existing FaaS runtimes fall short in practice as they were primarily designed to support the execution of many simple, independent functions that produce small outputs. Although popular FaaS platforms (e.g., AWS Lambda [5], Azure Functions [17], and OpenWhisk [4]) have added support for function chaining, their capabilities fall short for data pipelines. It is therefore not surprising that widely used data engineering frameworks (e.g., Airflow [1], Prefect [19], and Luigi [23]) lack native integration with serverless runtimes.
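The idea of declaratively defining a functional DAG, mentioned in this entry, can be illustrated with a small Python sketch. It is not bauplan's API: the `node` decorator, `REGISTRY`, `run_dag`, and the three example functions are hypothetical names, and the convention that a function's parameter names identify its parent nodes is an assumption chosen to keep the example short.

```python
import inspect
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical registry mapping node names to the functions that compute them.
REGISTRY = {}


def node(fn):
    """Register a function as a DAG node; its parameter names declare its parents."""
    REGISTRY[fn.__name__] = fn
    return fn


@node
def raw_orders():
    # Source node with no parents (made-up sample data).
    return [120, 80, 45]


@node
def large_orders(raw_orders):
    # Depends on raw_orders because of its parameter name.
    return [amount for amount in raw_orders if amount >= 80]


@node
def order_total(large_orders):
    return sum(large_orders)


def run_dag(registry):
    """Run every node after its parents, passing parent outputs in memory."""
    graph = {
        name: set(inspect.signature(fn).parameters)
        for name, fn in registry.items()
    }
    results = {}
    for name in TopologicalSorter(graph).static_order():
        fn = registry[name]
        kwargs = {parent: results[parent] for parent in inspect.signature(fn).parameters}
        results[name] = fn(**kwargs)
    return results
```

Here `run_dag(REGISTRY)` evaluates `raw_orders`, then `large_orders`, then `order_total`. The user only states what each function needs, and the execution order is derived from those declarations rather than written by hand.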

artificial intelligence, bauplan, cloud computing

2410.17465

Country:

North America > United States > New York > New York County > New York City (0.05)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)

Genre: Research Report (0.40)

Industry:

Information Technology > Security & Privacy (0.46)
Information Technology > Services (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Information Management (0.88)
Information Technology > Cloud Computing (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.46)

The Dynamic of Body and Brain Co-Evolution

Pagliuca, Paolo, Nolfi, Stefano

arXiv.org Artificial Intelligence, Nov-23-2020

We introduce a method for co-evolving the body and the control properties of robots. It can be used to adapt the morphological traits of robots with a hand-designed morphological bauplan or to evolve the morphological bauplan as well. Our results indicate that robots with co-adapted body and control traits outperform robots with fixed hand-designed morphologies. Interestingly, the advantage is not due to the selection of better morphologies but rather to the mutual scaffolding process that results from the possibility to co-adapt the morphological traits to the control traits and vice versa. Our results also demonstrate that morphological variations do not necessarily have destructive effects on robot skills.
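As a rough illustration of co-adapting body and control parameters, the following Python sketch runs a simple (1+1) evolutionary loop. It is not the authors' method: the `Robot` fields, the mutation scheme, and `toy_fitness` (which only rewards each control gain for matching its body parameter) are assumptions invented for this example.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Robot:
    body: List[float]     # hypothetical morphological parameters, e.g. limb lengths
    control: List[float]  # hypothetical controller parameters, e.g. gains


def toy_fitness(robot: Robot) -> float:
    """Made-up objective: best (zero) when each body * control product equals 1."""
    return -sum((b * c - 1.0) ** 2 for b, c in zip(robot.body, robot.control))


def mutate(values: List[float], rng: random.Random, sigma: float = 0.1) -> List[float]:
    """Add Gaussian noise to every parameter."""
    return [v + rng.gauss(0.0, sigma) for v in values]


def evolve(
    start: Robot, generations: int, evolve_body: bool, seed: int = 0
) -> Tuple[Robot, float]:
    """(1+1) evolution: keep the mutated child only if it is at least as fit."""
    rng = random.Random(seed)
    best, best_fit = start, toy_fitness(start)
    for _ in range(generations):
        # With evolve_body=False the morphology stays fixed and only control adapts.
        body = mutate(best.body, rng) if evolve_body else list(best.body)
        control = mutate(best.control, rng)
        child = Robot(body, control)
        child_fit = toy_fitness(child)
        if child_fit >= best_fit:
            best, best_fit = child, child_fit
    return best, best_fit
```

Running `evolve(start, 200, evolve_body=True)` and `evolve(start, 200, evolve_body=False)` from the same `start` gives a co-evolved robot and a fixed-morphology baseline to compare. The sketch makes no claim about which one scores higher on this toy objective.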

agent, experiment, morphology

2011.1144

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
North America > United States > New York (0.04)
Europe > Italy (0.04)

Genre: Research Report > New Finding (0.87)

Industry: Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
