Goto

Collaborating Authors

 rpa


A Nascent Taxonomy of Machine Learning in Intelligent Robotic Process Automation

Laakmann, Lukas, Ciftci, Seyyid A., Janiesch, Christian

arXiv.org Artificial Intelligence

Robotic process automation (RPA) is a lightweight approach to automating business processes using software robots that emulate user actions at the graphical user interface level. While RPA has gained popularity for its cost-effective and timely automation of rule-based, well-structured tasks, its symbolic nature has inherent limitations when approaching more complex tasks currently performed by human agents. Machine learning concepts enabling intelligent RPA provide an opportunity to broaden the range of automatable tasks. In this paper, we conduct a literature review to explore the connections between RPA and machine learning and organize the joint concept intelligent RPA into a taxonomy. Our taxonomy comprises the two meta-characteristics RPA-ML integration and RPA-ML interaction. Together, they comprise eight dimensions: architecture and ecosystem, capabilities, data basis, intelligence level, and technical depth of integration as well as deployment environment, lifecycle phase, and user-robot relation.


Are LLM Agents the New RPA? A Comparative Study with RPA Across Enterprise Workflows

Průcha, Petr, Matoušková, Michaela, Strnad, Jan

arXiv.org Artificial Intelligence

The emergence of large language models (LLMs) has introduced a new paradigm in automation: LLM agents or Agentic Automation with Computer Use (AACU). Unlike traditional Robotic Process Automation (RPA), which relies on rule - based workflows and scripting, AACU enables intelligent agents to perform tasks through natural language instructions and autonomous inte raction with user interfaces. This study investigates whether AACU can serve as a viable alternative to RPA in enterprise workflow automation. We conducted controlled experiments across three standard RPA challenges data entry, monitoring, and document extraction comparing RPA (via UiPath) and AACU (via Anthropic's Computer Use Agent) in terms of speed, reliability, and development effort. Results indicate that RPA outperforms AACU in execution speed and reliability, particularly in repetitive, stable environments. However, AACU significantly reduces development time and adapts more flexibly to dynamic interfaces. While current AACU implementations are not yet production - ready, their promise in rapid prototyping and lightweight automation is evident. Future research should explore multi - agent orchestration, hybrid RPA - AACU architectures, and more robust evaluation a cross industries and platforms.


Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents

Chen, Chaoran, Yao, Bingsheng, Zou, Ruishi, Hua, Wenyue, Lyu, Weimin, Li, Toby Jia-Jun, Wang, Dakuo

arXiv.org Artificial Intelligence

Role-Playing Agent (RPA) is an increasingly popular type of LLM Agent that simulates human-like behaviors in a variety of tasks. However, evaluating RPAs is challenging due to diverse task requirements and agent designs. This paper proposes an evidence-based, actionable, and generalizable evaluation design guideline for LLM-based RPA by systematically reviewing 1,676 papers published between Jan. 2021 and Dec. 2024. Our analysis identifies six agent attributes, seven task attributes, and seven evaluation metrics from existing literature. Based on these findings, we present an RPA evaluation design guideline to help researchers develop more systematic and consistent evaluation methods.


SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing Agents

Kong, Chuyi, Luo, Ziyang, Lin, Hongzhan, Fan, Zhiyuan, Fan, Yaxin, Sun, Yuxi, Ma, Jing

arXiv.org Artificial Intelligence

The advanced role-playing capabilities of Large Language Models (LLMs) have paved the way for developing Role-Playing Agents (RPAs). However, existing benchmarks in social interaction such as HPD and SocialBench have not investigated hallucination and face limitations like poor generalizability and implicit judgments for character fidelity. To address these issues, we propose a generalizable, explicit and effective paradigm to unlock the interactive patterns in diverse worldviews. Specifically, we define the interactive hallucination based on stance transfer and construct a benchmark, SHARP, by extracting relations from a general commonsense knowledge graph and leveraging the inherent hallucination properties of RPAs to simulate interactions across roles. Extensive experiments validate the effectiveness and stability of our paradigm. Our findings further explore the factors influencing these metrics and discuss the trade-off between blind loyalty to roles and adherence to facts in RPAs.


Advancements in Robotics Process Automation: A Novel Model with Enhanced Empirical Validation and Theoretical Insights

Pandy, Gokul, Jayaram, Vivekananda, Krishnappa, Manjunatha Sughaturu, Ingole, Balaji Shesharao, Ganeeb, Koushik Kumar, Joseph, Shenson

arXiv.org Artificial Intelligence

Abstract: Robotics Process Automation (RPA) is revolutionizing business operations by significantly enhancing efficiency, productivity, and operational excellence across various industries. This manuscript delivers a comprehensive review of recent advancements in RPA technologies and proposes a novel model designed to elevate RPA capabilities. Incorporating cutting-edge artificial intelligence (AI) techniques, advanced machine learning algorithms, and strategic integration frameworks, the proposed model aims to push RPA's boundaries. The paper includes a detailed analysis of functionalities, implementation strategies, and expanded empirical validation through rigorous testing across multiple industries. Theoretical insights underpin the model's design, offering a robust framework for its application.


Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

Liu, Wenhao, An, Siyu, Lu, Junru, Wu, Muling, Li, Tianlong, Wang, Xiaohua, Zheng, Xiaoqing, Yin, Di, Sun, Xing, Huang, Xuanjing

arXiv.org Artificial Intelligence

Role-Playing Agents (RPAs) have shown remarkable performance in various applications, yet they often struggle to recognize and appropriately respond to hard queries that conflict with their role-play knowledge. To investigate RPAs' performance when faced with different types of conflicting requests, we develop an evaluation benchmark that includes contextual knowledge conflicting requests, parametric knowledge conflicting requests, and non-conflicting requests to assess RPAs' ability to identify conflicts and refuse to answer appropriately without over-refusing. Through extensive evaluation, we find that most RPAs behave significant performance gaps toward different conflict requests. To elucidate the reasons, we conduct an in-depth representation-level analysis of RPAs under various conflict scenarios. Our findings reveal the existence of rejection regions and direct response regions within the model's forwarding representation, and thus influence the RPA's final response behavior. Therefore, we introduce a lightweight representation editing approach that conveniently shifts conflicting requests to the rejection region, thereby enhancing the model's refusal accuracy. The experimental results validate the effectiveness of our editing method, improving RPAs' refusal ability of conflicting requests while maintaining their general role-playing capabilities.


Defending against Reverse Preference Attacks is Difficult

Rosati, Domenic, Edkins, Giles, Raj, Harsh, Atanasov, David, Majumdar, Subhabrata, Rajendran, Janarthanan, Rudzicz, Frank, Sajjad, Hassan

arXiv.org Artificial Intelligence

While there has been progress towards aligning Large Language Models (LLMs) with human values and ensuring safe behaviour at inference time, safety-aligned LLMs are known to be vulnerable to training-time attacks such as supervised fine-tuning (SFT) on harmful datasets. In this paper, we ask if LLMs are vulnerable to adversarial reinforcement learning. Motivated by this goal, we propose Reverse Preference Attacks (RPA), a class of attacks to make LLMs learn harmful behavior using adversarial reward during reinforcement learning from human feedback (RLHF). RPAs expose a critical safety gap of safety-aligned LLMs in RL settings: they easily explore the harmful text generation policies to optimize adversarial reward. To protect against RPAs, we explore a host of mitigation strategies. Leveraging Constrained Markov-Decision Processes, we adapt a number of mechanisms to defend against harmful fine-tuning attacks into the RL setting. Our experiments show that ``online" defenses that are based on the idea of minimizing the negative log likelihood of refusals -- with the defender having control of the loss function -- can effectively protect LLMs against RPAs. However, trying to defend model weights using ``offline" defenses that operate under the assumption that the defender has no control over the loss function are less effective in the face of RPAs. These findings show that attacks done using RL can be used to successfully undo safety alignment in open-weight LLMs and use them for malicious purposes.


Optimizing Structured Data Processing through Robotic Process Automation

Bhardwaj, Vivek, Noonia, Ajit, Chaurasia, Sandeep, Kumar, Mukesh, Rashid, Abdulnaser, Othman, Mohamed Tahar Ben

arXiv.org Artificial Intelligence

Robotic Process Automation (RPA) has emerged as a game-changing technology in data extraction, revolutionizing the way organizations process and analyze large volumes of documents such as invoices, purchase orders, and payment advices. This study investigates the use of RPA for structured data extraction and evaluates its advantages over manual processes. By comparing human-performed tasks with those executed by RPA software bots, we assess efficiency and accuracy in data extraction from invoices, focusing on the effectiveness of the RPA system. Through four distinct scenarios involving varying numbers of invoices, we measure efficiency in terms of time and effort required for task completion, as well as accuracy by comparing error rates between manual and RPA processes. Our findings highlight the significant efficiency gains achieved by RPA, with bots completing tasks in significantly less time compared to manual efforts across all cases. Moreover, the RPA system consistently achieves perfect accuracy, mitigating the risk of errors and enhancing process reliability. These results underscore the transformative potential of RPA in optimizing operational efficiency, reducing human labor costs, and improving overall business performance.


Automating the Enterprise with Foundation Models

Wornow, Michael, Narayan, Avanika, Opsahl-Ong, Krista, McIntyre, Quinn, Shah, Nigam H., Re, Christopher

arXiv.org Artificial Intelligence

Automating enterprise workflows could unlock $4 trillion/year in productivity gains. Despite being of interest to the data management community for decades, the ultimate vision of end-to-end workflow automation has remained elusive. Current solutions rely on process mining and robotic process automation (RPA), in which a bot is hard-coded to follow a set of predefined rules for completing a workflow. Through case studies of a hospital and large B2B enterprise, we find that the adoption of RPA has been inhibited by high set-up costs (12-18 months), unreliable execution (60% initial accuracy), and burdensome maintenance (requiring multiple FTEs). Multimodal foundation models (FMs) such as GPT-4 offer a promising new approach for end-to-end workflow automation given their generalized reasoning and planning abilities. To study these capabilities we propose ECLAIR, a system to automate enterprise workflows with minimal human supervision. We conduct initial experiments showing that multimodal FMs can address the limitations of traditional RPA with (1) near-human-level understanding of workflows (93% accuracy on a workflow understanding task) and (2) instant set-up with minimal technical barrier (based solely on a natural language description of a workflow, ECLAIR achieves end-to-end completion rates of 40%). We identify human-AI collaboration, validation, and self-improvement as open challenges, and suggest ways they can be solved with data management techniques. Code is available at: https://github.com/HazyResearch/eclair-agents


Intelligent methods for business rule processing: State-of-the-art

da Costa, Cristiano André, Santos, Uélison Jean Lopes dos, Reis, Eduardo Souza dos, Antunes, Rodolfo Stoffel, Pacheco, Henrique Chaves, França, Thaynã da Silva, Righi, Rodrigo da Rosa, Barbosa, Jorge Luis Victória, Jebadoss, Franklin, Montalvao, Jorge, Kunkel, Rogerio

arXiv.org Artificial Intelligence

Business automation processes have gained popularity in recent times. Robot Process Automation (RPA) reached its peak in September 2018, according to Google Trends data [1]. In this article, we provide an in-depth analysis of selected papers that describe the current state-of-the-art on RPA and Intelligent Process Automation (IPA). The main objective of this article is to present the latest research and understanding of intelligent methods for processing business rules, especially related to service order handling. The methods discussed involve the use of machine processing techniques and natural language processing. The article is structured as follows: Section 2 describe the research methodology. Section 3 focuses on Robot Process Automation (RPA).