AITopics | task automation

Collaborating Authors

task automation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

TaskBench: Benchmarking Large Language Models for Task Automation

Neural Information Processing SystemsDec-23-2025, 20:07:44 GMT

In recent years, the remarkable progress of large language models (LLMs) has sparked interest in task automation, which involves decomposing complex tasks described by user instructions into sub-tasks and invoking external tools to execute them, playing a central role in autonomous agents. However, there is a lack of systematic and standardized benchmarks to promote the development of LLMs in task automation. To address this, we introduce TaskBench, a comprehensive framework to evaluate the capability of LLMs in task automation. Specifically, task automation can be divided into three critical stages: task decomposition, tool selection, and parameter prediction. To tackle the complexities inherent in these stages, we introduce the concept of Tool Graph to represent decomposed tasks and adopt a back-instruct method to generate high-quality user instructions.

large language model, natural language, task automation, (9 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

085185ea97db31ae6dcac7497616fd3e-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsNov-20-2025, 08:28:41 GMT

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.92)

Industry:

Banking & Finance (1.00)
Information Technology > Services (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(4 more...)

Add feedback

085185ea97db31ae6dcac7497616fd3e-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsOct-9-2025, 17:53:28 GMT

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre:

Research Report > Experimental Study (0.93)
Research Report > New Finding (0.92)

Industry:

Banking & Finance (1.00)
Information Technology > Services (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(4 more...)

Add feedback

TaskBench: Benchmarking Large Language Models for Task Automation

Neural Information Processing SystemsMay-26-2025, 15:37:03 GMT

large language model, natural language, task automation, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

TaskBench: Benchmarking Large Language Models for Task Automation

Shen, Yongliang, Song, Kaitao, Tan, Xu, Zhang, Wenqi, Ren, Kan, Yuan, Siyu, Lu, Weiming, Li, Dongsheng, Zhuang, Yueting

arXiv.org Artificial IntelligenceDec-9-2023

Recently, the incredible progress of large language models (LLMs) has ignited the spark of task automation, which decomposes the complex tasks described by user instructions into sub-tasks, and invokes external tools to execute them, and plays a central role in autonomous agents. However, there lacks a systematic and standardized benchmark to foster the development of LLMs in task automation. To this end, we introduce TaskBench to evaluate the capability of LLMs in task automation. Specifically, task automation can be formulated into three critical stages: task decomposition, tool invocation, and parameter prediction to fulfill user intent. This complexity makes data collection and evaluation more challenging compared to common NLP tasks. To generate high-quality evaluation datasets, we introduce the concept of Tool Graph to represent the decomposed tasks in user intent, and adopt a back-instruct method to simulate user instruction and annotations. Furthermore, we propose TaskEval to evaluate the capability of LLMs from different aspects, including task decomposition, tool invocation, and parameter prediction. Experimental results demonstrate that TaskBench can effectively reflects the capability of LLMs in task automation. Benefiting from the mixture of automated data construction and human verification, TaskBench achieves a high consistency compared to the human evaluation, which can be utilized as a comprehensive and faithful benchmark for LLM-based autonomous agents.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2311.1876

Country:

Asia (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Banking & Finance (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators

Zhang, Zhizheng, Zhang, Xiaoyi, Xie, Wenxuan, Lu, Yan

arXiv.org Artificial IntelligenceDec-4-2023

They have shown a promising prospect in automatically completing tasks upon user instructions, functioning as brain-like coordinators. The associated risks will be revealed as we delegate an increasing number of tasks to machines for automated completion. A big question emerges: how can we make machines behave responsibly when helping humans automate tasks as personal copilots? In this paper, we explore this question in depth from the perspectives of feasibility, completeness and security. In specific, we present Responsible Task Automation (ResponsibleTA) as a fundamental framework to facilitate responsible collaboration between LLM-based coordinators and executors for task automation with three empowered capabilities: 1) predicting the feasibility of the commands for executors; 2) verifying the completeness of executors; 3) enhancing the security (e.g., the protection of users' privacy). We further propose and compare two paradigms for implementing the first two capabilities. One is to leverage the generic knowledge of LLMs themselves via prompt engineering while the other is to adopt domain-specific learnable models. Moreover, we introduce a local memory mechanism for achieving the third capability. We evaluate our proposed ResponsibleTA on UI task automation and hope it could bring more attentions to ensuring LLMs more responsible in diverse scenarios.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2306.01242

Country:

Europe > United Kingdom > England (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre:

Research Report (0.50)
Workflow (0.49)

Industry: Information Technology > Security & Privacy (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Empowering LLM to use Smartphone for Intelligent Task Automation

Wen, Hao, Li, Yuanchun, Liu, Guohong, Zhao, Shanhui, Yu, Tao, Li, Toby Jia-Jun, Jiang, Shiqi, Liu, Yunhao, Zhang, Yaqin, Liu, Yunxin

arXiv.org Artificial IntelligenceSep-9-2023

Mobile task automation is an attractive technique that aims to enable voice-based hands-free user interaction with smartphones. However, existing approaches suffer from poor scalability due to the limited language understanding ability and the non-trivial manual efforts required from developers or end-users. The recent advance of large language models (LLMs) in language understanding and reasoning inspires us to rethink the problem from a model-centric perspective, where task preparation, comprehension, and execution are handled by a unified language model. In this work, we introduce AutoDroid, a mobile task automation system that can handle arbitrary tasks on any Android application without manual efforts. The key insight is to combine the commonsense knowledge of LLMs and domain-specific knowledge of apps through automated dynamic analysis. The main components include a functionality-aware UI representation method that bridges the UI with the LLM, exploration-based memory injection techniques that augment the app-specific domain knowledge of LLM, and a multi-granularity query optimization module that reduces the cost of model inference. We integrate AutoDroid with off-the-shelf LLMs including online GPT-4/GPT-3.5 and on-device Vicuna, and evaluate its performance on a new benchmark for memory-augmented Android task automation with 158 common tasks. The results demonstrated that AutoDroid is able to precisely generate actions with an accuracy of 90.9%, and complete tasks with a success rate of 71.3%, outperforming the GPT-4-powered baselines by 36.4% and 39.7%. The demo, benchmark suites, and source code of AutoDroid will be released at url{https://autodroid-sys.github.io/}.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2308.15272

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > Canada > Ontario > Toronto (0.04)
Europe > Netherlands > North Brabant > Eindhoven (0.04)
Asia > Japan > Honshū > Chūbu > Toyama Prefecture > Toyama (0.04)

Genre:

Research Report (0.50)
Workflow (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

How Artificial Intelligence is transforming the industry

#artificialintelligenceJan-11-2023, 20:30:20 GMT

Artificial intelligence is transforming industry in many ways. From task automation to decision making, the way we work and interact with the world. In this article, we will explore how AI is being used in different industries and how it is driving efficiency and productivity. Artificial intelligence is changing the way we work in several ways. First -- AI is being used to automate tasks and processes.

ai system, artificial intelligence, efficiency and productivity, (11 more...)

#artificialintelligence

Industry:

Banking & Finance (0.52)
Health & Medicine (0.50)

Technology: Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

How AI is Improving Cloud Computing for Enterprises - ONLINE LIKE

#artificialintelligenceNov-10-2022, 08:14:27 GMT

The first two decades of the 21st century have been marked by exponential advances in technology that were once considered elements of a science fiction movie script. Technologies like Artificial intelligence (AI) and Cloud Computing--have stood the test of time and have become mainstream. In this article, we'll look at what these technologies are and how their combination has been a landscape-changing force in the world of modern technology. Simply put, artificial intelligence is the simulation of human intelligence by machines. The integration of artificial intelligence into business allows it to perceive and observe the environment and generate optimal results accordingly--very similar to how people operate, although much faster.

artificial intelligence, cloud computing, computing, (12 more...)

#artificialintelligence

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

The Future Of AI Process Automation In Marketing

#artificialintelligenceJan-24-2022, 22:00:51 GMT

In the past several years, marketers have embraced artificial intelligence technologies to automate a broad range of high-volume, data-intensive tasks from ad targeting to image manipulation. The next phase of AI in marketing has the potential to deliver a much larger impact as the focus shifts from the automation of single tasks to more complex business processes and workflows, and ultimately influencing marketing strategy. Task automation using AI will continue to add value to marketers, but their benefits will be dwarfed by the intelligent automation of complex workflows. To understand the enormous difference between task automation and process automation, consider the evolution of automotive interfaces. In the early 2000s, we started to see basic voice automation in cars.

ai process automation, artificial intelligence, automation, (5 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Applied AI (0.58)
Information Technology > Artificial Intelligence > The Future (0.40)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.40)

Add feedback