AITopics | user manual

Collaborating Authors

user manual

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Robot Operation of Home Appliances by Reading User Manuals

Zhang, Jian, Zhang, Hanbo, Xiao, Anxing, Hsu, David

arXiv.org Artificial IntelligenceJul-24-2025

Operating home appliances, among the most common tools in every household, is a critical capability for assistive home robots. This paper presents ApBot, a robot system that operates novel household appliances by "reading" their user manuals. ApBot faces multiple challenges: (i) infer goal-conditioned partial policies from their unstructured, textual descriptions in a user manual document, (ii) ground the policies to the appliance in the physical world, and (iii) execute the policies reliably over potentially many steps, despite compounding errors. To tackle these challenges, ApBot constructs a structured, symbolic model of an appliance from its manual, with the help of a large vision-language model (VLM). It grounds the symbolic actions visually to control panel elements. Finally, ApBot closes the loop by updating the model based on visual feedback. Our experiments show that across a wide range of simulated and real-world appliances, ApBot achieves consistent and statistically significant improvements in task success rate, compared with state-of-the-art large VLMs used directly as control policies. These results suggest that a structured internal representations plays an important role in robust robot operation of home appliances, especially, complex ones.

artificial intelligence, large language model, natural language, (16 more...)

arXiv.org Artificial Intelligence

2505.20424

Genre:

Workflow (1.00)
Research Report > New Finding (0.87)

Industry: Appliances & Durable Goods (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.69)
Information Technology > Artificial Intelligence > Robots > Robots in the Home (0.65)

Add feedback

Bridging Literature and the Universe Via A Multi-Agent Large Language Model System

Zhang, Xiaowen, Bi, Zhenyu, Lachance, Patrick, Wang, Xuan, Di Matteo, Tiziana, Croft, Rupert A. C.

arXiv.org Artificial IntelligenceJul-21-2025

As cosmological simulations and their associated software become increasingly complex, physicists face the challenge of searching through vast amounts of literature and user manuals to extract simulation parameters from dense academic papers, each using different models and formats. Translating these parameters into executable scripts remains a time-consuming and error-prone process. To improve efficiency in physics research and accelerate the cosmological simulation process, we introduce SimAgents, a multi-agent system designed to automate both parameter configuration from the literature and preliminary analysis for cosmology research. SimAgents is powered by specialized LLM agents capable of physics reasoning, simulation software validation, and tool execution. These agents collaborate through structured communication, ensuring that extracted parameters are physically meaningful, internally consistent, and software-compliant. We also construct a cosmological parameter extraction evaluation dataset by collecting over 40 simulations in published papers from Arxiv and leading journals that cover diverse simulation types. Experiments on the dataset demonstrate a strong performance of SimAgents, highlighting its effectiveness and potential to accelerate scientific research for physicists. Our demonstration video is available at: https://youtu.be/w1zLpm_CaWA. The complete system and dataset are publicly available at https://github.com/xwzhang98/SimAgents.

large language model, machine learning, simulation, (19 more...)

arXiv.org Artificial Intelligence

2507.08958

Country: North America > United States (0.94)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Bridging the Gap with Retrieval-Augmented Generation: Making Prosthetic Device User Manuals Available in Marginalised Languages

Ogbonna, Ikechukwu, Davidson, Lesley, Banerjee, Soumya, Dasgupta, Abhishek, Kenney, Laurence, Nagaraja, Vikranth Harthikote

arXiv.org Artificial IntelligenceJul-1-2025

Millions of people in African countries face barriers to accessing healthcare due to language and literacy gaps. This research tackles this challenge by transforming complex medical documents -- in this case, prosthetic device user manuals -- into accessible formats for underserved populations. This case study in cross-cultural translation is particularly pertinent/relevant for communities that receive donated prosthetic devices but may not receive the accompanying user documentation. Or, if available online, may only be available in formats (e.g., language and readability) that are inaccessible to local populations (e.g., English-language, high resource settings/cultural context). The approach is demonstrated using the widely spoken Pidgin dialect, but our open-source framework has been designed to enable rapid and easy extension to other languages/dialects. This work presents an AI-powered framework designed to process and translate complex medical documents, e.g., user manuals for prosthetic devices, into marginalised languages. The system enables users -- such as healthcare workers or patients -- to upload English-language medical equipment manuals, pose questions in their native language, and receive accurate, localised answers in real time. Technically, the system integrates a Retrieval-Augmented Generation (RAG) pipeline for processing and semantic understanding of the uploaded manuals. It then employs advanced Natural Language Processing (NLP) models for generative question-answering and multilingual translation. Beyond simple translation, it ensures accessibility to device instructions, treatment protocols, and safety information, empowering patients and clinicians to make informed healthcare decisions.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2506.23958

Country:

Africa (0.93)
Europe > United Kingdom > England (0.14)

Genre: Research Report (0.70)

Industry: Health & Medicine > Health Care Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Winning Big with Small Models: Knowledge Distillation vs. Self-Training for Reducing Hallucination in QA Agents

Lewis, Ashley, White, Michael, Liu, Jing, Koike-Akino, Toshiaki, Parsons, Kieran, Wang, Ye

arXiv.org Artificial IntelligenceFeb-26-2025

The deployment of Large Language Models (LLMs) in customer support is constrained by hallucination-generating false information-and the high cost of proprietary models. To address these challenges, we propose a retrieval-augmented question-answering (QA) pipeline and explore how to balance human input and automation. Using a dataset of questions about a Samsung Smart TV user manual, we demonstrate that synthetic data generated by LLMs outperforms crowdsourced data in reducing hallucination in finetuned models. We also compare self-training (fine-tuning models on their own outputs) and knowledge distillation (fine-tuning on stronger models' outputs, e.g., GPT-4o), and find that self-training achieves comparable hallucination reduction. We conjecture that this surprising finding can be attributed to increased exposure bias issues in the knowledge distillation case and support this conjecture with post hoc analysis. We also improve robustness to unanswerable questions and retrieval failures with contextualized "I don't know" responses. These findings show that scalable, cost-efficient QA systems can be built using synthetic data and self-training with open-source models, reducing reliance on proprietary tools or costly human annotations.

dataset, hallucination, information, (13 more...)

arXiv.org Artificial Intelligence

2502.19545

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
North America > United States > Ohio (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
(9 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Semiconductors & Electronics (0.90)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

CARE: A Clue-guided Assistant for CSRs to Read User Manuals

Du, Weihong, Liu, Jia, Wen, Zujie, Jin, Dingnan, Liang, Hongru, Lei, Wenqiang

arXiv.org Artificial IntelligenceAug-26-2024

It is time-saving to build a reading assistant for customer service representations (CSRs) when reading user manuals, especially information-rich ones. Current solutions don't fit the online custom service scenarios well due to the lack of attention to user questions and possible responses. Hence, we propose to develop a time-saving and careful reading assistant for CSRs, named CARE. It can help the CSRs quickly find proper responses from the user manuals via explicit clue chains. Specifically, each of the clue chains is formed by inferring over the user manuals, starting from the question clue aligned with the user question and ending at a possible response. To overcome the shortage of supervised data, we adopt the self-supervised strategy for model learning. The offline experiment shows that CARE is efficient in automatically inferring accurate responses from the user manual. The online experiment further demonstrates the superiority of CARE to reduce CSRs' reading burden and keep high service quality, in particular with >35% decrease in time spent and keeping a >0.75 ICC score.

node, user manual, user question, (16 more...)

arXiv.org Artificial Intelligence

2408.03633

Country: Asia > China > Liaoning Province > Dalian (0.04)

Genre: Research Report (0.82)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Knowing-how & Knowing-that: A New Task for Machine Comprehension of User Manuals

Liang, Hongru, Liu, Jia, Du, Weihong, Jin, Dingnan, Lei, Wenqiang, Wen, Zujie, Lv, Jiancheng

arXiv.org Artificial IntelligenceAug-8-2023

The machine reading comprehension (MRC) of user manuals has huge potential in customer service. However, current methods have trouble answering complex questions. Therefore, we introduce the Knowing-how & Knowing-that task that requires the model to answer factoid-style, procedure-style, and inconsistent questions about user manuals. We resolve this task by jointly representing the steps and facts in a graph TARA, which supports a unified inference of various questions. Towards a systematical benchmarking study, we design a heuristic method to automatically parse user manuals into TARAs and build an annotated dataset to test the model's ability in answering real-world questions. Empirical results demonstrate that representing user manuals as TARAs is a desired solution for the MRC of user manuals. An in-depth investigation of TARA further sheds light on the issues and broader impacts of future representations of user manuals. We hope our work can move the MRC of user manuals to a more complex and realistic stage.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.findings-acl.671

2306.04187

Country:

Asia > China (0.04)
North America > Dominican Republic (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.48)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

What China's Algorithm Registry Reveals about AI Governance - Carnegie Endowment for International Peace

#artificialintelligenceDec-11-2022, 06:10:14 GMT

For the past year, the Chinese government has been conducting some of the earliest experiments in building regulatory tools to govern artificial intelligence (AI). In that process, China is trying to tackle a problem that will soon face governments around the world: Can regulators gain meaningful insight into the functioning of algorithms, and ensure they perform within acceptable bounds? One particular tool deserves attention both for its impact within China, and for the lessons technologists and policymakers in other countries can draw from it: a mandatory registration system created by China's internet regulator for recommendation algorithms. Although the full details of the registry are not public, by digging into its online instruction manual, we can reveal new insights into China's emerging regulatory architecture for algorithms. The algorithm registry was created by China's 2022 regulation on recommendation algorithms (English translation), which came into effect in March of this year and was led by the Cyberspace Administration of China (CAC).

algorithm, china, registry, (16 more...)

#artificialintelligence

Country:

Asia > China (1.00)
North America > Canada > Ontario > Toronto (0.15)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.69)
Government > Regional Government > Asia Government > China Government (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.49)

Add feedback

Snap's first drone, Pixy, fully revealed in FCC photos

#artificialintelligenceApr-28-2022, 21:17:24 GMT

Update April 28th, 1:09PM ET: Snap has officially revealed the Pixy, and we went hands-on with it. You can read our full article -- with video! Our original article about the FCC documents follows. It appears Snap is working on a drone called Pixy, and the whole thing just leaked with a huge amount of details, including photos and a seemingly unfinished user manual, published by the FCC. It's small: rulers in the photos indicate the drone is roughly 130 millimeters wide and 120 millimeters tall, which translates to approximately 5.1 inches by 4.7 inches.

drone, fcc photo, pixy, (3 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.53)

Add feedback

Adapting machine translation models to new genres

#artificialintelligenceNov-8-2021, 15:40:42 GMT

Neural machine translation systems are often optimized to perform well for specific text genres or domains, such as newspaper articles, user manuals, or customer support chats. In industrial settings with hundreds of language pairs to serve, however, a single translation system per language pair, which performs well across different text domains, is more efficient to deploy and maintain. Additionally, service providers may not know in advance which domains customers will be interested in. At this year's Conference on Empirical Methods in Natural Language Processing (EMNLP), we are presenting a new approach to multidomain adaptation for neural translation models, or adapting an existing model to new domains while maintaining translation quality in the original domain. Our approach provides a better trade-off between performance on old and new tasks than its predecessors do.

news article, translation model, translation system, (16 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.32)

Technology: Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

Add feedback

15 tech tips you won't find in a user manual

FOX NewsApr-24-2021, 12:36:14 GMT

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. Most gadgets don't come with a user manual that spells out every single feature. We learn them by doing, when someone spills the beans, or asking, "How'd you do that?" For example, no one thinks to dive into a new router's settings.

computer, user manual, webcam, (15 more...)

FOX News

Industry: Media > Radio (0.48)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Communications > Mobile (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.33)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.33)

Add feedback