Goto

Collaborating Authors

 Personal


A Planning-Based Explainable Collaborative Dialogue System

arXiv.org Artificial Intelligence

Eva is a multimodal conversational system that helps users to accomplish their domain goals through collaborative dialogue. The system does this by inferring users' intentions and plans to achieve those goals, detects whether obstacles are present, finds plans to overcome them or to achieve higher-level goals, and plans its actions, including speech acts,to help users accomplish those goals. In doing so, the system maintains and reasons with its own beliefs, goals and intentions, and explicitly reasons about those of its user. Belief reasoning is accomplished with a modal Horn-clause meta-interpreter. The planning and reasoning subsystems obey the principles of persistent goals and intentions, including the formation and decomposition of intentions to perform complex actions, as well as the conditions under which they can be given up. In virtue of its planning process, the system treats its speech acts just like its other actions -- physical acts affect physical states, digital acts affect digital states, and speech acts affect mental and social states. This general approach enables Eva to plan a variety of speech acts including requests, informs, questions, confirmations, recommendations, offers, acceptances, greetings, and emotive expressions. Each of these has a formally specified semantics which is used during the planning and reasoning processes. Because it can keep track of different users' mental states, it can engage in multi-party dialogues. Importantly, Eva can explain its utterances because it has created a plan standing behind each of them. Finally, Eva employs multimodal input and output, driving an avatar that can perceive and employ facial and head movements along with emotive speech acts.


Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT

arXiv.org Artificial Intelligence

Recently, ChatGPT has attracted great attention, as it can generate fluent and high-quality responses to human inquiries. Several prior studies have shown that ChatGPT attains remarkable generation ability compared with existing models. However, the quantitative analysis of ChatGPT's understanding ability has been given little attention. In this report, we explore the understanding ability of ChatGPT by evaluating it on the most popular GLUE benchmark, and comparing it with 4 representative fine-tuned BERT-style models. We find that: 1) ChatGPT falls short in handling paraphrase and similarity tasks; 2) ChatGPT outperforms all BERT models on inference tasks by a large margin; 3) ChatGPT achieves comparable performance compared with BERT on sentiment analysis and question-answering tasks. Additionally, by combining some advanced prompting strategies, we show that the understanding ability of ChatGPT can be further improved.


Do we need a National Algorithms Safety Board?

#artificialintelligence

In the United States, the National Transportation Safety Board is widely respected for its prompt responses to investigate plane, train, and boat accidents. Its independent reports have done much to promote safety in civil aviation and beyond. Could a National Algorithms Safety Board have a similar impact in increasing safety for algorithmic systems, especially the rapidly proliferating Artificial Intelligence applications based on unpredictable machine learning? Alternatively, could agencies such as the Food & Drug Administration (FDA), Securities and Exchange Commission (SEC), or Federal Communications Commission (FCC) take on the task of increasing safety of algorithmic systems? In addition to federal agencies, could the major accounting firms provide algorithmic audits as they do in auditing financial statements of publicly listed companies?


A historic Relic (Sci-fi):. Sam: I laugh at the stupid Prophecizers…

#artificialintelligence

Sam: I laugh at the stupid Prophecizers who are so certain of technological Singularity, for ex: Kurzweil. He is a fraud or worst, an inductivist idiot. Just plot a line of past progresses against time, and extend that "exponential" line to the future. And voila you have got Artificial General Intelligence and the fountain of youth. Chris: So you think AGI is a myth that will never happen?


Top Challenges for AI in Finance in 2023

#artificialintelligence

But what are the challenges for AI in Finance this year? We caught up with experts from JP Morgan & Chase, UBS, University of Greenwich, Cornell University, and Fidelity Investments to find out about the top challenges that AI in Finance will face in 2023. These leading experts will be joining us at the AI in Finance Summit New York on April 19-20, 2023, and AI in Finance Summit London on April 25-26, 2023, where they will be discussing the challenges of AI in Finance in more detail and how to overcome them. Early Bird ticket sale for AI in Finance Summit New York ends on Friday, February 24, so secure your place today to save $500. Early Bird ticket sale for AI in Finance Summit London ends on Friday, March 3, so secure your place today to save £500.


AIhub monthly digest: February 2023 – attending AAAI, awards galore, and GPT-3 for 5-minute crafts

AIHub

In a special award session, the best papers of the conference were announced. The AAAI-2023 outstanding paper award went to Joar Skalse and Alessandro Abate for their work Misspecification in Inverse Reinforcement Learning. The AAAI-2023 outstanding student paper award was given to Decorate the Newcomers: Visual Domain Prompt for Continual Test Time Adaptation, authored by Yulu Gan, Yan Bai, Yihang Lou, Xianzheng Ma, Renrui Zhang, Nian Shi, and Lin Luo. There were also 12 distinguished paper award winners, the details of which can be found here. As well as these best paper awards, a number of prestigious AAAI awards were presented at the conference. These included the AAAI Award for Artificial Intelligence for the Benefit of Humanity, which was won by Tuomas Sandholm. You can find out more about this prize, and the others awarded, here. There will be plenty more content to come as we continue to cover the conference, and hear from participants about their work. You can find our conference coverage here, and this collection will be updated as soon as we add new content.


Meet the first-ever artificial intelligence editor at the Financial Times

#artificialintelligence

As some newsroom roles go the way of the dinosaurs, brand new jobs are being born. This interview is part of an occasional series of Q&As with people who are the first to hold their title in their newsroom. Madhumita Murgia describes herself as an accidental tech journalist. As a biology student, Murgia studied non-human intelligence in a gray parrot named Alex before she ever focused on intelligence of the artificial variety. Now, as the Financial Times' first-ever artificial intelligence editor, Murgia has been tasked with leading coverage on the rapidly evolving field and providing advice and expertise to other FT reporters as they "increasingly encounter stories about how AI is upending industries around the world."


Exploring Challenges and Opportunities to Support Designers in Learning to Co-create with AI-based Manufacturing Design Tools

arXiv.org Artificial Intelligence

AI-based design tools are proliferating in professional software to assist engineering and industrial designers in complex manufacturing and design tasks. These tools take on more agentic roles than traditional computer-aided design tools and are often portrayed as "co-creators." Yet, working effectively with such systems requires different skills than working with complex CAD tools alone. To date, we know little about how engineering designers learn to work with AI-based design tools. In this study, we observed trained designers as they learned to work with two AI-based tools on a realistic design task. We find that designers face many challenges in learning to effectively co-create with current systems, including challenges in understanding and adjusting AI outputs and in communicating their design goals. Based on our findings, we highlight several design opportunities to better support designer-AI co-creation.


GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation

arXiv.org Artificial Intelligence

We present GLM-Dialog, a large-scale language model (LLM) with 10B parameters capable of knowledge-grounded conversation in Chinese using a search engine to access the Internet knowledge. GLM-Dialog offers a series of applicable techniques for exploiting various external knowledge including both helpful and noisy knowledge, enabling the creation of robust knowledge-grounded dialogue LLMs with limited proper datasets. To evaluate the GLM-Dialog more fairly, we also propose a novel evaluation method to allow humans to converse with multiple deployed bots simultaneously and compare their performance implicitly instead of explicitly rating using multidimensional metrics.Comprehensive evaluations from automatic to human perspective demonstrate the advantages of GLM-Dialog comparing with existing open source Chinese dialogue models. We release both the model checkpoint and source code, and also deploy it as a WeChat application to interact with users. We offer our evaluation platform online in an effort to prompt the development of open source models and reliable dialogue evaluation systems. The additional easy-to-use toolkit that consists of short text entity linking, query generation, and helpful knowledge classification is also released to enable diverse applications. All the source code is available on Github.


Policy Dispersion in Non-Markovian Environment

arXiv.org Artificial Intelligence

Markov Decision Process (MDP) presents a mathematical framework to formulate the learning processes of agents in reinforcement learning. MDP is limited by the Markovian assumption that a reward only depends on the immediate state and action. However, a reward sometimes depends on the history of states and actions, which may result in the decision process in a non-Markovian environment. In such environments, agents receive rewards via temporally-extended behaviors sparsely, and the learned policies may be similar. This leads the agents acquired with similar policies generally overfit to the given task and can not quickly adapt to perturbations of environments. To resolve this problem, this paper tries to learn the diverse policies from the history of state-action pairs under a non-Markovian environment, in which a policy dispersion scheme is designed for seeking diverse policy representation. Specifically, we first adopt a transformer-based method to learn policy embeddings. Then, we stack the policy embeddings to construct a dispersion matrix to induce a set of diverse policies. Finally, we prove that if the dispersion matrix is positive definite, the dispersed embeddings can effectively enlarge the disagreements across policies, yielding a diverse expression for the original policy embedding distribution. Experimental results show that this dispersion scheme can obtain more expressive diverse policies, which then derive more robust performance than recent learning baselines under various learning environments.