AITopics | Young, Steve

This paper presents two ways of dealing with scarce data in semantic decoding using N-Best speech recognition hypotheses. First, we learn features by using a deep learning architecture in which the weights for the unknown and known categories are jointly optimised. Second, an unsupervised method is used for further tuning the weights. Sharing weights injects prior knowledge into the unknown categories. The unsupervised tuning (i.e. the risk minimisation) improves the F-Measure when recognising nearly zero-shot data on the DSTC3 corpus. This unsupervised method can be applied subject to two assumptions: the rank of the class marginal is assumed to be known and the class-conditional scores of the classifier are assumed to follow a Gaussian distribution.

deep learning, neural network, risk minimisation, (18 more...)

arXiv.org Artificial Intelligence

1806.05484

Country:

North America > United States (0.47)
Europe (0.29)
Asia (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management

Casanueva, Iñigo, Budzianowski, Paweł, Su, Pei-Hao, Mrkšić, Nikola, Wen, Tsung-Hsien, Ultes, Stefan, Rojas-Barahona, Lina, Young, Steve, Gašić, Milica

arXiv.org Machine Learning, Nov-29-2017

Dialogue assistants are rapidly becoming an indispensable daily aid. To avoid the significant effort needed to hand-craft the required dialogue flow, the Dialogue Management (DM) module can be cast as a continuous Markov Decision Process (MDP) and trained through Reinforcement Learning (RL). Several RL models have been investigated over recent years. However, the lack of a common benchmarking framework makes it difficult to perform a fair comparison between different models and their capability to generalise to different environments. Therefore, this paper proposes a set of challenging simulated environments for dialogue model development and evaluation. To provide some baselines, we investigate a number of representative parametric algorithms, namely the deep reinforcement learning algorithms DQN, A2C and Natural Actor-Critic, and compare them to a non-parametric model, GP-SARSA. Both the environments and policy models are implemented using the publicly available PyDial toolkit and released online, in order to establish a testbed framework for further experiments and to facilitate experimental reproducibility.

algorithm, deep learning, neural network, (19 more...)

arXiv.org Machine Learning

1711.11023

Country: Europe > Germany (0.14)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Reward-Balancing for Statistical Spoken Dialogue Systems using Multi-objective Reinforcement Learning

Ultes, Stefan, Budzianowski, Paweł, Casanueva, Iñigo, Mrkšić, Nikola, Rojas-Barahona, Lina, Su, Pei-Hao, Wen, Tsung-Hsien, Gašić, Milica, Young, Steve

arXiv.org Machine Learning, Jul-19-2017

Reinforcement learning is widely used for dialogue policy optimization where the reward function often consists of more than one component, e.g., the dialogue success and the dialogue length. In this work, we propose a structured method for finding a good balance between these components by searching for the optimal reward component weighting. To render this search feasible, we use multi-objective reinforcement learning to significantly reduce the number of training dialogues required. We apply our proposed method to find optimized component weights for six domains and compare them to a default baseline.
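To make the idea of a weighted multi-component reward concrete, here is a minimal sketch assuming a simple linear combination of a success term and a per-turn length penalty. The names `DialogueOutcome` and `scalarised_reward` and the example weights are hypothetical illustrations, not taken from the paper, and the multi-objective search for good weights described in the abstract is not shown.

```python
from dataclasses import dataclass


@dataclass
class DialogueOutcome:
    success: bool   # whether the dialogue achieved the user's goal
    num_turns: int  # dialogue length, counted in turns


def scalarised_reward(outcome: DialogueOutcome,
                      w_success: float,
                      w_length: float) -> float:
    """Combine two reward components with a linear weighting.

    Assumption: success earns w_success once, and every turn costs
    w_length. Both weights are illustrative placeholders.
    """
    success_term = w_success if outcome.success else 0.0
    length_term = -w_length * outcome.num_turns
    return success_term + length_term


if __name__ == "__main__":
    outcome = DialogueOutcome(success=True, num_turns=8)
    # Hypothetical weights: 20 for success, 1 per turn.
    print(scalarised_reward(outcome, w_success=20.0, w_length=1.0))
```

Changing the ratio between the two weights shifts the trade-off between finishing quickly and finishing successfully, which is the balance the abstract refers to.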

artificial intelligence, dialogue, natural language, (16 more...)

arXiv.org Machine Learning

1707.06299

Country:

North America > United States (0.28)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Latent Intention Dialogue Models

Wen, Tsung-Hsien, Miao, Yishu, Blunsom, Phil, Young, Steve

arXiv.org Machine Learning, May-29-2017

Developing a dialogue agent that is capable of making autonomous decisions and communicating in natural language is one of the long-term goals of machine learning research. Traditional approaches either rely on hand-crafting a small state-action set for applying reinforcement learning, which is not scalable, or on constructing deterministic models for learning dialogue sentences, which fail to capture natural conversational variability. In this paper, we propose a Latent Intention Dialogue Model (LIDM) that employs a discrete latent variable to learn underlying dialogue intentions in the framework of neural variational inference. In a goal-oriented dialogue scenario, these latent intentions can be interpreted as actions guiding the generation of machine responses, which can be further refined autonomously by reinforcement learning. The experimental evaluation of LIDM shows that the model outperforms published benchmarks for both corpus-based and human evaluation, demonstrating the effectiveness of discrete latent variable models for learning goal-oriented dialogues.

deep learning, intention, neural network, (14 more...)

arXiv.org Machine Learning

1705.10229

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Network-based End-to-End Trainable Task-oriented Dialogue System

Wen, Tsung-Hsien, Vandyke, David, Mrkšić, Nikola, Gašić, Milica, Rojas-Barahona, Lina M., Su, Pei-Hao, Ultes, Stefan, Young, Steve

arXiv.org Artificial Intelligence, Apr-24-2017

Teaching machines to accomplish tasks by conversing naturally with humans is challenging. Currently, developing task-oriented dialogue systems requires creating multiple components, and this typically involves either a large amount of handcrafting or acquiring costly labelled datasets to solve a statistical learning problem for each component. In this work we introduce a neural network-based text-in, text-out end-to-end trainable goal-oriented dialogue system along with a new way of collecting dialogue data based on a novel pipelined Wizard-of-Oz framework. This approach allows us to develop dialogue systems easily and without making too many assumptions about the task at hand. The results show that the model can converse with human subjects naturally whilst helping them to accomplish tasks in a restaurant search domain.

deep learning, dialogue, neural network, (20 more...)

arXiv.org Artificial Intelligence

1604.04562

Country:

Europe (1.00)
North America > United States > California (0.14)
Asia > Middle East > Qatar (0.14)
North America > United States > Maryland (0.14)

Industry: Consumer Products & Services > Restaurants (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Neural Belief Tracker: Data-Driven Dialogue State Tracking

Mrkšić, Nikola, Ó Séaghdha, Diarmuid, Wen, Tsung-Hsien, Thomson, Blaise, Young, Steve

arXiv.org Artificial Intelligence, Apr-21-2017

One of the core components of modern spoken dialogue systems is the belief tracker, which estimates the user's goal at every step of the dialogue. However, most current approaches have difficulty scaling to larger, more complex dialogue domains. This is due to their dependency on either: a) Spoken Language Understanding models that require large amounts of annotated training data; or b) hand-crafted lexicons for capturing some of the linguistic variation in users' language. We propose a novel Neural Belief Tracking (NBT) framework which overcomes these problems by building on recent advances in representation learning. NBT models reason over pre-trained word vectors, learning to compose them into distributed representations of user utterances and dialogue context. Our evaluation on two datasets shows that this approach surpasses past limitations, matching the performance of state-of-the-art models which rely on hand-crafted semantic lexicons and outperforming them when such lexicons are not provided.

Conditional Generation and Snapshot Learning in Neural Dialogue Systems

Wen, Tsung-Hsien, Gašić, Milica, Mrkšić, Nikola, Rojas-Barahona, Lina M., Su, Pei-Hao, Ultes, Stefan, Vandyke, David, Young, Steve

arXiv.org Machine Learning, Jun-10-2016

Recently, a variety of LSTM-based conditional language models (LM) have been applied across a range of language generation tasks. In this work we study various model architectures and different ways to represent and aggregate the source information in an end-to-end neural dialogue system framework. A method called snapshot learning is also proposed to facilitate learning from supervised sequential signals by applying a companion cross-entropy objective function to the conditioning vector. The experimental and analytical results demonstrate firstly that competition occurs between the conditioning vector and the LM, and the differing architectures provide different trade-offs between the two. Secondly, the discriminative power and transparency of the conditioning vector are key to providing both model interpretability and better performance. Thirdly, snapshot learning leads to consistent performance improvements independent of which architecture is used.

conditioning vector, deep learning, neural network, (18 more...)

arXiv.org Machine Learning

1606.03352

Country: Europe > United Kingdom (0.14)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Statistical Modeling in Continuous Speech Recognition (CSR) (Invited Talk)

Young, Steve

arXiv.org Artificial Intelligence, Jan-10-2013

Automatic continuous speech recognition (CSR) is sufficiently mature that a variety of real world applications are now possible including large vocabulary transcription and interactive spoken dialogues. This paper reviews the evolution of the statistical modelling techniques which underlie current-day systems, specifically hidden Markov models (HMMs) and N-grams. Starting from a description of the speech signal and its parameterisation, the various modelling assumptions and their consequences are discussed. It then describes various techniques by which the effects of these assumptions can be mitigated. Despite the progress that has been made, the limitations of current modelling techniques are still evident. The paper therefore concludes with a brief review of some of the more fundamental modelling work now in progress.

artificial intelligence, natural language, speech recognition, (17 more...)

arXiv.org Artificial Intelligence

1301.2318

Country:

North America > United States (1.00)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Reports on the Twenty-First National Conference on Artificial Intelligence (AAAI-06) Workshop Program

Achtner, Wolfgang, Aimeur, Esma, Anand, Sarabjot Singh, Appelt, Doug, Ashish, Naveen, Barnes, Tiffany, Beck, Joseph E., Dias, M. Bernardine, Doshi, Prashant, Drummond, Chris, Elazmeh, William, Felner, Ariel, Freitag, Dayne, Geffner, Hector, Geib, Christopher W., Goodwin, Richard, Holte, Robert C., Hutter, Frank, Isaac, Fair, Japkowicz, Nathalie, Kaminka, Gal A., Koenig, Sven, Lagoudakis, Michail G., Leake, David B., Lewis, Lundy, Liu, Hugo, Metzler, Ted, Mihalcea, Rada, Mobasher, Bamshad, Poupart, Pascal, Pynadath, David V., Roth-Berghofer, Thomas, Ruml, Wheeler, Schulz, Stefan, Schwarz, Sven, Seneff, Stephanie, Sheth, Amit, Sun, Ron, Thielscher, Michael, Upal, Afzal, Williams, Jason, Young, Steve, Zelenko, Dmitry

AI Magazine, Dec-15-2006

The Workshop program of the Twenty-First National Conference on Artificial Intelligence was held July 16-17, 2006 in Boston, Massachusetts. The program was chaired by Joyce Chai and Keith Decker. The titles of the 17 workshops were AI-Driven Technologies for Service-Oriented Computing; Auction Mechanisms for Robot Coordination; Cognitive Modeling and Agent-Based Social Simulations; Cognitive Robotics; Computational Aesthetics: Artificial Intelligence Approaches to Beauty and Happiness; Educational Data Mining; Evaluation Methods for Machine Learning; Event Extraction and Synthesis; Heuristic Search, Memory-Based Heuristics, and Their Applications; Human Implications of Human-Robot Interaction; Intelligent Techniques in Web Personalization; Learning for Search; Modeling and Retrieval of Context; Modeling Others from Observations; and Statistical and Empirical Approaches for Spoken Dialogue Systems.

artificial intelligence, management and information, natural language, (3 more...)

AI Magazine

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.70)

Nearly Zero-Shot Learning for Semantic Decoding in Spoken Dialogue Systems
