Mohamed, Abdel-rahman
Differentiable Greedy Networks
Powers, Thomas, Fakoor, Rasool, Shakeri, Siamak, Sethy, Abhinav, Kainth, Amanjit, Mohamed, Abdel-rahman, Sarikaya, Ruhi
Optimal selection of a subset of items from a given set is a hard problem that requires combinatorial optimization. In this paper, we propose a subset selection algorithm that is trainable with gradient based methods yet achieves near optimal performance via submodular optimization. We focus on the task of identifying a relevant set of sentences for claim verification in the context of the FEVER task. Conventional methods for this task look at sentences on their individual merit and thus do not optimize the informativeness of sentences as a set. We show that our proposed method which builds on the idea of unfolding a greedy algorithm into a computational graph allows both interpretability and gradient based training. The proposed differentiable greedy network (DGN) outperforms discrete optimization algorithms as well as other baseline methods in terms of precision and recall. In this paper, we develop a subset selection algorithm that is differentiable and discrete, which can be trained on supervised data and can model complex dependencies between elements in a straightforward and comprehensible way. This is of particular interest in natural language processing tasks such as fact extraction, fact verification, and question answering where the proposed optimization scheme can be used for evidence retrieval.
Direct optimization of F-measure for retrieval-based personal question answering
Fakoor, Rasool, Kainth, Amanjit, Shakeri, Siamak, Winestock, Christopher, Mohamed, Abdel-rahman, Sarikaya, Ruhi
DIRECT OPTIMIZA TION OF F-MEASURE FOR RETRIEV AL-BASED PERSONAL QUESTION ANSWERING Rasool Fakoorโ , Amanjit Kainth, Siamak Shakeri, Christopher Winestock, Abdel-rahman Mohamed, Ruhi Sarikaya Amazon ABSTRACT Recent advances in spoken language technologies and the introduction of many customer facing products, have given rise to a wide customer reliance on smart personal assistants for many of their daily tasks. In this paper, we present a system to reduce users' cognitive load by extending personal assistants with long-term personal memory where users can store and retrieve by voice, arbitrary pieces of information. The problem is framed as a neural retrieval based question answering system where answers are selected from previously stored user memories. We propose to directly optimize the end-to-end retrieval performance, measured by the F1-score, using reinforcement learning, leading to better performance on our experimental test set(s). Index Terms-- Question Answering, Spoken information retrieval, Reinforcement Learning, Personal Assistants 1. INTRODUCTION Recent advances in speech recognition [1, 2], speech enhancement [3, 4], natural language understanding [5, 6], question answering [7, 8, 9], and dialogue systems [10, 11] have fueled the current surge in research and development for smart personal assistants [12] like Alexa, Siri, Google assistant, and Cortana, with many use cases around shopping, music, etc. In this paper we present a system for providing personal assistants a long term personal memory that enable users to store anything they want to remember by voice, and then later ask questions about it. An example use case is shown in Table 1.
Mean Actor Critic
Asadi, Kavosh, Allen, Cameron, Roderick, Melrose, Mohamed, Abdel-rahman, Konidaris, George, Littman, Michael
We propose a new algorithm, Mean Actor-Critic (MAC), for discrete-action continuous-state reinforcement learning. MAC is a policy gradient algorithm that uses the agent's explicit representation of all action values to estimate the gradient of the policy, rather than using only the actions that were actually executed. This significantly reduces variance in the gradient updates and removes the need for a variance reduction baseline. We show empirical results on two control domains where MAC performs as well as or better than other policy gradient approaches, and on five Atari games, where MAC is competitive with state-of-the-art policy search algorithms.
Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine
Dahl, George, Ranzato, Marc', aurelio, Mohamed, Abdel-rahman, Hinton, Geoffrey E.
Straightforward application of Deep Belief Nets (DBNs) to acoustic modeling produces a rich distributed representation of speech data that is useful for recognition and yields impressive results on the speaker-independent TIMIT phone recognition task. However, the first-layer Gaussian-Bernoulli Restricted Boltzmann Machine (GRBM) has an important limitation, shared with mixtures of diagonal-covariance Gaussians: GRBMs treat different components of the acoustic input vector as conditionally independent given the hidden state. The mean-covariance restricted Boltzmann machine (mcRBM), first introduced for modeling natural images, is a much more representationally efficient and powerful way of modeling the covariance structure of speech data. Every configuration of the precision units of the mcRBM specifies a different precision matrix for the conditional distribution over the acoustic space. In this work, we use the mcRBM to learn features of speech data that serve as input into a standard DBN. The mcRBM features combined with DBNs allow us to achieve a phone error rate of 20.5\%, which is superior to all published results on speaker-independent TIMIT to date.