Mitchell, Tom
Conversational Neuro-Symbolic Commonsense Reasoning
Arabshahi, Forough, Lee, Jennifer, Gawarecki, Mikayla, Mazaitis, Kathryn, Azaria, Amos, Mitchell, Tom
One aspect of human commonsense reasoning is the ability to make presumptions about daily experiences, activities and social interactions with others. We propose a new commonsense reasoning benchmark where the task is to uncover commonsense presumptions implied by imprecisely stated natural language commands in the form of if-then-because statements. For example, in the command "If it snows at night then wake me up early because I don't want to be late for work" the speaker relies on commonsense reasoning of the listener to infer the implicit presumption that it must snow enough to cause traffic slowdowns. Such if-then-because commands are particularly important when users instruct conversational agents. We release a benchmark data set for this task, collected from humans and annotated with commonsense presumptions. We develop a neuro-symbolic theorem prover that extracts multi-hop reasoning chains and apply it to this problem. We further develop an interactive conversational framework that evokes commonsense knowledge from humans for completing reasoning chains.
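As a concrete rendering of the task's input format, the sketch below parses an if-then-because command into its three clauses and attaches an annotated presumption. The Command class, regex, and field names are illustrative assumptions, not the benchmark's actual schema.

    import re
    from dataclasses import dataclass, field

    @dataclass
    class Command:
        condition: str      # the "if" clause
        action: str         # the "then" clause
        justification: str  # the "because" clause
        presumptions: list = field(default_factory=list)  # annotated commonsense gaps

    def parse_if_then_because(text):
        """Split an if-then-because command into its three clauses."""
        m = re.match(r"if (.+?) then (.+?) because (.+)", text, re.IGNORECASE)
        if m is None:
            raise ValueError("not an if-then-because command")
        return Command(*m.groups())

    cmd = parse_if_then_because("If it snows at night then wake me up early "
                                "because I don't want to be late for work")
    # The implicit presumption from the example above, as an annotation:
    cmd.presumptions.append("it snows enough to cause traffic slowdowns")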
Jelly Bean World: A Testbed for Never-Ending Learning
Platanios, Emmanouil Antonios, Saparov, Abulhair, Mitchell, Tom
Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging researchers to design machine learning systems that can learn to perform a wider variety of inter-related tasks in more complex environments. To date, there is no environment or testbed to facilitate the development and evaluation of never-ending learning systems. To this end, we propose the Jelly Bean World testbed. The Jelly Bean World allows experimentation over two-dimensional grid worlds which are filled with items and in which agents can navigate. This testbed provides environments that are sufficiently complex and where more generally intelligent algorithms ought to perform better than current state-of-the-art reinforcement learning approaches. It does so by producing non-stationary environments and facilitating experimentation with multi-task, multi-agent, multi-modal, and curriculum learning settings. We hope that this new freely-available software will prompt new research and interest in the development and evaluation of never-ending learning systems and more broadly, general intelligence systems.
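A minimal sketch of the kind of environment described: a toy two-dimensional grid world with collectible items, a toroidal map, and a simple source of non-stationarity. This is an invented stand-in for illustration, not the Jelly Bean World API.

    import random

    class GridWorld:
        """Toy 2-D grid with collectible items (illustrative, not the JBW API)."""
        def __init__(self, size=10, seed=0):
            self.size = size
            self.rng = random.Random(seed)
            self.items = {(self.rng.randrange(size), self.rng.randrange(size))
                          for _ in range(size)}
            self.agent = (0, 0)

        def step(self, move):
            dx, dy = {"N": (0, 1), "S": (0, -1), "E": (1, 0), "W": (-1, 0)}[move]
            x, y = self.agent
            self.agent = ((x + dx) % self.size, (y + dy) % self.size)  # wrap around
            reward = 1.0 if self.agent in self.items else 0.0
            self.items.discard(self.agent)            # item is consumed on pickup
            if self.rng.random() < 0.1:               # non-stationarity: respawn items
                self.items.add((self.rng.randrange(self.size),
                                self.rng.randrange(self.size)))
            return self.agent, reward

    world = GridWorld()
    total = sum(world.step(random.choice("NSEW"))[1] for _ in range(1000))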
Learning Data Manipulation for Augmentation and Weighting
Hu, Zhiting, Tan, Bowen, Salakhutdinov, Ruslan, Mitchell, Tom, Xing, Eric P.
Manipulating data, such as weighting data examples or augmenting with new instances, has been increasingly used to improve model training. Previous work has studied various rule- or learning-based approaches designed for specific types of data manipulation. In this work, we propose a new method that supports learning different manipulation schemes with the same gradient-based algorithm. Our approach builds upon a recent connection between supervised learning and reinforcement learning (RL), and adapts an off-the-shelf reward learning algorithm from RL for joint data manipulation learning and model training. Different parameterizations of the "data reward" function instantiate different manipulation schemes. We showcase data augmentation that learns a text transformation network, and data weighting that dynamically adapts the data sample importance. Experiments show the resulting algorithms significantly improve image and text classification performance in low-data regimes and on class-imbalance problems.
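To make the weighting instantiation concrete, here is a one-step meta-gradient sketch (assuming PyTorch) that learns per-example weights from a validation signal. It follows the "learning to reweight" style rather than the paper's exact RL-derived update, and all shapes and hyperparameters are invented.

    import torch
    import torch.nn.functional as F

    torch.manual_seed(0)
    W = torch.zeros(5, 2, requires_grad=True)          # toy linear classifier
    x_tr, y_tr = torch.randn(64, 5), torch.randint(0, 2, (64,))
    x_va, y_va = torch.randn(32, 5), torch.randint(0, 2, (32,))
    lr = 0.1

    for step in range(100):
        eps = torch.zeros(64, requires_grad=True)      # per-example weights
        per_ex = F.cross_entropy(x_tr @ W, y_tr, reduction="none")
        # Differentiate a hypothetical one-step update through the weights.
        g_w = torch.autograd.grad((eps * per_ex).sum(), W, create_graph=True)[0]
        val_loss = F.cross_entropy(x_va @ (W - lr * g_w), y_va)
        g_eps = torch.autograd.grad(val_loss, eps)[0]  # each example's influence
        w = torch.clamp(-g_eps, min=0)                 # keep examples that help
        w = w / (w.sum() + 1e-8)
        loss = (w * F.cross_entropy(x_tr @ W, y_tr, reduction="none")).sum()
        W.data -= lr * torch.autograd.grad(loss, W)[0] # weighted model update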
Leveraging Knowledge Bases in LSTMs for Improving Machine Reading
Yang, Bishan, Mitchell, Tom
This paper focuses on how to take advantage of external knowledge bases (KBs) to improve recurrent neural networks for machine reading. Traditional methods that exploit knowledge from KBs encode knowledge as discrete indicator features. Not only do these features generalize poorly, but they also require task-specific feature engineering to achieve good performance. We propose KBLSTM, a novel neural model that leverages continuous representations of KBs to enhance the learning of recurrent neural networks for machine reading. To effectively integrate background knowledge with information from the currently processed text, our model employs an attention mechanism with a sentinel to adaptively decide whether to attend to background knowledge and which information from KBs is useful. Experimental results show that our model achieves accuracies that surpass the previous state-of-the-art results for both entity extraction and event extraction on the widely used ACE2005 dataset.
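The sentinel mechanism can be sketched as attention over candidate KB concept embeddings plus one extra "no knowledge needed" candidate. Dot-product scoring below stands in for the model's learned scoring function, and the shapes are illustrative.

    import torch
    import torch.nn.functional as F

    def sentinel_attention(h, kb, sentinel):
        """Mix KB concept vectors into a state vector h, with a sentinel that
        lets the model fall back on the text-only state.
        h: (d,) hidden state; kb: (k, d) KB concept embeddings;
        sentinel: (d,) learned 'no knowledge needed' vector."""
        cands = torch.cat([kb, sentinel.unsqueeze(0)], dim=0)  # (k+1, d)
        alpha = F.softmax(cands @ h, dim=0)                    # attention weights
        context = alpha[:-1] @ kb                              # weighted KB knowledge
        return context + alpha[-1] * h                         # sentinel keeps h itself

    d, k = 8, 3
    out = sentinel_attention(torch.randn(d), torch.randn(k, d), torch.randn(d))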
Contextual Parameter Generation for Universal Neural Machine Translation
Platanios, Emmanouil Antonios, Sachan, Mrinmaya, Neubig, Graham, Mitchell, Tom
We propose a simple modification to existing neural machine translation (NMT) models that enables using a single universal model to translate between multiple languages while allowing for language-specific parameterization, and that can also be used for domain adaptation. Our approach requires no changes to the model architecture of a standard NMT system, but instead introduces a new component, the contextual parameter generator (CPG), that generates the parameters of the system (e.g., weights in a neural network). This parameter generator accepts source and target language embeddings as input, and generates the parameters for the encoder and the decoder, respectively. The rest of the model remains unchanged and is shared across all languages. We show how this simple modification enables the system to use monolingual data for training and also perform zero-shot translation. We further show that it is able to surpass state-of-the-art performance on both the IWSLT-15 and IWSLT-17 datasets and that the learned language embeddings are able to uncover interesting relationships between languages.
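A minimal sketch of the CPG idea, assuming a simple linear generator: a learned language embedding is mapped to the weight matrix of a shared model component. Dimensions and names are illustrative, not the paper's configuration.

    import torch

    n_langs, emb_dim, d = 4, 8, 16
    lang_emb = torch.nn.Embedding(n_langs, emb_dim)    # learned language embeddings
    gen = torch.nn.Linear(emb_dim, d * d)              # the parameter generator

    def encoder_params(src_lang):
        """Generate the encoder's weight matrix from the source-language embedding."""
        theta = gen(lang_emb(torch.tensor(src_lang)))
        return theta.view(d, d)

    W_en = encoder_params(0)                           # e.g., source language 0
    h = torch.tanh(W_en @ torch.randn(d))              # one encoder step with
                                                       # generated parameters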
Inferring Interpersonal Relations in Narrative Summaries
Srivastava, Shashank (Carnegie Mellon University) | Chaturvedi, Snigdha (University of Maryland, College Park) | Mitchell, Tom (Carnegie Mellon University)
Characterizing relationships between people is fundamental for the understanding of narratives. In this work, we address the problem of inferring the polarity of relationships between people in narrative summaries. We formulate the problem as a joint structured prediction for each narrative, and present a general model that combines evidence from linguistic and semantic features, as well as features based on the structure of the social community in the text. We additionally provide a clustering-based approach that can exploit regularities in narrative types, e.g., learning an affinity for love triangles in romantic stories. On a dataset of movie summaries from Wikipedia, our structured models provide more than 30% error reduction over a competitive baseline that considers pairs of characters in isolation.
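As a toy rendering of the joint (rather than pairwise-in-isolation) idea, the sketch below scores each character pair from local evidence and then flips the least-confident edge of any structurally unbalanced triad. The characters, scores, and balance heuristic are invented, not the paper's structured model.

    from itertools import combinations

    chars = ["Anna", "Boris", "Clara"]
    # Local evidence scores per pair (positive = friendly, negative = hostile).
    local = {("Anna", "Boris"): 0.9, ("Anna", "Clara"): -0.4, ("Boris", "Clara"): -0.2}

    def joint_polarity(local, rounds=5):
        pol = {p: (1 if s > 0 else -1) for p, s in local.items()}
        for _ in range(rounds):
            for a, b, c in combinations(chars, 3):
                # Structural balance: a triangle with a negative sign product
                # ("my friend's friend is my enemy") is inconsistent.
                if pol[(a, b)] * pol[(a, c)] * pol[(b, c)] < 0:
                    weakest = min([(a, b), (a, c), (b, c)],
                                  key=lambda p: abs(local[p]))
                    pol[weakest] *= -1          # flip the least-confident edge
        return pol

    print(joint_polarity(local))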
Combining Vector Space Embeddings with Symbolic Logical Inference over Open-Domain Text
Gardner, Matt (Carnegie Mellon University) | Talukdar, Partha (Indian Institute of Science) | Mitchell, Tom (Carnegie Mellon University)
We have recently shown how to combine random walk inference over knowledge bases with vector space representations of surface forms, improving performance on knowledge base inference. In this paper, we formalize the connection of our prior work to logical inference rules, giving some general observations about methods for incorporating vector space representations into symbolic logic systems. Additionally, we present some promising preliminary work that extends these techniques to learning open-domain relations for the purpose of answering multiple choice questions, achieving 67% accuracy on a small test set.
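The core relaxation can be sketched as follows: when following a rule's relation path, accept edges whose relation embedding is close to the rule's relation instead of requiring an exact symbolic match. The graph, vectors, and threshold below are invented for illustration.

    import numpy as np

    rel_vec = {"plays-for": np.array([1.0, 0.0]),
               "is-a-member-of": np.array([0.9, 0.1]),  # near-synonym surface form
               "born-in": np.array([0.0, 1.0])}
    edges = [("alice", "is-a-member-of", "red-sox")]

    def cosine(u, v):
        return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

    def follow(node, relation, threshold=0.9):
        """Return targets reachable by any edge similar enough to `relation`."""
        return [t for s, r, t in edges
                if s == node and cosine(rel_vec[r], rel_vec[relation]) >= threshold]

    print(follow("alice", "plays-for"))   # matches via embedding similarity,
                                          # not an exact symbol match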
The 2005 AAAI Classic Paper Awards
Mitchell, Tom, Levesque, Hector
Mitchell and Levesque provide commentary on the two AAAI Classic Paper awards, given at the AAAI-05 conference in Pittsburgh, Pennsylvania. The two winning papers were "Quantifying the Inductive Bias in Concept Learning," by David Haussler, and "Default Reasoning, Nonmonotonic Logics, and the Frame Problem," by Steve Hanks and Drew McDermott.
Haussler's paper, published in 1986, helped initiate a very fruitful integration of a branch of machine learning with computational learning theory; twenty years later that link is firmly established, and the two research communities have largely merged into one. Starting in the 1950s, with work like Samuels's program that learned strategies for playing checkers, AI researchers had designed and experimented with a variety of learning algorithms and had also developed a number of theoretical results, such as convergence proofs for perceptrons and "learning in the limit." What constrained learning was the "inductive bias" of the learner: the more constraining the inductive bias, the less training data needed to learn the target concept. At the same time, the theory of PAC learning was being developed, which allowed deriving quantitative bounds on the probability of successful learning as a function of the training examples and the complexity of the learner's hypothesis space (as measured by its Vapnik-Chervonenkis dimension). What Haussler's paper did was help introduce this quantitative theory to the AI community. "Default Reasoning, Nonmonotonic Logics, and the Frame Problem," by Steve Hanks and Drew McDermott, concerns the frame problem: determining whether a fact does or does not hold after a sequence of actions. The idea is this: normally, an object is unaffected by an action. If a window is open before an action, it remains open afterward. There are clear exceptions, however, such as the act of closing the window. A variety of formal systems have been proposed that would allow us to infer, in the absence of conflicting information, that the window remains open (or that a polar bear is white, or that a violin has four strings, and so on).
In Memoriam: Charles Rosen, Norman Nielsen, and Saul Amarel
Hart, Peter E., Nilsson, Nils J., Perrault, Ray, Mitchell, Tom, Kulikowski, Casimir A., Leake, David B.
In the span of a few months, the AI community lost four important figures. The fall of 2002 marked the passing of Ray Reiter, for whom a memorial article by Jack Minker appears in this issue. As the issue was going to press, AI lost Saul Amarel, Norm Nielsen, and Charles Rosen. This section of AI Magazine commemorates these friends, leaders, and AI pioneers. We thank Tom Mitchell and Casimir Kulikowski for their memorial to Saul Amarel, Ray Perrault for his remembrance of Norm Nielsen, and Peter Hart and Nils Nilsson for their tribute to Charles Rosen. The AI community mourns our lost colleagues and gratefully remembers their contributions, which meant so much to so many and to the advancement of artificial intelligence as a whole.