AITopics | Grammars & Parsing

Grammars & Parsing

Measuring Compositionality in Representation Learning

arXiv.org Machine Learning | Feb-19-2019

Many machine learning algorithms represent input data with vector embeddings or discrete codes. When inputs exhibit compositional structure (e.g. objects built from parts or procedures from subroutines), it is natural to ask whether this compositional structure is reflected in the inputs' learned representations. While the assessment of compositionality in languages has received significant attention in linguistics and adjacent fields, the machine learning literature lacks general-purpose tools for producing graded measurements of compositional structure in more general (e.g. vector-valued) representation spaces. We describe a procedure for evaluating compositionality by measuring how well the true representation-producing model can be approximated by a model that explicitly composes a collection of inferred representational primitives. We use the procedure to provide formal and empirical characterizations of compositional structure in a variety of settings, exploring the relationship between compositionality and learning dynamics, human judgments, representational similarity, and generalization.

compositionality, proceedings, representation, (15 more...)

arXiv.org Machine Learning

1902.07181

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
(3 more...)

Genre: Research Report (0.82)

Industry: Education > Curriculum > Subject-Specific Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
(2 more...)

Learning to Generalize from Sparse and Underspecified Rewards

Agarwal, Rishabh, Liang, Chen, Schuurmans, Dale, Norouzi, Mohammad

arXiv.org Machine Learning | Feb-19-2019

We consider the problem of learning from sparse and underspecified rewards, where an agent receives a complex input, such as a natural language instruction, and needs to generate a complex response, such as an action sequence, while only receiving binary success-failure feedback. Such success-failure rewards are often underspecified: they do not distinguish between purposeful and accidental success. Generalization from underspecified rewards hinges on discounting spurious trajectories that attain accidental success, while learning from sparse feedback requires effective exploration. We address exploration by using a mode-covering direction of KL divergence to collect a diverse set of successful trajectories, followed by a mode-seeking KL divergence to train a robust policy. We propose Meta Reward Learning (MeRL) to construct an auxiliary reward function that provides more refined feedback for learning. The parameters of the auxiliary reward function are optimized with respect to the validation performance of a trained policy. The MeRL approach outperforms our alternative reward learning technique based on Bayesian Optimization, and achieves state-of-the-art results on weakly-supervised semantic parsing. It improves on previous work by 1.2% and 2.4% on the WikiTableQuestions and WikiSQL datasets, respectively.

learning, reward function, trajectory, (16 more...)

arXiv.org Machine Learning

1902.07198

Country:

North America > Canada > Alberta (0.14)
North America > United States (0.04)
Asia > India (0.04)

Genre: Research Report (0.64)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.96)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
(2 more...)

Parsing the Shadow Docket

Slate | Feb-16-2019, 15:45:18 GMT

Slate Plus members get extended, ad-free versions of our podcasts--and much more. Sign up today and try it free for two weeks. Copy this link and add it in your podcast app. For detailed instructions, see our Slate Plus podcasts page. Listen to Amicus via Apple Podcasts, Overcast, Spotify, Stitcher, or Google Podcasts.

amicus, parsing, shadow docket, (1 more...)

Slate

Country: North America > United States > Virginia (0.47)

Industry:

Education > Educational Setting > Higher Education (0.47)
Education > Curriculum > Subject-Specific Education (0.47)
Law > Government & the Courts (0.40)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.40)
Information Technology > Communications > Social Media (0.36)

Improving Semantic Parsing for Task Oriented Dialog

Einolghozati, Arash, Pasupat, Panupong, Gupta, Sonal, Shah, Rushin, Mohit, Mrinal, Lewis, Mike, Zettlemoyer, Luke

arXiv.org Artificial Intelligence | Feb-15-2019

Semantic parsing using hierarchical representations has recently been proposed for task-oriented dialog with promising results [Gupta et al., 2018]. In this paper, we present three different improvements to the model: contextualized embeddings, ensembling, and pairwise re-ranking based on a language model. We taxonomize the errors possible for the hierarchical representation, such as wrong top intent, missing spans or split spans, and show that the three approaches correct different kinds of errors. The best model combines the three techniques and gives 6.4% better exact match accuracy than the state-of-the-art, with an error reduction of 33%, resulting in a new state-of-the-art result on the Task Oriented Parsing (TOP) dataset.

computational linguistic, parser, representation, (13 more...)

arXiv.org Artificial Intelligence

1902.06

Country:

North America > United States > Texas > Bexar County > San Antonio (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Parsing of Audit Work Creates Opening for Technology Firms

WSJ.com: WSJD - Technology | Feb-12-2019, 07:37:54 GMT

Dividing the work could pave the way for companies to automate elements of the audit process, allowing them to free up human resources to focus on improving controls and preventing fraud. "When clients decide to split a professional service, it paves the way for change in the competitive landscape, and that's what's happening in audit at the moment," said Fiona Czerniawska, co-founder of Source Global, which surveyed 150 executives in the U.S. and U.K. who are involved in the selection of external auditors. "People are already starting to act on this." Fifty-nine percent of executives said technology firms would gather data faster and at a lower cost than external accounting and audit firms, the report said. Sixty-one percent said technology firms would do a better job of automating financial processes than these firms, according to the report.

artificial intelligence, auditor, natural language, (7 more...)

WSJ.com: WSJD - Technology

Country: North America > United States (0.27)

Industry: Information Technology (0.88)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.40)

LS-Tree: Model Interpretation When the Data Are Linguistic

Chen, Jianbo, Jordan, Michael I.

arXiv.org Machine Learning | Feb-11-2019

We study the problem of interpreting trained classification models in the setting of linguistic data sets. Leveraging a parse tree, we propose to assign least-squares based importance scores to each word of an instance by exploiting syntactic constituency structure. We establish an axiomatic characterization of these importance scores by relating them to the Banzhaf value in coalitional game theory. Based on these importance scores, we develop a principled method for detecting and quantifying interactions between words in a sentence. We demonstrate that the proposed method can aid in interpretability and diagnostics for several widely-used language models.
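As a rough illustration of the Banzhaf value mentioned in the abstract, the sketch below computes it for a toy three-word "sentence". It is not the LS-Tree method itself: the function names `banzhaf_values` and `toy_value`, the example words, and the scoring rule are all hypothetical, chosen only to show how a word's average marginal contribution over coalitions of the other words is computed.

```python
from itertools import combinations


def banzhaf_values(players, value):
    """Return the Banzhaf value of each player for the set function `value`.

    The Banzhaf value of player p is the average, over all coalitions S of the
    other players, of the marginal contribution value(S + p) - value(S).
    """
    n = len(players)
    scores = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        # Enumerate every subset of the remaining players.
        for size in range(len(others) + 1):
            for coalition in combinations(others, size):
                s = frozenset(coalition)
                total += value(s | {p}) - value(s)
        scores[p] = total / 2 ** (n - 1)
    return scores


def toy_value(words):
    """Hypothetical stand-in for a model score on a subset of words."""
    score = 0.0
    if "good" in words:
        score += 1.0
    if "very" in words:
        score += 0.5
    # Assumed interaction: "not" flips the contribution of "good".
    if "not" in words and "good" in words:
        score -= 2.0
    return score


scores = banzhaf_values(["not", "very", "good"], toy_value)
for word, score in scores.items():
    print(f"{word}: {score:+.2f}")
```

In this toy game "very" contributes the same amount to every coalition, while the score for "not" comes entirely from its interaction with "good", which is the kind of effect an interaction measure is meant to surface.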

interaction score, model interpretation, node, (14 more...)

arXiv.org Machine Learning

1902.04187

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report (0.83)

Industry: Leisure & Entertainment (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.92)

Non-Monotonic Sequential Text Generation

Welleck, Sean, Brantley, Kianté, Daumé, Hal III, Cho, Kyunghyun

arXiv.org Machine Learning | Feb-5-2019

Standard sequential generation methods assume a pre-specified generation order, such as text generation methods which generate words from left to right. In this work, we propose a framework for training models of text generation that operate in non-monotonic orders; the model directly learns good orders, without any additional annotation. Our framework operates by generating a word at an arbitrary position, and then recursively generating words to its left and then words to its right, yielding a binary tree. Learning is framed as imitation learning, including a coaching method which moves from imitating an oracle to reinforcing the policy's own preferences. Experimental results demonstrate that using the proposed method, it is possible to learn policies which generate text without pre-specifying a generation order, while achieving competitive performance with conventional left-to-right generation.
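The binary-tree view described in the abstract can be sketched with a small hand-built example. This is only an illustration of the data structure, not the authors' model: the `Node` class, the helper names, and the example tree are hypothetical, and the sketch assumes the words are produced root first, then the left subtree, then the right subtree. Reading the finished tree in order (left, node, right) recovers the sentence.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """One generated word plus the words generated to its left and right."""
    word: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def in_order(node: Optional[Node]) -> List[str]:
    """Read the tree left-to-right to recover the surface sentence."""
    if node is None:
        return []
    return in_order(node.left) + [node.word] + in_order(node.right)


def generation_order(node: Optional[Node]) -> List[str]:
    """Order in which words are produced: a word, then its left side, then its right side."""
    if node is None:
        return []
    return [node.word] + generation_order(node.left) + generation_order(node.right)


# Hypothetical tree: "born" is generated first, at an arbitrary position.
tree = Node(
    "born",
    left=Node("Obama", left=Node("Barack"), right=Node("was")),
    right=Node("in", right=Node("Hawaii")),
)

print("generated:", generation_order(tree))
print("sentence: ", " ".join(in_order(tree)))
```

The same sentence could be produced by many different trees, which is why the generation order can be learned rather than fixed in advance.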

arxiv preprint arxiv, oracle, sequence, (12 more...)

arXiv.org Machine Learning

1902.02192

Country:

North America > Canada > Quebec > Montreal (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report (0.70)

Industry:

Health & Medicine (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.93)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.71)

Introduction to StanfordNLP with Python Implementation

#artificialintelligence | Feb-4-2019, 01:43:50 GMT

A common question I came across while learning Natural Language Processing (NLP): can we build models for non-English languages? The answer has been no for quite a long time. Each language has its own grammatical patterns and linguistic nuances. I could barely contain my excitement when I read the news last week. The authors claimed StanfordNLP could support more than 53 human languages!

artificial intelligence, natural language, stanfordnlp, (17 more...)

#artificialintelligence

Technology:

Information Technology > Software (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.48)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.31)

StanfordNLP

#artificialintelligence | Feb-2-2019, 08:56:18 GMT

StanfordNLP is the combination of the software package used by the Stanford team in the CoNLL 2018 Shared Task on Universal Dependency Parsing, and the group's official Python interface to the Stanford CoreNLP software. Aside from the functions it inherits from CoreNLP, it contains tools to convert a string of text to lists of sentences and words, generate base forms of those words, their parts of speech and morphological features, and a syntactic structure that is designed to be parallel among more than 70 languages. This package is built with highly accurate neural network components that enable efficient training and evaluation with your own annotated data. The modules are built on top of PyTorch. To see StanfordNLP's neural pipeline in action, you can launch the Python interactive interpreter and try its quick-start commands; at the end, you should be able to see the dependency parse of the first sentence in the example.
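The commands themselves did not survive in this excerpt. The sketch below is a best-effort reconstruction, assuming the early `stanfordnlp` releases exposed `stanfordnlp.download`, `stanfordnlp.Pipeline`, and `print_dependencies` as used here; the wrapper function `show_first_dependency_parse` is hypothetical, and the calls should be checked against the installed version.

```python
def show_first_dependency_parse(text):
    """Print the dependency parse of the first sentence in `text`.

    Assumes the `stanfordnlp` package is installed and that the English
    models can be downloaded (network access and disk space required).
    """
    # Imported lazily so that defining this function does not need the package.
    import stanfordnlp

    stanfordnlp.download("en")    # assumed call: fetch the English models
    nlp = stanfordnlp.Pipeline()  # assumed default: English neural pipeline
    doc = nlp(text)
    doc.sentences[0].print_dependencies()
```

Calling `show_first_dependency_parse("Barack Obama was born in Hawaii. He was elected president in 2008.")` should print one line per word of the first sentence, giving the word, the index of its head, and the dependency relation.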

artificial intelligence, natural language, stanfordnlp, (8 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

The Latest: Trump Says He's 'Set the Stage' for Wall Action

U.S. News | Feb-1-2019, 04:13:12 GMT

The DEA has reported that land ports of entry are the primary means for getting drugs into the country, not stretches of the border without barriers. The agency says the most common trafficking technique by transnational criminal organizations is to hide drugs in passenger vehicles or tractor-trailers.

artificial intelligence, natural language, wall action, (3 more...)

U.S. News

Industry:

Automobiles & Trucks (0.94)
Transportation > Passenger (0.44)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.40)
