AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

Generative Grading: Neural Approximate Parsing for Automated Student Feedback

Malik, Ali, Wu, Mike, Vasavada, Vrinda, Song, Jinpeng, Mitchell, John, Goodman, Noah, Piech, Chris

arXiv.org Machine LearningMay-23-2019

Open access to high-quality education is limited by the difficulty of providing student feedback. In this paper, we present Generative Grading with Neural Approximate Parsing (GG-NAP): a novel approach for providing feedback at scale that is capable of both accurately grading student work while also providing verifiability--a property where the model is able to substantiate its claims with a provable certificate. Our approach uses generative descriptions of student cognition, written as probabilistic programs, to synthesise millions of labelled example solutions to a problem; it then trains inference networks to approximately parse real student solutions according to these generative models. We achieve feedback prediction accuracy comparable to professional human experts in a variety of settings: short-answer questions, programs with graphical output, block-based programming, and short Java programs. In a real classroom, we ran an experiment where humans used GG-NAP to grade, yielding doubled grading accuracy while halving grading time.

machine learning, moveforward, natural language, (13 more...)

arXiv.org Machine Learning

1905.09916

Country:

Europe > United Kingdom > England (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre:

Research Report > Promising Solution (0.48)
Overview > Innovation (0.34)

Industry:

Education > Educational Setting (0.93)
Education > Assessment & Standards > Student Performance (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Content-Dependent Versus Content-Independent Features for Gender and Age Range Identification in Different Types of Texts

Kurdi, M. Zakaria (University of Lynchburg)

AAAI ConferencesMay-15-2019

This paper is about the comparison of content-dependent and content-independent features for the identification of short texts author’s age range and gender. Eight content-dependent features based on profiles of ngrams of words are used. In addition, ninety-eight content-independent features covering all the linguistic aspects of texts from phonology to discourse are used. These features were extracted from three corpora of different sizes and types. Were also conducted some experiments using four different machine learning algorithms combined with these features. The results show that content-dependent features do a better job for gender identification on the three corpora. However, content-independent features did better with the task of age range identification.

artificial intelligence, machine learning, natural language, (16 more...)

AAAI Conferences

The Thirty-Second International Flairs Conference

Country:

North America > United States > South Carolina > Charleston County > Charleston (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
North America > Mexico > Quintana Roo > Cancún (0.04)
(2 more...)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.69)

Add feedback

A Typedriven Vector Semantics for Ellipsis with Anaphora using Lambek Calculus with Limited Contraction

Wijnholds, Gijs, Sadrzadeh, Mehrnoosh

arXiv.org Artificial IntelligenceMay-5-2019

We develop a vector space semantics for verb phrase ellipsis with anaphora using type-driven compositional distributional semantics based on the Lambek calculus with limited contraction (LCC) of J\"ager (2006). Distributional semantics has a lot to say about the statistical collocation-based meanings of content words, but provides little guidance on how to treat function words. Formal semantics on the other hand, has powerful mechanisms for dealing with relative pronouns, coordinators, and the like. Type-driven compositional distributional semantics brings these two models together. We review previous compositional distributional models of relative pronouns, coordination and a restricted account of ellipsis in the DisCoCat framework of Coecke et al. (2010, 2013). We show how DisCoCat cannot deal with general forms of ellipsis, which rely on copying of information, and develop a novel way of connecting typelogical grammar to distributional semantics by assigning vector interpretable lambda terms to derivations of LCC in the style of Muskens & Sadrzadeh (2016). What follows is an account of (verb phrase) ellipsis in which word meanings can be copied: the meaning of a sentence is now a program with non-linear access to individual word embeddings. We present the theoretical setting, work out examples, and demonstrate our results on a toy distributional model motivated by data.

artificial intelligence, easychair, natural language, (17 more...)

arXiv.org Artificial Intelligence

1905.01647

Country:

Europe > Netherlands > North Holland > Amsterdam (0.04)
North America > United States > Pennsylvania (0.04)
Europe > Netherlands > South Holland > Dordrecht (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.85)

Add feedback

ASER: A Large-scale Eventuality Knowledge Graph

Zhang, Hongming, Liu, Xin, Pan, Haojie, Song, Yangqiu, Wing-Ki, Cane, Leung, null

arXiv.org Artificial IntelligenceMay-1-2019

Understanding human's language requires complex world knowledge. However, existing large-scale knowledge graphs mainly focus on knowledge about entities while ignoring knowledge about activities, states, or events, which are used to describe how entities or things act in the real world. To fill this gap, we develop ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11-billion-token unstructured textual data. ASER contains 15 relation types belonging to five categories, 194-million unique eventualities, and 64-million unique edges among them. Both human and extrinsic evaluations demonstrate the quality and effectiveness of ASER.

artificial intelligence, machine learning, natural language, (23 more...)

arXiv.org Artificial Intelligence

1905.0027

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.95)
(3 more...)

Add feedback

Learning from Implicit Information in Natural Language Instructions for Robotic Manipulations

Can, Ozan Arkan, Martires, Pedro Zuidberg Dos, Persson, Andreas, Gaal, Julian, Loutfi, Amy, De Raedt, Luc, Yuret, Deniz, Saffiotti, Alessandro

arXiv.org Artificial IntelligenceApr-30-2019

Human-robot interaction often occurs in the form of instructions given from a human to a robot. For a robot to successfully follow instructions, a common representation of the world and objects in it should be shared between humans and the robot so that the instructions can be grounded. Achieving this representation can be done via learning, where both the world representation and the language grounding are learned simultaneously. However, in robotics this can be a difficult task due to the cost and scarcity of data. In this paper, we tackle the problem by separately learning the world representation of the robot and the language grounding. While this approach can address the challenges in getting sufficient data, it may give rise to inconsistencies between both learned components. Therefore, we further propose Bayesian learning to resolve such inconsistencies between the natural language grounding and a robot's world representation by exploiting spatio-relational information that is implicitly present in instructions given by a human. Moreover, we demonstrate the feasibility of our approach on a scenario involving a robotic arm in the physical world.

machine learning, natural language, object-oriented architecture, (19 more...)

arXiv.org Artificial Intelligence

1904.13324

Country:

Europe > Sweden > Örebro County > Örebro (0.04)
Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
Asia > Middle East > Republic of Türkiye (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (0.46)
(2 more...)

Add feedback

The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision

Mao, Jiayuan, Gan, Chuang, Kohli, Pushmeet, Tenenbaum, Joshua B., Wu, Jiajun

arXiv.org Artificial IntelligenceApr-26-2019

We propose the Neuro-Symbolic Concept Learner (NS-CL), a model that learns visual concepts, words, and semantic parsing of sentences without explicit supervision on any of them; instead, our model learns by simply looking at images and reading paired questions and answers. Our model builds an object-based scene representation and translates sentences into executable, symbolic programs. To bridge the learning of two modules, we use a neuro-symbolic reasoning module that executes these programs on the latent scene representation. Analogical to human concept learning, the perception module learns visual concepts based on the language description of the object being referred to. Meanwhile, the learned visual concepts facilitate learning new words and parsing new sentences. We use curriculum learning to guide the searching over the large compositional space of images and language. Extensive experiments demonstrate the accuracy and efficiency of our model on learning visual concepts, word representations, and semantic parsing of sentences. Further, our method allows easy generalization to new object attributes, compositions, language concepts, scenes and questions, and even new program domains. It also empowers applications including visual question answering and bidirectional image-text retrieval.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1904.12584

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Obfuscation for Privacy-preserving Syntactic Parsing

Hu, Zhifeng, Havrylov, Serhii, Titov, Ivan, Cohen, Shay B.

arXiv.org Artificial IntelligenceApr-21-2019

The goal of homomorphic encryption is to encrypt data such that another party can operate on it without being explicitly exposed to the content of the original data. We introduce an idea for a privacy-preserving transformation on natural language data, inspired by homomorphic encryption. Our primary tool is {\em obfuscation}, relying on the properties of natural language. Specifically, a given text is obfuscated using a neural model that aims to preserve the syntactic relationships of the original sentence so that the obfuscated sentence can be parsed instead of the original one. The model works at the word level, and learns to obfuscate each word separately by changing it into a new word that has a similar syntactic role. The text encrypted by our model leads to better performance on three syntactic parsers (two dependency and one constituency parsers) in comparison to a strong random baseline. The substituted words have similar syntactic properties, but different semantic content, compared to the original words.

accuracy, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

1904.09585

Country:

Europe (1.00)
North America > United States (0.93)

Genre:

Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Parsing signal and noise in the brain

ScienceApr-19-2019, 03:57:37 GMT

Like engineers who characterize the fidelity of signals flowing through a circuit, neuroscientists focus on quantifying the degree to which neuronal signals are "noisy" (1, 2). Engineers have the benefit of designing the system and knowing the form of the signal, making identification of corrupting noise relatively straightforward. For neuroscientists, the task is harder, as it entails figuring out first what the signal is, and only then, what the noise is. On pages 254, 253, and 255 of this issue, Gründemann et al. (3), Allen et al. (4), and Stringer et al. (5), respectively, report findings from large-scale neural recordings in the brains of mice and find brainwide activity that correlates with behavior that might usually be ignored as noise. These studies prompt reconsideration of the origin and impacts of "noise" in the nervous system.

artificial intelligence, natural language, parsing signal and noise, (1 more...)

Science

Industry: Health & Medicine > Therapeutic Area > Neurology (0.95)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.40)

Add feedback

CraftAssist Instruction Parsing: Semantic Parsing for a Minecraft Assistant

Jernite, Yacine, Srinet, Kavya, Gray, Jonathan, Szlam, Arthur

arXiv.org Artificial IntelligenceApr-17-2019

We propose a large scale semantic parsing dataset focused on instruction-driven communication with an agent in Minecraft. We describe the data collection process which yields additional 35K human generated instructions with their semantic annotations. We report the performance of three baseline models and find that while a dataset of this size helps us train a usable instruction parser, it still poses interesting generalization challenges which we hope will help develop better and more robust models.

artificial intelligence, natural language, node, (17 more...)

arXiv.org Artificial Intelligence

1905.01978

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Unsupervised Recurrent Neural Network Grammars

Kim, Yoon, Rush, Alexander M., Yu, Lei, Kuncoro, Adhiguna, Dyer, Chris, Melis, Gábor

arXiv.org Machine LearningApr-15-2019

Recurrent neural network grammars (RNNG) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve strong language modeling and parsing performance, but require an annotated corpus of parse trees. In this work, we experiment with unsupervised learning of RNNGs. Since directly marginalizing over the space of latent trees is intractable, we instead apply amortized variational inference. To maximize the evidence lower bound, we develop an inference network parameterized as a neural CRF constituency parser. On language modeling, unsupervised RNNGs perform as well their supervised counterparts on benchmarks in English and Chinese. On constituency grammar induction, they are competitive with recent neural language models that induce tree structures from words through attention mechanisms.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

1904.03746

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback