Grammars & Parsing
Generative Grading: Neural Approximate Parsing for Automated Student Feedback
Malik, Ali, Wu, Mike, Vasavada, Vrinda, Song, Jinpeng, Mitchell, John, Goodman, Noah, Piech, Chris
Open access to high-quality education is limited by the difficulty of providing student feedback. In this paper, we present Generative Grading with Neural Approximate Parsing (GG-NAP): a novel approach for providing feedback at scale that is capable of both accurately grading student work while also providing verifiability--a property where the model is able to substantiate its claims with a provable certificate. Our approach uses generative descriptions of student cognition, written as probabilistic programs, to synthesise millions of labelled example solutions to a problem; it then trains inference networks to approximately parse real student solutions according to these generative models. We achieve feedback prediction accuracy comparable to professional human experts in a variety of settings: short-answer questions, programs with graphical output, block-based programming, and short Java programs. In a real classroom, we ran an experiment where humans used GG-NAP to grade, yielding doubled grading accuracy while halving grading time.
Content-Dependent Versus Content-Independent Features for Gender and Age Range Identification in Different Types of Texts
Kurdi, M. Zakaria (University of Lynchburg)
This paper is about the comparison of content-dependent and content-independent features for the identification of short texts author’s age range and gender. Eight content-dependent features based on profiles of ngrams of words are used. In addition, ninety-eight content-independent features covering all the linguistic aspects of texts from phonology to discourse are used. These features were extracted from three corpora of different sizes and types. Were also conducted some experiments using four different machine learning algorithms combined with these features. The results show that content-dependent features do a better job for gender identification on the three corpora. However, content-independent features did better with the task of age range identification.
A Typedriven Vector Semantics for Ellipsis with Anaphora using Lambek Calculus with Limited Contraction
Wijnholds, Gijs, Sadrzadeh, Mehrnoosh
We develop a vector space semantics for verb phrase ellipsis with anaphora using type-driven compositional distributional semantics based on the Lambek calculus with limited contraction (LCC) of J\"ager (2006). Distributional semantics has a lot to say about the statistical collocation-based meanings of content words, but provides little guidance on how to treat function words. Formal semantics on the other hand, has powerful mechanisms for dealing with relative pronouns, coordinators, and the like. Type-driven compositional distributional semantics brings these two models together. We review previous compositional distributional models of relative pronouns, coordination and a restricted account of ellipsis in the DisCoCat framework of Coecke et al. (2010, 2013). We show how DisCoCat cannot deal with general forms of ellipsis, which rely on copying of information, and develop a novel way of connecting typelogical grammar to distributional semantics by assigning vector interpretable lambda terms to derivations of LCC in the style of Muskens & Sadrzadeh (2016). What follows is an account of (verb phrase) ellipsis in which word meanings can be copied: the meaning of a sentence is now a program with non-linear access to individual word embeddings. We present the theoretical setting, work out examples, and demonstrate our results on a toy distributional model motivated by data.
ASER: A Large-scale Eventuality Knowledge Graph
Zhang, Hongming, Liu, Xin, Pan, Haojie, Song, Yangqiu, Wing-Ki, Cane, Leung, null
Understanding human's language requires complex world knowledge. However, existing large-scale knowledge graphs mainly focus on knowledge about entities while ignoring knowledge about activities, states, or events, which are used to describe how entities or things act in the real world. To fill this gap, we develop ASER (activities, states, events, and their relations), a large-scale eventuality knowledge graph extracted from more than 11-billion-token unstructured textual data. ASER contains 15 relation types belonging to five categories, 194-million unique eventualities, and 64-million unique edges among them. Both human and extrinsic evaluations demonstrate the quality and effectiveness of ASER.
Learning from Implicit Information in Natural Language Instructions for Robotic Manipulations
Can, Ozan Arkan, Martires, Pedro Zuidberg Dos, Persson, Andreas, Gaal, Julian, Loutfi, Amy, De Raedt, Luc, Yuret, Deniz, Saffiotti, Alessandro
Human-robot interaction often occurs in the form of instructions given from a human to a robot. For a robot to successfully follow instructions, a common representation of the world and objects in it should be shared between humans and the robot so that the instructions can be grounded. Achieving this representation can be done via learning, where both the world representation and the language grounding are learned simultaneously. However, in robotics this can be a difficult task due to the cost and scarcity of data. In this paper, we tackle the problem by separately learning the world representation of the robot and the language grounding. While this approach can address the challenges in getting sufficient data, it may give rise to inconsistencies between both learned components. Therefore, we further propose Bayesian learning to resolve such inconsistencies between the natural language grounding and a robot's world representation by exploiting spatio-relational information that is implicitly present in instructions given by a human. Moreover, we demonstrate the feasibility of our approach on a scenario involving a robotic arm in the physical world.
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
Mao, Jiayuan, Gan, Chuang, Kohli, Pushmeet, Tenenbaum, Joshua B., Wu, Jiajun
We propose the Neuro-Symbolic Concept Learner (NS-CL), a model that learns visual concepts, words, and semantic parsing of sentences without explicit supervision on any of them; instead, our model learns by simply looking at images and reading paired questions and answers. Our model builds an object-based scene representation and translates sentences into executable, symbolic programs. To bridge the learning of two modules, we use a neuro-symbolic reasoning module that executes these programs on the latent scene representation. Analogical to human concept learning, the perception module learns visual concepts based on the language description of the object being referred to. Meanwhile, the learned visual concepts facilitate learning new words and parsing new sentences. We use curriculum learning to guide the searching over the large compositional space of images and language. Extensive experiments demonstrate the accuracy and efficiency of our model on learning visual concepts, word representations, and semantic parsing of sentences. Further, our method allows easy generalization to new object attributes, compositions, language concepts, scenes and questions, and even new program domains. It also empowers applications including visual question answering and bidirectional image-text retrieval.
Obfuscation for Privacy-preserving Syntactic Parsing
Hu, Zhifeng, Havrylov, Serhii, Titov, Ivan, Cohen, Shay B.
The goal of homomorphic encryption is to encrypt data such that another party can operate on it without being explicitly exposed to the content of the original data. We introduce an idea for a privacy-preserving transformation on natural language data, inspired by homomorphic encryption. Our primary tool is {\em obfuscation}, relying on the properties of natural language. Specifically, a given text is obfuscated using a neural model that aims to preserve the syntactic relationships of the original sentence so that the obfuscated sentence can be parsed instead of the original one. The model works at the word level, and learns to obfuscate each word separately by changing it into a new word that has a similar syntactic role. The text encrypted by our model leads to better performance on three syntactic parsers (two dependency and one constituency parsers) in comparison to a strong random baseline. The substituted words have similar syntactic properties, but different semantic content, compared to the original words.
Parsing signal and noise in the brain
Like engineers who characterize the fidelity of signals flowing through a circuit, neuroscientists focus on quantifying the degree to which neuronal signals are "noisy" (1, 2). Engineers have the benefit of designing the system and knowing the form of the signal, making identification of corrupting noise relatively straightforward. For neuroscientists, the task is harder, as it entails figuring out first what the signal is, and only then, what the noise is. On pages 254, 253, and 255 of this issue, Gründemann et al. (3), Allen et al. (4), and Stringer et al. (5), respectively, report findings from large-scale neural recordings in the brains of mice and find brainwide activity that correlates with behavior that might usually be ignored as noise. These studies prompt reconsideration of the origin and impacts of "noise" in the nervous system.
CraftAssist Instruction Parsing: Semantic Parsing for a Minecraft Assistant
Jernite, Yacine, Srinet, Kavya, Gray, Jonathan, Szlam, Arthur
We propose a large scale semantic parsing dataset focused on instruction-driven communication with an agent in Minecraft. We describe the data collection process which yields additional 35K human generated instructions with their semantic annotations. We report the performance of three baseline models and find that while a dataset of this size helps us train a usable instruction parser, it still poses interesting generalization challenges which we hope will help develop better and more robust models.
Unsupervised Recurrent Neural Network Grammars
Kim, Yoon, Rush, Alexander M., Yu, Lei, Kuncoro, Adhiguna, Dyer, Chris, Melis, Gábor
Recurrent neural network grammars (RNNG) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve strong language modeling and parsing performance, but require an annotated corpus of parse trees. In this work, we experiment with unsupervised learning of RNNGs. Since directly marginalizing over the space of latent trees is intractable, we instead apply amortized variational inference. To maximize the evidence lower bound, we develop an inference network parameterized as a neural CRF constituency parser. On language modeling, unsupervised RNNGs perform as well their supervised counterparts on benchmarks in English and Chinese. On constituency grammar induction, they are competitive with recent neural language models that induce tree structures from words through attention mechanisms.