AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

Toward Natural Language Computation '

AI ClassicsJan-25-2015, 20:27:20 GMT

The ability how they can be combined. Thus the user would be to program in natural language instead of traditional taxed more heavily with a natural language system programming languages would enable people to use than with a traditional system. A second argument familiar constructs in expressing their requests, thus against natural language programming relates to its making machines accessible to a wider user group.

institute of electrical and electronics engineers (ieee), machine learning, the ohio state university, (22 more...)

AI Classics

Country: North America > United States (1.00)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Bibliography

AI ClassicsJan-25-2015, 20:26:40 GMT

IDijkstra 19591 Dijkstra, E., NA Note on Two Problems in Connection with Graphs," Numeri.scht'

relx group plc, united nations, university of wisconsin, (45 more...)

AI Classics

Country:

Europe (1.00)
North America > United States > California > Santa Clara County (0.28)

Genre: Overview (0.46)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
(10 more...)

Add feedback

The SP theory of intelligence: an overview

Wolff, J. Gerard

arXiv.org Artificial IntelligenceJan-7-2015

This article is an overview of the "SP theory of intelligence". The theory aims to simplify and integrate concepts across artificial intelligence, mainstream computing and human perception and cognition, with information compression as a unifying theme. It is conceived as a brain-like system that receives 'New' information and stores some or all of it in compressed form as 'Old' information. It is realised in the form of a computer model -- a first version of the SP machine. The concept of "multiple alignment" is a powerful central idea. Using heuristic techniques, the system builds multiple alignments that are 'good' in terms of information compression. For each multiple alignment, probabilities may be calculated. These provide the basis for calculating the probabilities of inferences. The system learns new structures from partial matches between patterns. Using heuristic techniques, the system searches for sets of structures that are 'good' in terms of information compression. These are normally ones that people judge to be 'natural', in accordance with the 'DONSVIC' principle -- the discovery of natural structures via information compression. The SP theory may be applied in several areas including 'computing', aspects of mathematics and logic, representation of knowledge, natural language processing, pattern recognition, several kinds of reasoning, information storage and retrieval, planning and problem solving, information compression, neuroscience, and human perception and cognition. Examples include the parsing and production of language including discontinuous dependencies in syntax, pattern recognition at multiple levels of abstraction and its integration with part-whole relations, nonmonotonic reasoning and reasoning with default values, reasoning in Bayesian networks including 'explaining away', causal diagnosis, and the solving of a geometric analogy problem.

machine learning, multiple alignment, pattern recognition, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.3390/info4030283

1306.3888

Country:

North America > United States (0.46)
Europe (0.27)

Genre: Research Report (0.81)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)
(4 more...)

Add feedback

Declarative Statistical Modeling with Datalog

Barany, Vince, Cate, Balder ten, Kimelfeld, Benny, Olteanu, Dan, Vagena, Zografoula

arXiv.org Artificial IntelligenceJan-5-2015

Formalisms for specifying statistical models, such as probabilistic-programming languages, typically consist of two components: a specification of a stochastic process (the prior), and a specification of observations that restrict the probability space to a conditional subspace (the posterior). Use cases of such formalisms include the development of algorithms in machine learning and artificial intelligence. We propose and investigate a declarative framework for specifying statistical models on top of a database, through an appropriate extension of Datalog. By virtue of extending Datalog, our framework offers a natural integration with the database, and has a robust declarative semantics. Our Datalog extension provides convenient mechanisms to include numerical probability functions; in particular, conclusions of rules may contain values drawn from such functions. The semantics of a program is a probability distribution over the possible outcomes of the input database with respect to the program; these outcomes are minimal solutions with respect to a related program with existentially quantified variables in conclusions. Observations are naturally incorporated by means of integrity constraints over the extensional and intensional relations. We focus on programs that use discrete numerical distributions, but even then the space of possible outcomes may be uncountable (as a solution can be infinite). We define a probability measure over possible outcomes by applying the known concept of cylinder sets to a probabilistic chase procedure. We show that the resulting semantics is robust under different chases. We also identify conditions guaranteeing that all possible outcomes are finite (and then the probability space is discrete). We argue that the framework we propose retains the purely declarative nature of Datalog, and allows for natural specifications of statistical models.

machine learning, natural language, possible outcome, (19 more...)

arXiv.org Artificial Intelligence

1412.2221

Country:

Europe (0.67)
Asia > Middle East (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Learning Distributed Representations for Structured Output Prediction

Srikumar, Vivek, Manning, Christopher D.

Neural Information Processing SystemsDec-31-2014

In recent years, distributed representations of inputs have led to performance gains in many applications by allowing statistical information to be shared across inputs. However, the predicted outputs (labels, and more generally structures) are still treated as discrete objects even though outputs are often not discrete units of meaning. In this paper, we present a new formulation for structured prediction where we represent individual labels in a structure as dense vectors and allow semantically similar labels to share parameters. We extend this representation to larger structures by defining compositionality using tensor products to give a natural generalization of standard structured prediction approaches. We define a learning objective for jointly learning the model parameters and the label vectors and propose an alternating minimization algorithm for learning. We show that our formulation outperforms structural SVM baselines in two tasks: multiclass document classification and part-of-speech tagging.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.94)

Industry: Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Supervised Learning (0.91)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.89)

Add feedback

Unsupervised Induction of Semantic Roles within a Reconstruction-Error Minimization Framework

Titov, Ivan, Khoddam, Ehsan

arXiv.org Machine LearningDec-8-2014

We introduce a new approach to unsupervised estimation of feature-rich semantic role labeling models. Our model consists of two components: (1) an encoding component: a semantic role labeling model which predicts roles given a rich set of syntactic and lexical features; (2) a reconstruction component: a tensor factorization model which relies on roles to predict argument fillers. When the components are estimated jointly to minimize errors in argument reconstruction, the induced roles largely correspond to roles defined in annotated resources. Our method performs on par with most accurate role induction methods on English and German, even though, unlike these previous approaches, we do not incorporate any prior linguistic knowledge about the languages.

argument, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1412.2812

Country: North America > United States (0.46)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(2 more...)

Add feedback

Discovering and Characterizing Emerging Events in Big Data

Dorr, Bonnie J. (Institute for Human and Machine Cognition (IHMC)) | Petrovic, Milenko (Institute for Human and Machine Cognition (IHMC)) | Allen, James F. (Institute for Human and Machine Cognition (IHMC)) | Teng, Choh Man (Institute for Human and Machine Cognition (IHMC)) | Dalton, Adam (Institute for Human and Machine Cognition (IHMC))

AAAI ConferencesNov-1-2014

We describe a novel system for discovering and characterizing emerging events. We define event emergence to be a developing situation comprised of a series of sub-events. To detect sub-events from a very large, continuous textual input stream, we use two techniques: (1) frequency-based detection of sub-events that are potentially entailed by an emerging event; and (2) anomaly-based detection of other sub-events that are potentially indicative of an emerging event. Identifying emerging events from detected sub-events involves connecting sub-events to each other and to the relevant emerging events within the event models and estimating the likelihood of possible emerging events. Each sub-event can be part of a number of emerging events and supports various event models to varying degrees. We adopt a coherent and compact model that probabilistically identifies emerging events. The innovative aspect of our work is a well-defined framework where statistical Big Data techniques are informed by event semantics and inference techniques (and vice versa). Our work is strongly grounded in semantics and knowledge representation, which enables us to produce more reliable results than would otherwise be possible with a purely statistical approach.

data mining, machine learning, natural language, (19 more...)

AAAI Conferences

2014 AAAI Fall Symposium Series

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.88)
(3 more...)

Add feedback

Programming by Demonstration with Situated Semantic Parsing

Artzi, Yoav (University of Washington) | Forbes, Maxwell (University of Washington) | Lee, Kenton (University of Washington) | Cakmak, Maya (University of Washington)

AAAI ConferencesNov-1-2014

Programming by Demonstration (PbD) is an approach to programming robots by demonstrating the desired behavior. Speech is a natural, hands-free way to augment demonstrations with control commands that guide the PbD process. However, existing speech interfaces for PbD systems rely on ad-hoc, predefined command sets that are rigid and require user training. Instead, we aim to develop flexible speech interfaces to accommodate user variations and ambiguous utterances. To that end, we propose to use a situated semantic parser that jointly reasons about the user's speech and the robot's state to resolve ambiguities. In this paper, we describe this approach and compare its utility to a rigid speech command interface.

artificial intelligence, machine learning, natural language, (17 more...)

AAAI Conferences

2014 AAAI Fall Symposium Series

Country: North America > United States > Washington > King County > Seattle (0.05)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Rapid Adaptation of POS Tagging for Domain Specific Uses

Miller, John E., Bloodgood, Michael, Torii, Manabu, Vijay-Shanker, K.

arXiv.org Machine LearningOct-31-2014

Part-of-speech (POS) tagging is a fundamental component for performing natural language tasks such as parsing, information extraction, and question answering. When POS taggers are trained in one domain and applied in significantly different domains, their performance can degrade dramatically. We present a methodology for rapid adaptation of POS taggers to new domains. Our technique is unsupervised in that a manually annotated corpus for the new domain is not necessary. We use suffix information gathered from large amounts of raw text as well as orthographic information to increase the lexical coverage. We present an experiment in the Biological domain where our POS tagger achieves results comparable to POS taggers specifically trained to this domain. Many machine-learning and statistical techniques employed for POS tagging train a model on an annotated corpus, such as the Penn Treebank (Marcus et al, 1993). Most state-of-the-art POS taggers use two main sources of information: 1) Information about neighboring tags, and 2) Information about the word itself. Methods using both sources of information for tagging are: Hidden Markov Modeling, Maximum Entropy modeling, and Transformation Based Learning (Brill, 1995).

artificial intelligence, information, natural language, (15 more...)

arXiv.org Machine Learning

1411.0007

Country: North America > United States > Delaware (0.15)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Text Rewriting Improves Semantic Role Labeling

Woodsend, K., Lapata, M.

Journal of Artificial Intelligence ResearchSep-19-2014

Large-scale annotated corpora are a prerequisite to developing high-performance NLP systems. Such corpora are expensive to produce, limited in size, often demanding linguistic expertise. In this paper we use text rewriting as a means of increasing the amount of labeled data available for model training. Our method uses automatically extracted rewrite rules from comparable corpora and bitexts to generate multiple versions of sentences annotated with gold standard labels. We apply this idea to semantic role labeling and show that a model trained on rewritten data outperforms the state of the art on the CoNLL-2009 benchmark dataset.

grammar, proceedings, transformation, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4431

AI Access Foundation

10904

Journal of Artificial Intelligence Research

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.15)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Oregon > Multnomah County > Portland (0.04)
(24 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback