AITopics

Twenty-Third International Joint Conference on Artificial Intelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.60)

Bauer, Daniel (Columbia University)

Understanding Descriptions of Visual Scenes Using Graph Grammars

Automatic generation of 3D scenes from descriptions has applications in communication, education, and entertainment, but requires deep understanding of the input text. I propose thesis work on language understanding using graph-based meaning representations that can be decomposed into primitive spatial relations. The techniques used for analyzing text and transforming it into a scene representation are based on context-free graph grammars. The thesis develops methods for semantic parsing with graphs, acquisition of graph grammars, and satisfaction of spatial and world-knowledge constraints during parsing.

graph grammar, visual scene

Twenty-Seventh AAAI Conference on Artificial Intelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.80)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.73)

Integrating Programming by Example and Natural Language Programming

Manshadi, Mehdi H. (University of Rochester) | Gildea, Daniel (Department of Computer Science) | Allen, James F. (University of Rochester)

We motivate the integration of programming by example and natural language programming by developing a system for specifying programs for simple text editing operations based on regular expressions. The programs are described with unconstrained natural language instructions, and providing one or more examples of input/output. We show that natural language allows the system to deduce the correct program much more often and much faster than is possible with the input/output example(s) alone, showing that natural language programming and programming by example can be combined in a way that overcomes the ambiguities that both methods suffer from individually, while providing a more natural interface to the user.

machine learning, natural language, nl description, (18 more...)

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > California > Orange County > Orange (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.46)

An Extended GHKM Algorithm for Inducing Lambda-SCFG

Li, Peng (Tsinghua University, China) | Liu, Yang | Sun, Maosong

Semantic parsing, which aims at mapping a natural language (NL) sentence into its formal meaning representation (e.g., logical form), has received increasing attention in recent years. While synchronous context-free grammar (SCFG) augmented with lambda calculus (lambda-SCFG) provides an effective mechanism for semantic parsing, how to learn such lambda-SCFG rules still remains a challenge because of the difficulty in determining the correspondence between NL sentences and logical forms. To alleviate this structural divergence problem, we extend the GHKM algorithm, which is a state-of-the-art algorithm for learning synchronous grammars in statistical machine translation, to induce lambda-SCFG from pairs of NL sentences and logical forms. By treating logical forms as trees, we reformulate the theory behind GHKM that gives formal semantics to the alignment between NL words and logical form tokens. Experiments on the GEOQUERY dataset show that our semantic parser achieves an F-measure of 90.2%, the best result published to date.

artificial intelligence, logical form, natural language, (17 more...)

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

North America > United States > Alaska (0.04)
Asia > Singapore (0.04)
Asia > China > Beijing > Beijing (0.04)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.04)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Generating Natural-Language Video Descriptions Using Text-Mined Knowledge

Krishnamoorthy, Niveda (University of Texas at Austin) | Malkarnenkar, Girish (University of Texas at Austin) | Mooney, Raymond (University of Texas at Austin) | Saenko, Kate (University of Massachussets Lowell) | Guadarrama, Sergio (University of California, Berkeley)

We present a holistic data-driven technique that generates natural-language descriptions for videos. We combine the output of state-of-the-art object and activity detectors with "real-world' knowledge to select the most probable subject-verb-object triplet for describing a video. We show that this knowledge, automatically mined from web-scale text corpora, enhances the triplet selection algorithm by providing it contextual information and leads to a four-fold increase in activity identification. Unlike previous methods, our approach can annotate arbitrary videos without requiring the expensive collection and annotation of a similar training video corpus. We evaluate our technique against a baseline that does not use text-mined knowledge and show that humans prefer our descriptions 61% of the time.

artificial intelligence, natural language, video, (17 more...)

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country: North America > United States (0.14)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)

Joint Extraction and Labeling via Graph Propagation for Dictionary Construction

Kim, Doo Soon (Accenture Technology Labs) | Verma, Kunal (Accenture Technology Labs) | Yeh, Peter Z. (Accenture Technology Labs)

In this paper, we present an approach that jointly infers the boundaries of tokens and their labels to construct dictionaries for Information Extraction. Our approach for joint-inference is based on graph propagation, and extends it in two novel ways. First, we extend the graph representation to capture ambiguities that occur during the token extraction phase. Second, we modify the labeling phase (i.e., label propagation) to utilize this new representation, allowing evidence from labeling to be used for token extraction. Our evaluation shows these extensions (and hence our approach) significantly improve the performance of the outcome dictionaries over pipeline-based approaches by preventing aggressive commitment. Our evaluation also shows that our extensions over a base graph-propagation framework improve the precision without hurting the recall.

extraction, machine learning, natural language, (19 more...)

Twenty-Seventh AAAI Conference on Artificial Intelligence

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > San Mateo County > Menlo Park (0.04)
(2 more...)

Genre:

Workflow (0.47)
Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.68)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.94)

AAAI ConferencesMar-21-2013

Part of Speech Tagging Bilingual Speech Transcripts with Intrasentential Model Switching

Rodrigues, Paul (University of Maryland) | Kübler, Sandra (Indiana University)

This paper investigates incremental part of speech tagging for speech transcripts that contain multilin- gual intrasentential code-mixing, and compares the accuracy of a monolithic tagging model trained on a heterogeneous-language dataset to a model that switches between two homogeneous-language tagging models dynamically using word-by-word language identification. We find that the dynamic model, even though presented a smaller context consisting of sen- tence fragments, meets the accuracy of the monolithic code-mixing model which is aware of increased context. Our system is modular, and is designed to be expanded to many-language code-mixing.

artificial intelligence, natural language, speech tagging bilingual speech transcript, (1 more...)

2013 AAAI Spring Symposium Series

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.60)

Journal of Artificial Intelligence ResearchFeb-20-2013

Integrative Semantic Dependency Parsing via Efficient Large-scale Feature Selection

Zhao, H., Zhang, X., Kit, C.

Semantic parsing, i.e., the automatic derivation of meaning representation such as an instantiated predicate-argument structure for a sentence, plays a critical role in deep processing of natural language. Unlike all other top systems of semantic dependency parsing that have to rely on a pipeline framework to chain up a series of submodels each specialized for a specific subtask, the one presented in this article integrates everything into one model, in hopes of achieving desirable integrity and practicality for real applications while maintaining a competitive performance. This integrative approach tackles semantic parsing as a word pair classification problem using a maximum entropy classifier. We leverage adaptive pruning of argument candidates and large-scale feature selection engineering to allow the largest feature space ever in use so far in this field, it achieves a state-of-the-art performance on the evaluation data set for CoNLL-2008 shared task, on top of all but one top pipeline system, confirming its feasibility and effectiveness.

dprel, predicate, template, (16 more...)

doi: 10.1613/jair.3717

AI Access Foundation

10801

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
North America > United States > Colorado > Boulder County > Boulder (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)
(11 more...)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Journal of Artificial Intelligence ResearchFeb-20-2013

Generating Extractive Summaries of Scientific Paradigms

Qazvinian, V., Radev, D. R., Mohammad, S. M., Dorr, B., Zajic, D., Whidby, M., Moon, T.

Researchers and scientists increasingly find themselves in the position of having to quickly understand large amounts of technical material. Our goal is to effectively serve this need by using bibliometric text mining and summarization techniques to generate summaries of scientific literature. We show how we can use citations to produce automatically generated, readily consumable, technical extractive summaries. We first propose C-LexRank, a model for summarizing single scientific articles based on citations, which employs community detection and extracts salient information-rich sentences. Next, we further extend our experiments to summarize a set of papers, which cover the same scientific topic. We generate extractive summaries of a set of Question Answering (QA) and Dependency Parsing (DP) papers, their abstracts, and their citation sentences and show that citations have unique information amenable to creating a summary.

citation sentence, factoid, proceedings, (13 more...)

doi: 10.1613/jair.3732

AI Access Foundation

10800

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)
North America > United States > Maryland > Prince George's County > College Park (0.14)
(21 more...)

Genre:

Research Report > New Finding (1.00)
Overview (0.68)

Industry: Government > Regional Government (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Catoni, Olivier, Mainguy, Thomas

Toric grammars: a new statistical approach to natural language modeling

arXiv.org Machine LearningFeb-11-2013

We propose a new statistical model for computational linguistics. Rather than trying to estimate directly the probability distribution of a random sentence of the language, we define a Markov chain on finite sets of sentences with many finite recurrent communicating classes and define our language model as the invariant probability measures of the chain on each recurrent communicating class. This Markov chain, that we call a communication model, recombines at each step randomly the set of sentences forming its current state, using some grammar rules. When the grammar rules are fixed and known in advance instead of being estimated on the fly, we can prove supplementary mathematical properties. In particular, we can prove in this case that all states are recurrent states, so that the chain defines a partition of its state space into finite recurrent communicating classes. We show that our approach is a decisive departure from Markov models at the sentence level and discuss its relationships with Context Free Grammars. Although the toric grammars we use are closely related to Context Free Grammars, the way we generate the language from the grammar is qualitatively different. Our communication model has two purposes. On the one hand, it is used to define indirectly the probability distribution of a random sentence of the language. On the other hand it can serve as a (crude) model of language transmission from one speaker to another speaker through the communication of a (large) set of sentences.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

1302.2569

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.75)