AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

Machine Learning with World Knowledge: The Position and Survey

arXiv.org Machine LearningMay-8-2017

Machine learning has become pervasive in multiple domains, impacting a wide variety of applications, such as knowledge discovery and data mining, natural language processing, information retrieval, computer vision, social and health informatics, ubiquitous computing, etc. Two essential problems of machine learning are how to generate features and how to acquire labels for machines to learn. Particularly, labeling large amount of data for each domain-specific problem can be very time consuming and costly. It has become a key obstacle in making learning protocols realistic in applications. In this paper, we will discuss how to use the existing general-purpose world knowledge to enhance machine learning processes, by enriching the features or reducing the labeling work. We start from the comparison of world knowledge with domain-specific knowledge, and then introduce three key problems in using world knowledge in learning processes, i.e., explicit and implicit feature representation, inference for knowledge linking and disambiguation, and learning with direct or indirect supervision. Finally we discuss the future directions of this research topic.

information retrieval, machine learning, natural language, (22 more...)

arXiv.org Machine Learning

1705.02908

Country: North America > United States > Illinois (0.46)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.92)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(7 more...)

Add feedback

SemEval 2017 Task 10: ScienceIE - Extracting Keyphrases and Relations from Scientific Publications

Augenstein, Isabelle, Das, Mrinal, Riedel, Sebastian, Vikraman, Lakshmi, McCallum, Andrew

arXiv.org Machine LearningMay-2-2017

We describe the SemEval task of extracting keyphrases and relations between them from scientific documents, which is crucial for understanding which publications describe which processes, tasks and materials. Although this was a new task, we had a total of 26 submissions across 3 evaluation scenarios. We expect the task and the findings reported in this paper to be relevant for researchers working on understanding scientific content, as well as the broader knowledge base population and information extraction communities.

annotation, keyphrase, proceedings, (13 more...)

arXiv.org Machine Learning

1704.02853

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Europe > Sweden > Uppsala County > Uppsala (0.04)
Europe > Belarus (0.04)

Genre:

Overview (0.46)
Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)

Add feedback

From Language to Programs: Bridging Reinforcement Learning and Maximum Marginal Likelihood

Guu, Kelvin, Pasupat, Panupong, Liu, Evan Zheran, Liang, Percy

arXiv.org Machine LearningApr-25-2017

Our goal is to learn a semantic parser that maps natural language utterances into executable programs when only indirect supervision is available: examples are labeled with the correct execution result, but not the program itself. Consequently, we must search the space of programs for those that output the correct result, while not being misled by spurious programs: incorrect programs that coincidentally output the correct result. We connect two common learning paradigms, reinforcement learning (RL) and maximum marginal likelihood (MML), and then present a new learning algorithm that combines the strengths of both. The new algorithm guards against spurious programs by combining the systematic search traditionally employed in MML with the randomized exploration of RL, and by updating parameters such that probability is spread more evenly across consistent programs. We apply our learning algorithm to a new neural semantic parser and show significant gains over existing state-of-the-art results on a recent context-dependent semantic parsing task.

machine learning, reinforcement learning, spurious program, (19 more...)

arXiv.org Machine Learning

1704.07926

Country: North America > United States (0.28)

Genre: Research Report (0.50)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Abstract Syntax Networks for Code Generation and Semantic Parsing

Rabinovich, Maxim, Stern, Mitchell, Klein, Dan

arXiv.org Machine LearningApr-25-2017

Tasks like code generation and semantic parsing require mapping unstructured (or partially structured) inputs to well-formed, executable outputs. We introduce abstract syntax networks, a modeling framework for these problems. The outputs are represented as abstract syntax trees (ASTs) and constructed by a decoder with a dynamically-determined modular structure paralleling the structure of the output tree. On the benchmark Hearthstone dataset for code generation, our model obtains 79.2 BLEU and 22.7% exact match accuracy, compared to previous state-of-the-art values of 67.1 and 6.1%. Furthermore, we perform competitively on the Atis, Jobs, and Geo semantic parsing datasets with no task-specific engineering.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

1704.07535

Country:

Europe (0.93)
Asia > Middle East (0.68)
North America > United States > California (0.46)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)

Add feedback

Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision

Liang, Chen, Berant, Jonathan, Le, Quoc, Forbus, Kenneth D., Lao, Ni

arXiv.org Artificial IntelligenceApr-23-2017

Harnessing the statistical power of neural networks to perform language understanding and symbolic reasoning is difficult, when it requires executing efficient discrete operations against a large knowledge-base. In this work, we introduce a Neural Symbolic Machine, which contains (a) a neural "programmer", i.e., a sequence-to-sequence model that maps language utterances to programs and utilizes a key-variable memory to handle compositionality (b) a symbolic "computer", i.e., a Lisp interpreter that performs program execution, and helps find good programs by pruning the search space. We apply REINFORCE to directly optimize the task reward of this structured prediction problem. To train with weak supervision and improve the stability of REINFORCE, we augment it with an iterative maximum-likelihood training process. NSM outperforms the state-of-the-art on the WebQuestionsSP dataset when trained from question-answer pairs only, without requiring any feature engineering or domain-specific knowledge.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

1611.0002

Country:

North America > United States (0.68)
Asia > Middle East > Israel (0.14)

Genre:

Workflow (0.68)
Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.89)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Stanford CoreNLP

@machinelearnbotApr-22-2017, 17:29:55 GMT

The classpath must include all of the CoreNLP dependencies. The memory requirements of the server are the same as that of CoreNLP, though it will grow as you load more models (e.g., memory increases if you load both the PCFG and Shift-Reduce constituency parser models). A safe minimum is 4gb; 8gb is recommended if you can spare it. If running the server under docker, the container's port 9000 has to be published to the host.

artificial intelligence, natural language, server, (17 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.49)

Add feedback

How to Use Stanford CoreNLP in Python

@machinelearnbotApr-11-2017, 05:05:23 GMT

If you googled'How to use Stanford CoreNLP in Python?' and landed on this post then you already know what it is. For those who don't know, Stanford CoreNLP is an open source software developed by Stanford that provides various Natural Language Processing tools such as: Stemming, Lemmatization, Part-Of-Speech Tagging, Dependency Parsing, Sentiment Analysis, and Entity Extraction. Stanford CoreNLP is written in Java. If your application is in Java you can simply download and import all the needed jars or setup it with maven. However, I find Python to be more flexible in terms of processing text than Java.

artificial intelligence, natural language, stanford corenlp, (13 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.71)

Add feedback

Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes

Jiang, Chenfanfu, Zhu, Yixin, Qi, Siyuan, Huang, Siyuan, Lin, Jenny, Guo, Xingwen, Yu, Lap-Fai, Terzopoulos, Demetri, Zhu, Song-Chun

arXiv.org Machine LearningApr-3-2017

We propose the configurable rendering of massive quantities of photorealistic images with ground truth for the purposes of training, benchmarking, and diagnosing computer vision models. In contrast to the conventional (crowd-sourced) manual labeling of ground truth for a relatively modest number of RGB-D images captured by Kinect-like sensors, we devise a non-trivial configurable pipeline of algorithms capable of generating a potentially infinite variety of indoor scenes using a stochastic grammar, specifically, one represented by an attributed spatial And-Or graph. We employ physics-based rendering to synthesize photorealistic RGB images while automatically synthesizing detailed, per-pixel ground truth data, including visible surface depth and normal, object identity and material information, as well as illumination. Our pipeline is configurable inasmuch as it enables the precise customization and control of important attributes of the generated scenes. We demonstrate that our generated scenes achieve a performance similar to the NYU v2 Dataset on pre-trained deep learning models. By modifying pipeline components in a controllable manner, we furthermore provide diagnostics on common scene understanding tasks; eg., depth and surface normal prediction, semantic segmentation, etc.

computer vision, machine learning, natural language, (15 more...)

arXiv.org Machine Learning

1704.00112

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(2 more...)

Add feedback

A Neural Probabilistic Structured-Prediction Method for Transition-Based Natural Language Processing

Zhou, Hao, Zhang, Yue, Cheng, Chuan, Huang, Shujian, Dai, Xinyu, Chen, Jiajun

Journal of Artificial Intelligence ResearchMar-29-2017

We propose a neural probabilistic structured-prediction method for transition-based natural language processing, which integrates beam search and contrastive learning. The method uses a global optimization model, which can leverage arbitrary features over non-local context. Beam search is used for efficient heuristic decoding, and contrastive learning is performed for adjusting the model according to search errors. When evaluated on both chunking and dependency parsing tasks, the proposed method achieves significant accuracy improvements over the locally normalized greedy baseline on the two tasks, respectively.

computational linguistic, machine learning, natural language, (21 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5259

AI Access Foundation

11054

Journal of Artificial Intelligence Research

Country: