Goto

Collaborating Authors

 Grammars & Parsing


Stochastic And-Or Grammars: A Unified Framework and Logic Perspective

arXiv.org Artificial Intelligence

Formal grammars are a popular class of knowledge representation that is traditionally confined to the modeling of natural and computer languages. However, several extensions of grammars have been proposed over time to model other types of data such as images [1, 2, 3] and events [4, 5, 6]. One prominent type of extension is stochastic And-Or grammars (AOG) [2]. A stochastic AOG simultaneously models compositions (i.e., a large pattern is the composition of several small patterns arranged according to a certain configuration) and reconfigurations (i.e., a pattern may have several alternative configurations), and in this way it can compactly represent a probabilistic distribution over a large number of patterns. Stochastic AOGs can be used to parse data samples into their compositional structures, which help solve multiple tasks (such as classification, annotation, and segmentation of the data samples) in a unified manner. This work was supported by the National Natural Science Foundation of China (61503248).


Twitter Natural Language Processing -- Noah's ARK

#artificialintelligence

We provide a dependency parser for English tweets, TweeboParser . The parser is trained on a subset of a new labeled corpus for 929 tweets (12,318 tokens) drawn from the POS-tagged tweet corpus of Owoputi et al. (2013), Tweebank . These were created by Lingpeng Kong, Nathan Schneider, Swabha Swayamdipta, Archna Bhatia, Chris Dyer, and Noah A. Smith. Given a tweet, TweeboParser predicts its syntactic structure, represented by unlabeled dependencies. Since a tweet often contains more than one utterance, the output of TweeboParser will often be a multi-rooted graph over the tweet.


Jobs at x.ai x.ai

#artificialintelligence

Our start-up began 2 years ago and we have successfully gone through 23M round-B funding. We have an awesome dataset, an awesome team of data scientists and an equally awesome NLP challenge. We have the ability to quickly produce labeled datasets and test novel NLP techniques, including semantic parsing, deep learning (Convolutional, Recurrent, Recursive neural nets), and various forms of dialogue modeling (e.g. We are looking for PhD-level candidates (or equivalent) with a strong background in either semantic parsers, entity extraction from text, or human-agent dialogue modeling. The candidate is expected to be able to design, implement and lead the evolution of one of our critical NLP tasks together with a team.


What is the difference between pCFGs and HMMs? • /r/MachineLearning

#artificialintelligence

As other people pointed out, PCFGs can express all probabilistic pushdown automata, while HMMs can express all probabilistic finite state automata. So there are things which PCFGs can model (like recursion) which HMMs can't. On the other hand it's pretty difficult to induce PCFGs from data with EM, as the latent variable space is really huge, while for HMMs this is easy. You can think of a PCFG as a hierarchical HMM in the following way. To generate observations from an HMM you start from the initial state and use the transition probabilities to sample the next state, and then use the state's emission probability to sample a symbol, until the next state sampled is the end of sequence state.


Collection of Machine Learning Interview Questions

#artificialintelligence

Here is the link to coursera course for NLP Pick the software from the The Stanford NLP (Natural Language Processing) Group and input some text to view its parse tree, named entities, part of speech tags, etc.


Word Representations, Tree Models and Syntactic Functions

arXiv.org Machine Learning

Word representations induced from models with discrete latent variables (e.g.\ HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both transition and emission components. Such syntactic information can potentially lead to capturing more fine-grain and functional distinctions between words, which, in turn, may be desirable in many NLP applications. We evaluate the word representations on two tasks -- named entity recognition and semantic frame identification. We observe improvements from exploiting syntactic function information in both cases, and the results rivaling those of state-of-the-art representation learning methods. Additionally, we revisit the relationship between sequential and unlabeled-tree models and find that the advantage of the latter is not self-evident.


Synthetic Treebanking for Cross-Lingual Dependency Parsing

Journal of Artificial Intelligence Research

How do we parse the languages for which no treebanks are available? This contribution addresses the cross-lingual viewpoint on statistical dependency parsing, in which we attempt to make use of resource-rich source language treebanks to build and adapt models for the under-resourced target languages. We outline the benefits, and indicate the drawbacks of the current major approaches. We emphasize synthetic treebanking: the automatic creation of target language treebanks by means of annotation projection and machine translation. We present competitive results in cross-lingual dependency parsing using a combination of various techniques that contribute to the overall success of the method. We further include a detailed discussion about the impact of part-of-speech label accuracy on parsing results that provide guidance in practical applications of cross-lingual methods for truly under-resourced languages.


Introduction to the Special Issue on Cross-Language Algorithms and Applications

Journal of Artificial Intelligence Research

With the increasingly global nature of our everyday interactions, the need for multilin- gual technologies to support efficient and effective information access and communication cannot be overemphasized. Computational modeling of language has been the focus of Natural Language Processing, a subdiscipline of Artificial Intelligence. One of the current challenges for this discipline is to design methodologies and algorithms that are cross- language in order to create multilingual technologies rapidly. The goal of this JAIR special issue on Cross-Language Algorithms and Applications (CLAA) is to present leading re- search in this area, with emphasis on developing unifying themes that could lead to the development of the science of multi- and cross-lingualism. In this introduction, we provide the reader with the motivation for this special issue and summarize the contributions of the papers that have been included. The selected papers cover a broad range of cross-lingual technologies including machine translation, domain and language adaptation for sentiment analysis, cross-language lexical resources, dependency parsing, information retrieval and knowledge representation. We anticipate that this special issue will serve as an invaluable resource for researchers interested in topics of cross-lingual natural language processing.


Grammar as a Foreign Language

Neural Information Processing Systems

Syntactic constituency parsing is a fundamental problem in naturallanguage processing which has been the subject of intensive researchand engineering for decades. As a result, the most accurate parsersare domain specific, complex, and inefficient. In this paper we showthat the domain agnostic attention-enhanced sequence-to-sequence modelachieves state-of-the-art results on the most widely used syntacticconstituency parsing dataset, when trained on a large synthetic corpusthat was annotated using existing parsers. It also matches theperformance of standard parsers when trained on a smallhuman-annotated dataset, which shows that this model is highlydata-efficient, in contrast to sequence-to-sequence models without theattention mechanism. Our parser is also fast, processing over ahundred sentences per second with an unoptimized CPU implementation.


Information retrieval in folktales using natural language processing

arXiv.org Artificial Intelligence

Recognising literary characters in various narrative texts is challenging both from the literary and technical perspective. From the literary viewpoint, the meaning of the term "character" leaves space to various interpretations. From the technical perspective, literary texts contain a lot of data about emotions, social life or inner life of the characters, while they are very thin on technical, straightforward messages. To infer the character type from literary texts might pose problems even to the human readers [4]. Interactions between literary characters contain rich social networks.