AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

Stochastic And-Or Grammars: A Unified Framework and Logic Perspective

arXiv.org Artificial IntelligenceApr-11-2016

Formal grammars are a popular class of knowledge representation that is traditionally confined to the modeling of natural and computer languages. However, several extensions of grammars have been proposed over time to model other types of data such as images [1, 2, 3] and events [4, 5, 6]. One prominent type of extension is stochastic And-Or grammars (AOG) [2]. A stochastic AOG simultaneously models compositions (i.e., a large pattern is the composition of several small patterns arranged according to a certain configuration) and reconfigurations (i.e., a pattern may have several alternative configurations), and in this way it can compactly represent a probabilistic distribution over a large number of patterns. Stochastic AOGs can be used to parse data samples into their compositional structures, which help solve multiple tasks (such as classification, annotation, and segmentation of the data samples) in a unified manner. This work was supported by the National Natural Science Foundation of China (61503248).

logic & formal reasoning, machine learning, node, (21 more...)

arXiv.org Artificial Intelligence

1506.00858

Country:

North America > United States (0.46)
Asia > China (0.34)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.68)

Add feedback

Twitter Natural Language Processing -- Noah's ARK

#artificialintelligenceApr-8-2016, 23:33:45 GMT

We provide a dependency parser for English tweets, TweeboParser . The parser is trained on a subset of a new labeled corpus for 929 tweets (12,318 tokens) drawn from the POS-tagged tweet corpus of Owoputi et al. (2013), Tweebank . These were created by Lingpeng Kong, Nathan Schneider, Swabha Swayamdipta, Archna Bhatia, Chris Dyer, and Noah A. Smith. Given a tweet, TweeboParser predicts its syntactic structure, represented by unlabeled dependencies. Since a tweet often contains more than one utterance, the output of TweeboParser will often be a multi-rooted graph over the tweet.

artificial intelligence, natural language, tweeboparser, (2 more...)

#artificialintelligence

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.77)

Add feedback

Jobs at x.ai x.ai

#artificialintelligenceApr-8-2016, 20:05:57 GMT

Our start-up began 2 years ago and we have successfully gone through 23M round-B funding. We have an awesome dataset, an awesome team of data scientists and an equally awesome NLP challenge. We have the ability to quickly produce labeled datasets and test novel NLP techniques, including semantic parsing, deep learning (Convolutional, Recurrent, Recursive neural nets), and various forms of dialogue modeling (e.g. We are looking for PhD-level candidates (or equivalent) with a strong background in either semantic parsers, entity extraction from text, or human-agent dialogue modeling. The candidate is expected to be able to design, implement and lead the evolution of one of our critical NLP tasks together with a team.

deep learning, dialogue modeling, machine learning, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

What is the difference between pCFGs and HMMs? • /r/MachineLearning

#artificialintelligenceApr-1-2016, 16:30:45 GMT

As other people pointed out, PCFGs can express all probabilistic pushdown automata, while HMMs can express all probabilistic finite state automata. So there are things which PCFGs can model (like recursion) which HMMs can't. On the other hand it's pretty difficult to induce PCFGs from data with EM, as the latent variable space is really huge, while for HMMs this is easy. You can think of a PCFG as a hierarchical HMM in the following way. To generate observations from an HMM you start from the initial state and use the transition probabilities to sample the next state, and then use the state's emission probability to sample a symbol, until the next state sampled is the end of sequence state.

artificial intelligence, hmm, natural language, (6 more...)

#artificialintelligence

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Collection of Machine Learning Interview Questions

#artificialintelligenceMar-25-2016, 04:45:41 GMT

Here is the link to coursera course for NLP Pick the software from the The Stanford NLP (Natural Language Processing) Group and input some text to view its parse tree, named entities, part of speech tags, etc.

artificial intelligence, natural language, text processing, (12 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.72)

Industry: Education > Educational Setting > Online (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.91)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.38)

Add feedback

Word Representations, Tree Models and Syntactic Functions

Šuster, Simon, van Noord, Gertjan, Titov, Ivan

arXiv.org Machine LearningFeb-5-2016

Word representations induced from models with discrete latent variables (e.g.\ HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both transition and emission components. Such syntactic information can potentially lead to capturing more fine-grain and functional distinctions between words, which, in turn, may be desirable in many NLP applications. We evaluate the word representations on two tasks -- named entity recognition and semantic frame identification. We observe improvements from exploiting syntactic function information in both cases, and the results rivaling those of state-of-the-art representation learning methods. Additionally, we revisit the relationship between sequential and unlabeled-tree models and find that the advantage of the latter is not self-evident.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Machine Learning

1508.07709

Country:

North America > United States (0.93)
Europe (0.68)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
(2 more...)

Add feedback

Synthetic Treebanking for Cross-Lingual Dependency Parsing

Tiedemann, Jörg, Agić, Zeljko

Journal of Artificial Intelligence ResearchJan-27-2016

How do we parse the languages for which no treebanks are available? This contribution addresses the cross-lingual viewpoint on statistical dependency parsing, in which we attempt to make use of resource-rich source language treebanks to build and adapt models for the under-resourced target languages. We outline the benefits, and indicate the drawbacks of the current major approaches. We emphasize synthetic treebanking: the automatic creation of target language treebanks by means of annotation projection and machine translation. We present competitive results in cross-lingual dependency parsing using a combination of various techniques that contribute to the overall success of the method. We further include a detailed discussion about the impact of part-of-speech label accuracy on parsing results that provide guidance in practical applications of cross-lingual methods for truly under-resourced languages.

dependency, proceedings, treebank, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4785

AI Access Foundation

10980

Journal of Artificial Intelligence Research

Country:

Europe > Finland > Uusimaa > Helsinki (0.04)
Europe > Czechia > Prague (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Africa > Middle East > Egypt > Giza Governorate > Giza (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)

Add feedback

Introduction to the Special Issue on Cross-Language Algorithms and Applications

Costa-jussà, Marta R., Bangalore, Srinivas, Lambert, Patrik, Màrquez, Lluís, Montiel-Ponsoda, Elena

Journal of Artificial Intelligence ResearchJan-12-2016

With the increasingly global nature of our everyday interactions, the need for multilin- gual technologies to support efficient and effective information access and communication cannot be overemphasized. Computational modeling of language has been the focus of Natural Language Processing, a subdiscipline of Artificial Intelligence. One of the current challenges for this discipline is to design methodologies and algorithms that are cross- language in order to create multilingual technologies rapidly. The goal of this JAIR special issue on Cross-Language Algorithms and Applications (CLAA) is to present leading re- search in this area, with emphasis on developing unifying themes that could lead to the development of the science of multi- and cross-lingualism. In this introduction, we provide the reader with the motivation for this special issue and summarize the contributions of the papers that have been included. The selected papers cover a broad range of cross-lingual technologies including machine translation, domain and language adaptation for sentiment analysis, cross-language lexical resources, dependency parsing, information retrieval and knowledge representation. We anticipate that this special issue will serve as an invaluable resource for researchers interested in topics of cross-lingual natural language processing.

machine translation, proceedings, translation, (12 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.5022

AI Access Foundation

10973

Journal of Artificial Intelligence Research

Country:

Asia > India > Karnataka > Bengaluru (0.05)
Europe > Czechia > Prague (0.04)
Europe > Sweden > Uppsala County > Uppsala (0.04)
(18 more...)

Genre:

Overview (0.87)
Collection > Journal > Special Issue (0.77)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
(2 more...)

Add feedback

Grammar as a Foreign Language

Vinyals, Oriol, Kaiser, Łukasz, Koo, Terry, Petrov, Slav, Sutskever, Ilya, Hinton, Geoffrey

Neural Information Processing SystemsDec-31-2015

Syntactic constituency parsing is a fundamental problem in naturallanguage processing which has been the subject of intensive researchand engineering for decades. As a result, the most accurate parsersare domain specific, complex, and inefficient. In this paper we showthat the domain agnostic attention-enhanced sequence-to-sequence modelachieves state-of-the-art results on the most widely used syntacticconstituency parsing dataset, when trained on a large synthetic corpusthat was annotated using existing parsers. It also matches theperformance of standard parsers when trained on a smallhuman-annotated dataset, which shows that this model is highlydata-efficient, in contrast to sequence-to-sequence models without theattention mechanism. Our parser is also fast, processing over ahundred sentences per second with an unoptimized CPU implementation.

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Information retrieval in folktales using natural language processing

Groza, Adrian, Corde, Lidia

arXiv.org Artificial IntelligenceNov-10-2015

Recognising literary characters in various narrative texts is challenging both from the literary and technical perspective. From the literary viewpoint, the meaning of the term "character" leaves space to various interpretations. From the technical perspective, literary texts contain a lot of data about emotions, social life or inner life of the characters, while they are very thin on technical, straightforward messages. To infer the character type from literary texts might pose problems even to the human readers [4]. Interactions between literary characters contain rich social networks.

artificial intelligence, information retrieval, natural language, (18 more...)

arXiv.org Artificial Intelligence

1511.03012

Country: Europe > Romania (0.15)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.52)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.51)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.47)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.46)

Add feedback