AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

VLGrammar: Grounded Grammar Induction of Vision and Language

Hong, Yining, Li, Qing, Zhu, Song-Chun, Huang, Siyuan

arXiv.org Artificial IntelligenceMar-24-2021

Cognitive grammar suggests that the acquisition of language grammar is grounded within visual structures. While grammar is an essential representation of natural language, it also exists ubiquitously in vision to represent the hierarchical part-whole structure. In this work, we study grounded grammar induction of vision and language in a joint learning framework. Specifically, we present VLGrammar, a method that uses compound probabilistic context-free grammars (compound PCFGs) to induce the language grammar and the image grammar simultaneously. We propose a novel contrastive learning framework to guide the joint learning of both modules. To provide a benchmark for the grounded grammar induction task, we collect a large-scale dataset, \textsc{PartIt}, which contains human-written sentences that describe part-level semantics for 3D objects. Experiments on the \textsc{PartIt} dataset show that VLGrammar outperforms all baselines in image grammar induction and language grammar induction. The learned VLGrammar naturally benefits related downstream tasks. Specifically, it improves the image unsupervised clustering accuracy by 30\%, and performs well in image retrieval and text retrieval. Notably, the induced grammar shows superior generalizability by easily generalizing to unseen categories.

grammar, grammar induction, vlgrammar, (15 more...)

arXiv.org Artificial Intelligence

2103.12975

Country: North America > United States > California > Los Angeles County > Los Angeles (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

A Panoramic Survey of Natural Language Processing in the Arab World

Communications of the ACMMar-23-2021, 10:43:23 GMT

Though Arabic NLP has many challenges, it has seen many successes and developments.

arab world, arabic, university, (15 more...)

Communications of the ACM

AI-Alerts: 2021 > 2021-03 > AAAI AI-Alert Education for Mar 23, 2021 (1.00)

Country:

Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
Asia > Middle East > Lebanon > Beirut Governorate > Beirut (0.04)
North America > United States > New York (0.04)
(14 more...)

Industry:

Education (0.93)
Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.95)
(3 more...)

Add feedback

Structural block driven - enhanced convolutional neural representation for relation extraction

Wang, Dongsheng, Tiwari, Prayag, Garg, Sahil, Zhu, Hongyin, Bruza, Peter

arXiv.org Artificial IntelligenceMar-21-2021

In this paper, we propose a novel lightweight relation extraction approach of structural block driven - convolutional neural learning. Specifically, we detect the essential sequential tokens associated with entities through dependency analysis, named as a structural block, and only encode the block on a block-wise and an inter-block-wise representation, utilizing multi-scale CNNs. This is to 1) eliminate the noisy from irrelevant part of a sentence; meanwhile 2) enhance the relevant block representation with both block-wise and inter-block-wise semantically enriched representation. Our method has the advantage of being independent of long sentence context since we only encode the sequential tokens within a block boundary. Experiments on two datasets i.e., SemEval2010 and KBP37, demonstrate the significant advantages of our method. In particular, we achieve the new state-of-the-art performance on the KBP37 dataset; and comparable performance with the state-of-the-art on the SemEval2010 dataset.

proceedings, relation extraction, representation, (11 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.asoc.2019.105913

2103.11356

Country:

North America > United States > California > Santa Clara County > Palo Alto (0.05)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Oceania > Australia > Queensland (0.04)
(4 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)

Add feedback

TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Gui, Tao, Wang, Xiao, Zhang, Qi, Liu, Qin, Zou, Yicheng, Zhou, Xin, Zheng, Rui, Zhang, Chong, Wu, Qinzhuo, Ye, Jiacheng, Pang, Zexiong, Zhang, Yongxin, Li, Zhengyan, Ma, Ruotian, Fei, Zichu, Cai, Ruijian, Zhao, Jun, Hu, Xinwu, Yan, Zhiheng, Tan, Yiding, Hu, Yuan, Bian, Qiyuan, Liu, Zhihua, Zhu, Bolin, Qin, Shan, Xing, Xiaoyu, Fu, Jinlan, Zhang, Yue, Peng, Minlong, Zheng, Xiaoqing, Zhou, Yaqian, Wei, Zhongyu, Qiu, Xipeng, Huang, Xuanjing

arXiv.org Artificial IntelligenceMar-21-2021

Various robustness evaluation methodologies from different perspectives have been proposed for different natural language processing (NLP) tasks. These methods have often focused on either universal or task-specific generalization capabilities. In this work, we propose a multilingual robustness evaluation platform for NLP tasks (TextFlint) that incorporates universal text transformation, task-specific transformation, adversarial attack, subpopulation, and their combinations to provide comprehensive robustness analysis. TextFlint enables practitioners to automatically evaluate their models from all aspects or to customize their evaluations as desired with just a few lines of code. To guarantee user acceptability, all the text transformations are linguistically based, and we provide a human evaluation for each one. TextFlint generates complete analytical reports as well as targeted augmented data to address the shortcomings of the model's robustness. To validate TextFlint's utility, we performed large-scale empirical evaluations (over 67,000 evaluations) on state-of-the-art deep learning models, classic supervised methods, and real-world systems. Almost all models showed significant performance degradation, including a decline of more than 50% of BERT's prediction accuracy on tasks such as aspect-level sentiment classification, named entity recognition, and natural language inference. Therefore, we call for the robustness to be included in the model evaluation, so as to promote the healthy development of NLP technology.

dataset, evaluation, transformation, (14 more...)

arXiv.org Artificial Intelligence

2103.11441

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Beijing > Beijing (0.04)
(10 more...)

Genre: Research Report (1.00)

Industry: Government > Military (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.88)

Add feedback

Dependency Graph-to-String Statistical Machine Translation

Li, Liangyou, Way, Andy, Liu, Qun

arXiv.org Artificial IntelligenceMar-20-2021

We present graph-based translation models which translate source graphs into target strings. Source graphs are constructed from dependency trees with extra links so that non-syntactic phrases are connected. Inspired by phrase-based models, we first introduce a translation model which segments a graph into a sequence of disjoint subgraphs and generates a translation by combining subgraph translations left-to-right using beam search. However, similar to phrase-based models, this model is weak at phrase reordering. Therefore, we further introduce a model based on a synchronous node replacement grammar which learns recursive translation rules. We provide two implementations of the model with different restrictions so that source graphs can be parsed efficiently. Experiments on Chinese--English and German--English show that our graph-based models are significantly better than corresponding sequence- and tree-based baselines.

proceedings, subgraph, translation, (15 more...)

arXiv.org Artificial Intelligence

2103.11089

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Africa > South Africa (0.05)
South America > Brazil (0.04)
(43 more...)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.88)

Add feedback

Technical Lead - Language Engineering - DeepSource

#artificialintelligenceMar-19-2021, 08:10:55 GMT

Been a Senior Software engineer or Technical Lead. A polyglot developer and can write proficient code in 2 mainstream programming languages, and you are genuinely interested in working with syntax trees, code transformations, lexical parsing, etc. Can articulate complex technical problems clearly using written and verbal communication, and set a vision that gets your team members excited. Can articulate complex business problems clearly using written and verbal communication, and set a vision that gets your team members excited. Have experience with building tools where the intended user are developers; this is an advantage but not a requirement.

analyzer, deepsource, technical lead, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.58)

Add feedback

Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting

Lin, Chen, Ouyang, Zhichao, Zhuang, Junqing, Chen, Jianqiang, Li, Hui, Wu, Rongxin

arXiv.org Artificial IntelligenceMar-18-2021

Automatic code summarization frees software developers from the heavy burden of manual commenting and benefits software development and maintenance. Abstract Syntax Tree (AST), which depicts the source code's syntactic structure, has been incorporated to guide the generation of code summaries. However, existing AST based methods suffer from the difficulty of training and generate inadequate code summaries. In this paper, we present the Block-wise Abstract Syntax Tree Splitting method (BASTS for short), which fully utilizes the rich tree-form syntax structure in ASTs, for improving code summarization. BASTS splits the code of a method based on the blocks in the dominator tree of the Control Flow Graph, and generates a split AST for each code split. Each split AST is then modeled by a Tree-LSTM using a pre-training strategy to capture local non-linear syntax encoding. The learned syntax encoding is combined with code encoding, and fed into Transformer to generate high-quality code summaries. Comprehensive experiments on benchmarks have demonstrated that BASTS significantly outperforms state-of-the-art approaches in terms of various evaluation metrics. To facilitate reproducibility, our implementation is available at https://github.com/XMUDM/BASTS.

ast, code summarization, summarization, (16 more...)

arXiv.org Artificial Intelligence

2103.07845

Country:

Asia > China > Fujian Province > Xiamen (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Code Completion by Modeling Flattened Abstract Syntax Trees as Graphs

Wang, Yanlin, Li, Hui

arXiv.org Artificial IntelligenceMar-17-2021

Code completion has become an essential component of integrated development environments. Contemporary code completion methods rely on the abstract syntax tree (AST) to generate syntactically correct code. However, they cannot fully capture the sequential and repetitive patterns of writing code and the structural information of the AST. To alleviate these problems, we propose a new code completion approach named CCAG, which models the flattened sequence of a partial AST as an AST graph. CCAG uses our proposed AST Graph Attention Block to capture different dependencies in the AST graph for representation learning in code completion. The sub-tasks of code completion are optimized via multi-task learning in CCAG, and the task balance is automatically achieved using uncertainty without the need to tune task weights. The experimental results show that CCAG has superior performance than state-of-the-art approaches and it is able to provide intelligent code completion.

ast, code completion, node, (15 more...)

arXiv.org Artificial Intelligence

2103.09499

Country: Asia > China > Fujian Province > Xiamen (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.70)

Add feedback

Probabilistic Grammatical Evolution

Mégane, Jessica, Lourenço, Nuno, Machado, Penousal

arXiv.org Artificial IntelligenceMar-15-2021

Grammatical Evolution (GE) is one of the most popular Genetic Programming (GP) variants, and it has been used with success in several problem domains. Since the original proposal, many enhancements have been proposed to GE in order to address some of its main issues and improve its performance. In this paper we propose Probabilistic Grammatical Evolution (PGE), which introduces a new genotypic representation and new mapping mechanism for GE. Specifically, we resort to a Probabilistic Context-Free Grammar (PCFG) where its probabilities are adapted during the evolutionary process, taking into account the productions chosen to construct the fittest individual. The genotype is a list of real values, where each value represents the likelihood of selecting a derivation rule. We evaluate the performance of PGE in two regression problems and compare it with GE and Structured Grammatical Evolution (SGE). The results show that PGE has a a better performance than GE, with statistically significant differences, and achieved similar performance when comparing with SGE.

derivation rule, grammatical evolution, probability, (13 more...)

arXiv.org Artificial Intelligence

2103.08389

Country:

Europe > Portugal > Coimbra > Coimbra (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.92)

Add feedback

Whole brain Probabilistic Generative Model toward Realizing Cognitive Architecture for Developmental Robots

Taniguchi, Tadahiro, Yamakawa, Hiroshi, Nagai, Takayuki, Doya, Kenji, Sakagami, Masamichi, Suzuki, Masahiro, Nakamura, Tomoaki, Taniguchi, Akira

arXiv.org Artificial IntelligenceMar-15-2021

Building a humanlike integrative artificial cognitive system, that is, an artificial general intelligence, is one of the goals in artificial intelligence and developmental robotics. Furthermore, a computational model that enables an artificial cognitive system to achieve cognitive development will be an excellent reference for brain and cognitive science. This paper describes the development of a cognitive architecture using probabilistic generative models (PGMs) to fully mirror the human cognitive system. The integrative model is called a whole-brain PGM (WB-PGM). It is both brain-inspired and PGMbased. In this paper, the process of building the WB-PGM and learning from the human brain to build cognitive architectures is described.

architecture, cognitive architecture, cognitive system, (16 more...)

arXiv.org Artificial Intelligence

2103.08183

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > Middle East > Jordan (0.04)
(7 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Education (0.93)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
(3 more...)

Add feedback