AITopics | Grammars & Parsing

Collaborating Authors

Grammars & Parsing

News Overviews Instructional Materials AI-Alerts Classics

Synapse: Learning Preferential Concepts from Visual Demonstrations

Modak, Sadanand, Patton, Noah, Dillig, Isil, Biswas, Joydeep

arXiv.org Artificial IntelligenceMay-6-2024

This paper addresses the problem of preference learning, which aims to learn user-specific preferences (e.g., "good parking spot", "convenient drop-off location") from visual input. Despite its similarity to learning factual concepts (e.g., "red cube"), preference learning is a fundamentally harder problem due to its subjective nature and the paucity of person-specific training data. We address this problem using a new framework called Synapse, which is a neuro-symbolic approach designed to efficiently learn preferential concepts from limited demonstrations. Synapse represents preferences as neuro-symbolic programs in a domain-specific language (DSL) that operates over images, and leverages a novel combination of visual parsing, large language models, and program synthesis to learn programs representing individual preferences. We evaluate Synapse through extensive experimentation including a user case study focusing on mobility-related concepts in mobile robotics and autonomous driving. Our evaluation demonstrates that Synapse significantly outperforms existing baselines as well as its own ablations. The code and other details can be found on the project website https://amrl.cs.utexas.edu/synapse .

concept library, demonstration, synapse, (13 more...)

arXiv.org Artificial Intelligence

2403.16689

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
Asia > Japan > Shikoku > Kagawa Prefecture > Takamatsu (0.04)

Genre: Research Report > New Finding (0.67)

Industry:

Transportation > Ground > Road (0.68)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(3 more...)

Add feedback

Grammatical Error Correction for Code-Switched Sentences by Learners of English

Chan, Kelvin Wey Han, Bryant, Christopher, Nguyen, Li, Caines, Andrew, Yuan, Zheng

arXiv.org Artificial IntelligenceMay-6-2024

Code-switching (CSW) is a common phenomenon among multilingual speakers where multiple languages are used in a single discourse or utterance. Mixed language utterances may still contain grammatical errors however, yet most existing Grammar Error Correction (GEC) systems have been trained on monolingual data and not developed with CSW in mind. In this work, we conduct the first exploration into the use of GEC systems on CSW text. Through this exploration, we propose a novel method of generating synthetic CSW GEC datasets by translating different spans of text within existing GEC corpora. We then investigate different methods of selecting these spans based on CSW ratio, switch-point factor and linguistic constraints, and identify how they affect the performance of GEC systems on CSW text. Our best model achieves an average increase of 1.57 $F_{0.5}$ across 3 CSW test sets (English-Chinese, English-Korean and English-Japanese) without affecting the model's performance on a monolingual dataset. We furthermore discovered that models trained on one CSW language generalise relatively well to other typologically similar CSW languages.

computational linguistic, dataset, proceedings, (10 more...)

arXiv.org Artificial Intelligence

2404.12489

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.46)
North America > United States > New York (0.04)
Oceania > Australia > Victoria > Melbourne (0.04)
(11 more...)

Genre: Research Report > New Finding (1.00)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Data Science > Data Quality > Data Cleaning (0.72)

Add feedback

Neural Automated Writing Evaluation with Corrective Feedback

Wang, Izia Xiaoxiao, Wu, Xihan, Coates, Edith, Zeng, Min, Kuang, Jiexin, Liu, Siliang, Qiu, Mengyang, Park, Jungyeul

arXiv.org Artificial IntelligenceMay-6-2024

The utilization of technology in second language learning and teaching has become ubiquitous. For the assessment of writing specifically, automated writing evaluation (AWE) and grammatical error correction (GEC) have become immensely popular and effective methods for enhancing writing proficiency and delivering instant and individualized feedback to learners. By leveraging the power of natural language processing (NLP) and machine learning algorithms, AWE and GEC systems have been developed separately to provide language learners with automated corrective feedback and more accurate and unbiased scoring that would otherwise be subject to examiners. In this paper, we propose an integrated system for automated writing evaluation with corrective feedback as a means of bridging the gap between AWE and GEC results for second language learners. This system enables language learners to simulate the essay writing tests: a student writes and submits an essay, and the system returns the assessment of the writing along with suggested grammatical error corrections. Given that automated scoring and grammatical correction are more efficient and cost-effective than human grading, this integrated system would also alleviate the burden of manually correcting innumerable essays.

computational linguistic, language learner, proceedings, (11 more...)

arXiv.org Artificial Intelligence

2402.17613

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Georgia > Clarke County > Athens (0.14)
North America > Canada > Ontario > Toronto (0.05)
(15 more...)

Genre:

Research Report (1.00)
Overview (0.93)

Industry:

Education > Curriculum > Subject-Specific Education (0.67)
Education > Educational Technology > Educational Software > Computer-Aided Assessment (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.57)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Modern Information Technologies in Scientific Research and Educational Activities

Malakhov, Kyrylo, Kaverinskiy, Vadislav, Ivanova, Liliia, Romanyuk, Oleksandr, Romaniuk, Oksana, Voinova, Svitlana, Kotlyk, Sergii, Sokolova, Oksana

arXiv.org Artificial IntelligenceMay-4-2024

Nowadays, there is a rapid development of information technology, which entails the need to constantly improve and expand the capabilities of interactive artificial intelligence systems This monograph combines several current topics related to the field of information technology One of the key topics is the methodology for enhancing the capabilities of conversational systems, with a focus on ChatGPT, which represents the latest advance in the field of artificial intelligence The monograph also discusses text generation systems based on ontological representations, which open up wide opportunities for creating high-quality content A special place in the work is given to an automated computer system for diagnosing the competitiveness of specialists in the field of information technology This helps to effectively assess the professionalism of specialists and determine the need for advanced training Theoretical aspects of correct color rendering and informatization of educational and research work of graduate students are important in ensuring the quality of education and scientific research And finally, the use of technology for creating 3D models has become an integral part of the modern information environment, which makes it possible to bring the most daring ideas and projects to life Research and development in these areas contribute to the improvement of information technologies, finding application in various fields of activity The purpose of our monograph is to conduct analysis and research in these areas in order to promote the development of information technologies and increase their efficiency The monograph was compiled based on the results of the XVI international scientific and practical conference "Information technologies and automation -- 2023", which took place in October 2023 at Odessa National University of Technology

photogrammetric coordinate system, physical and rehabilitation medicine, scientific research and educational activity, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.31274/isudp.2024.151

2407.10296

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.13)
Europe > Ukraine > Kyiv Oblast > Kyiv (0.04)
Europe > Ukraine > Vinnytsia Oblast > Vinnytsia (0.04)
(14 more...)

Genre:

Research Report > Promising Solution (1.00)
Research Report > Experimental Study (1.00)
Instructional Material (1.00)
Research Report > New Finding (0.67)

Industry:

Media > Photography (1.00)
Media > Film (1.00)
Materials (1.00)
(11 more...)

Technology:

Information Technology > Communications > Web > Semantic Web (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(6 more...)

Add feedback

Towards Incremental Transformers: An Empirical Analysis of Transformer Models for Incremental NLU

Kahardipraja, Patrick, Madureira, Brielen, Schlangen, David

arXiv.org Artificial IntelligenceMay-2-2024

Incremental processing allows interactive systems to respond based on partial inputs, which is a desirable property e.g. in dialogue agents. The currently popular Transformer architecture inherently processes sequences as a whole, abstracting away the notion of time. Recent work attempts to apply Transformers incrementally via restart-incrementality by repeatedly feeding, to an unchanged model, increasingly longer input prefixes to produce partial outputs. However, this approach is computationally costly and does not scale efficiently for long sequences. In parallel, we witness efforts to make Transformers more efficient, e.g. the Linear Transformer (LT) with a recurrence mechanism. In this work, we examine the feasibility of LT for incremental NLU in English. Our results show that the recurrent LT model has better incremental performance and faster inference speed compared to the standard Transformer and LT with restart-incrementality, at the cost of part of the non-incremental (full sequence) quality. We show that the performance drop can be mitigated by training the model to wait for right context before committing to an output and that training with input prefixes is beneficial for delivering correct partial outputs.

computational linguistic, glove & po 0, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2109.07364

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Germany > Brandenburg > Potsdam (0.04)
(14 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.46)

Add feedback

Silencing the Risk, Not the Whistle: A Semi-automated Text Sanitization Tool for Mitigating the Risk of Whistleblower Re-Identification

Staufer, Dimitri, Pallas, Frank, Berendt, Bettina

arXiv.org Artificial IntelligenceMay-2-2024

Whistleblowing is essential for ensuring transparency and accountability in both public and private sectors. However, (potential) whistleblowers often fear or face retaliation, even when reporting anonymously. The specific content of their disclosures and their distinct writing style may re-identify them as the source. Legal measures, such as the EU WBD, are limited in their scope and effectiveness. Therefore, computational methods to prevent re-identification are important complementary tools for encouraging whistleblowers to come forward. However, current text sanitization tools follow a one-size-fits-all approach and take an overly limited view of anonymity. They aim to mitigate identification risk by replacing typical high-risk words (such as person names and other NE labels) and combinations thereof with placeholders. Such an approach, however, is inadequate for the whistleblowing scenario since it neglects further re-identification potential in textual features, including writing style. Therefore, we propose, implement, and evaluate a novel classification and mitigation strategy for rewriting texts that involves the whistleblower in the assessment of the risk and utility. Our prototypical tool semi-automatically evaluates risk at the word/term level and applies risk-adapted anonymization techniques to produce a grammatically disjointed yet appropriately sanitized text. We then use a LLM that we fine-tuned for paraphrasing to render this text coherent and style-neutral. We evaluate our tool's effectiveness using court cases from the ECHR and excerpts from a real-world whistleblower testimony and measure the protection against authorship attribution (AA) attacks and utility loss statistically using the popular IMDb62 movie reviews dataset. Our method can significantly reduce AA accuracy from 98.81% to 31.22%, while preserving up to 73.1% of the original content's semantics.

identifier, information, whistleblower, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3630106.3658936

2405.01097

Country:

North America > United States (1.00)
Europe > Austria > Vienna (0.14)
North America > Montserrat (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine (1.00)
Government > Regional Government > North America Government > United States Government (0.93)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(6 more...)

Add feedback

Qualia and the Formal Structure of Meaning

Arsiwalla, Xerxes D.

arXiv.org Artificial IntelligenceMay-2-2024

This work explores the hypothesis that subjectively attributed meaning constitutes the phenomenal content of conscious experience. That is, phenomenal content is semantic. This form of subjective meaning manifests as an intrinsic and non-representational character of qualia. Empirically, subjective meaning is ubiquitous in conscious experiences. We point to phenomenological studies that lend evidence to support this. Furthermore, this notion of meaning closely relates to what Frege refers to as "sense", in metaphysics and philosophy of language. It also aligns with Peirce's "interpretant", in semiotics. We discuss how Frege's sense can also be extended to the raw feels of consciousness. Sense and reference both play a role in phenomenal experience. Moreover, within the context of the mind-matter relation, we provide a formalization of subjective meaning associated to one's mental representations. Identifying the precise maps between the physical and mental domains, we argue that syntactic and semantic structures transcend language, and are realized within each of these domains. Formally, meaning is a relational attribute, realized via a map that interprets syntactic structures of a formal system within an appropriate semantic space. The image of this map within the mental domain is what is relevant for experience, and thus comprises the phenomenal content of qualia. We conclude with possible implications this may have for experience-based theories of consciousness.

consciousness, relation, representation, (17 more...)

arXiv.org Artificial Intelligence

2405.01148

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > New York (0.04)
North America > Canada > Ontario > Toronto (0.04)
(5 more...)

Genre: Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)

Add feedback

Modeling Orthographic Variation in Occitan's Dialects

Hopton, Zachary William, Aepli, Noëmi

arXiv.org Artificial IntelligenceApr-30-2024

Effectively normalizing textual data poses a considerable challenge, especially for low-resource languages lacking standardized writing systems. In this study, we fine-tuned a multilingual model with data from several Occitan dialects and conducted a series of experiments to assess the model's representations of these dialects. For evaluation purposes, we compiled a parallel lexicon encompassing four Occitan dialects. Intrinsic evaluations of the model's embeddings revealed that surface similarity between the dialects strengthened representations. When the model was further fine-tuned for part-of-speech tagging and Universal Dependency parsing, its performance was robust to dialectical variation, even when trained solely on part-of-speech data from a single dialect. Our findings suggest that large multilingual models minimize the need for spelling normalization during pre-processing.

computational linguistic, dialect, occitan, (15 more...)

arXiv.org Artificial Intelligence

2404.19315

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
(11 more...)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.56)

Add feedback

Mining patterns in syntax trees to automate code reviews of student solutions for programming exercises

Van Petegem, Charlotte, Demeyere, Kasper, Maertens, Rien, Strijbol, Niko, De Wever, Bram, Mesuere, Bart, Dawyndt, Peter

arXiv.org Artificial IntelligenceApr-26-2024

In programming education, providing manual feedback is essential but labour-intensive, posing challenges in consistency and timeliness. We introduce ECHO, a machine learning method to automate the reuse of feedback in educational code reviews by analysing patterns in abstract syntax trees. This study investigates two primary questions: whether ECHO can predict feedback annotations to specific lines of student code based on previously added annotations by human reviewers (RQ1), and whether its training and prediction speeds are suitable for using ECHO for real-time feedback during live code reviews by human reviewers (RQ2). Our results, based on annotations from both automated linting tools and human reviewers, show that ECHO can accurately and quickly predict appropriate feedback annotations. Its efficiency in processing and its flexibility in adapting to feedback patterns can significantly reduce the time and effort required for manual feedback provisioning in educational settings.

annotation, submission, subtree, (15 more...)

arXiv.org Artificial Intelligence

2405.01579

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > Belgium > Flanders (0.04)

Genre:

Research Report > New Finding (1.00)
Instructional Material (1.00)

Industry: Education > Educational Setting (0.88)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.69)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.48)

Add feedback

A Bionic Natural Language Parser Equivalent to a Pushdown Automaton

Wei, Zhenghao, Lin, Kehua, Feng, Jianlin

arXiv.org Artificial IntelligenceApr-26-2024

Assembly Calculus (AC), proposed by Papadimitriou et al., aims to reproduce advanced cognitive functions through simulating neural activities, with several applications based on AC having been developed, including a natural language parser proposed by Mitropolsky et al. However, this parser lacks the ability to handle Kleene closures, preventing it from parsing all regular languages and rendering it weaker than Finite Automata (FA). In this paper, we propose a new bionic natural language parser (BNLP) based on AC and integrates two new biologically rational structures, Recurrent Circuit and Stack Circuit which are inspired by RNN and short-term memory mechanism. In contrast to the original parser, the BNLP can fully handle all regular languages and Dyck languages. Therefore, leveraging the Chomsky-Sch \H{u}tzenberger theorem, the BNLP which can parse all Context-Free Languages can be constructed. We also formally prove that for any PDA, a Parser Automaton corresponding to BNLP can always be formed, ensuring that BNLP has a description ability equal to that of PDA and addressing the deficiencies of the original parser.

disinhibit, fiber, neuron, (13 more...)

arXiv.org Artificial Intelligence

2404.17343

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback