AITopics | xlnet model

xlnet model

TextAge: A Curated and Diverse Text Dataset for Age Classification

Cheekati, Shravan, Gupta, Mridul, Raghu, Vibha, Raj, Pranav

arXiv.org Artificial Intelligence, May-2-2024

Age-related language patterns play a crucial role in understanding linguistic differences and developing age-appropriate communication strategies. However, the lack of comprehensive and diverse datasets has hindered the progress of research in this area. To address this issue, we present TextAge, a curated text dataset that maps sentences to the age and age group of the producer, as well as an underage (under 13) label. TextAge covers a wide range of ages and includes both spoken and written data from various sources such as CHILDES, Meta, Poki Poems-by-kids, JUSThink, and the TV show "Survivor." The dataset undergoes extensive cleaning and preprocessing to ensure data quality and consistency. We demonstrate the utility of TextAge through two applications: Underage Detection and Generational Classification. For Underage Detection, we train a Naive Bayes classifier and fine-tune RoBERTa and XLNet models to differentiate between the language patterns of minors and those of young adults and older. For Generational Classification, the models classify language patterns into different age groups (kids, teens, twenties, etc.). The models excel at classifying the "kids" group but struggle with older age groups, particularly "fifties," "sixties," and "seventies," likely due to limited data samples and less pronounced linguistic differences. TextAge offers a valuable resource for studying age-related language patterns and developing age-sensitive language models. The dataset's diverse composition and the promising results of the classification tasks highlight its potential for various applications, such as content moderation, targeted advertising, and age-appropriate communication. Future work aims to expand the dataset further and explore advanced modeling techniques to improve performance on older age groups.
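The Naive Bayes baseline mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a bag-of-words multinomial model with add-one smoothing, a whitespace tokenizer, and two labels ("underage" and "adult"). The class name `NaiveBayesText` and the training sentences are hypothetical and invented for illustration; they are not drawn from TextAge.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a deliberately crude tokenizer."""
    return text.lower().split()


class NaiveBayesText:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        # Number of training sentences per class (used for the prior).
        self.class_counts = Counter(labels)
        # Per-class word frequencies (used for the likelihood).
        self.word_counts = defaultdict(Counter)
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def log_scores(self, text):
        """Return the unnormalized log posterior for each class."""
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)
            for word in tokenize(text):
                if word in self.vocab:  # ignore words never seen in training
                    score += math.log((counts[word] + 1) / denom)
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Hypothetical toy sentences, invented for illustration; NOT drawn from TextAge.
train_texts = [
    "i want my teddy bear",
    "mommy can i have juice",
    "i need to file my taxes",
    "the meeting moved to friday",
]
train_labels = ["underage", "underage", "adult", "adult"]

model = NaiveBayesText().fit(train_texts, train_labels)
prediction = model.predict("mommy where is my teddy")
print(prediction)
```

With only four training sentences the result says nothing about real performance; the sketch only shows how word counts per class turn into a label decision.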

age group, dataset, language pattern, (11 more...)

2406.1689

Genre: Research Report (0.64)

Industry:

Education (0.48)
Media > Television (0.35)
Leisure & Entertainment (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.58)

Argumentation Element Annotation Modeling using XLNet

Ormerod, Christopher, Burkhardt, Amy, Young, Mackenzie, Lottridge, Sue

arXiv.org Artificial Intelligence, Nov-10-2023

This study demonstrates the effectiveness of XLNet, a transformer-based language model, for annotating argumentative elements in persuasive essays. XLNet's architecture incorporates a recurrent mechanism that allows it to model long-term dependencies in lengthy texts. Fine-tuned XLNet models were applied to three datasets annotated with different schemes: a proprietary dataset using the Annotations for Revisions and Reflections on Writing (ARROW) scheme, the PERSUADE corpus, and the Argument Annotated Essays (AAE) dataset. The XLNet models achieved strong performance across all datasets, even surpassing human agreement levels in some cases. This shows that XLNet capably handles diverse annotation schemes and lengthy essays. Comparisons between the model outputs on different datasets also revealed insights into the relationships between the annotation tags. Overall, XLNet's strong performance on modeling argumentative structures across diverse datasets highlights its suitability for providing automated feedback on essay organization.

annotation scheme, dataset, persuade corpus, (13 more...)

2311.06239

Country:

North America > United States > California > Santa Clara County > Mountain View (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

An Introduction to Working with BERT in Practice

#artificialintelligence, Feb-13-2022, 00:46:20 GMT

Luckily, smaller pretrained BERT or XLNet models are becoming increasingly available for free, and they may well serve as stepping stones for fine-tuning. This means that, in practice, you start by downloading a pretrained BERT or XLNet model, incorporate it into your network, and fine-tune it with much smaller, more manageable datasets. In this article, we'll see how that works. First, let's start by incorporating existing BERT models into our own models. For this to work, we need a dedicated BERT layer: a landing hub for BERT models.

bert model, xlnet model

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.80)

Summarize Twitter Live data using Pretrained NLP models

#artificialintelligence, Nov-7-2020, 19:50:49 GMT

Understand how you can leverage pretrained Natural Language Processing (NLP) models to summarize live Twitter data based on hashtags.

large language model, natural language, twitter data, (12 more...)

Industry: Information Technology > Services (0.39)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.53)
