AITopics | Text Classification

Text Classification

"A text classifier is an automated means of determining some metadata about a document. Text classifiers are used for such diverse needs as spam filtering, suggesting categories for indexing a document created in a content management system, or automatically sorting help desk requests."
– John Graham-Cumming, Naive Bayesian Text Classification. Dr. Dobb's. May 1 2005.

Meta-tuning Language Models to Answer Prompts Better

Zhong, Ruiqi, Lee, Kristy, Zhang, Zheng, Klein, Dan

arXiv.org Artificial Intelligence, Apr-16-2021

Large pretrained language models like GPT-3 have acquired a surprising ability to perform zero-shot classification (ZSC). For example, to classify review sentiments, we can "prompt" the language model with the review and the question "Is the review positive?" as the context, and ask it to predict whether the next word is "Yes" or "No". However, these models are not specialized for answering these prompts. To address this weakness, we propose meta-tuning, which trains the model to specialize in answering prompts but still generalize to unseen tasks. To create the training data, we aggregated 43 existing datasets, annotated 441 label descriptions in total, and unified them into the above question answering (QA) format. After meta-tuning, our model outperforms a same-sized QA model for most labels on unseen tasks, and we forecast that the performance would improve for even larger models. Therefore, measuring ZSC performance on non-specialized language models might underestimate their true capability, and community-wide efforts on aggregating datasets and unifying their formats can help build models that understand prompts better.
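The yes/no question format described in this abstract can be sketched in a few lines. This is an illustration only, not the authors' code: `build_prompt`, `zero_shot_classify`, the prompt template, and the keyword-based `toy_yes_probability` (a hypothetical stand-in for a real language model's probability of the next word being "Yes") are all assumptions made for the example.

```python
from typing import Callable, Dict


def build_prompt(text: str, label_description: str) -> str:
    """Join the input text and a yes/no question into one context string.

    The template is an assumption for illustration, not the paper's format.
    """
    return f"{text}\nQuestion: {label_description}\nAnswer:"


def zero_shot_classify(
    text: str,
    label_descriptions: Dict[str, str],
    yes_probability: Callable[[str], float],
) -> Dict[str, float]:
    """Score each label by the (model-supplied) probability of answering 'Yes'."""
    return {
        label: yes_probability(build_prompt(text, question))
        for label, question in label_descriptions.items()
    }


def toy_yes_probability(prompt: str) -> float:
    """Hypothetical stand-in for a language model: a simple keyword heuristic."""
    text, _, rest = prompt.partition("\nQuestion: ")
    positive_words = {"great", "loved", "excellent"}
    hits = sum(word.strip(".,!").lower() in positive_words for word in text.split())
    is_positive = hits > 0
    wants_positive = "positive" in rest
    return 0.9 if is_positive == wants_positive else 0.1


scores = zero_shot_classify(
    "I loved this film, the cast was excellent!",
    {
        "positive": "Is the review positive?",
        "negative": "Is the review negative?",
    },
    toy_yes_probability,
)
print(scores)
```

In a real setting, `yes_probability` would be replaced by a call into a language model that compares the likelihood of "Yes" and "No" as the next word; the surrounding structure would stay the same.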

classification, dataset, label description, (16 more...)

2104.0467

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > United States > Oregon (0.04)
(19 more...)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.94)
Health & Medicine > Therapeutic Area (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.94)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Consistency Training with Virtual Adversarial Discrete Perturbation

Park, Jungsoo, Kim, Gyuwan, Kang, Jaewoo

arXiv.org Artificial Intelligence, Apr-15-2021

We propose an effective consistency training framework that enforces a training model's predictions given original and perturbed inputs to be similar by adding a discrete noise that would incur the highest divergence between predictions. This virtual adversarial discrete noise obtained by replacing a small portion of tokens while keeping original semantics as much as possible efficiently pushes a training model's decision boundary. Moreover, we perform an iterative refinement process to alleviate the degraded fluency of the perturbed sentence due to the conditional independence assumption. Experimental results show that our proposed method outperforms other consistency training baselines with text editing, paraphrasing, or a continuous noise on semi-supervised text classification tasks and a robustness benchmark.

arxiv preprint arxiv, dataset, prediction, (15 more...)

2104.07284

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.50)

Text Guide: Improving the quality of long text classification by a text selection method based on feature importance

Fiok, Krzysztof, Karwowski, Waldemar, Gutierrez, Edgar, Davahli, Mohammad Reza, Wilamowski, Maciej, Ahram, Tareq, Al-Juaid, Awad, Zurada, Jozef

arXiv.org Artificial Intelligence, Apr-15-2021

The performance of text classification methods has improved greatly over the last decade for text instances of less than 512 tokens. This limit has been adopted by most state-of-the-research transformer models due to the high computational cost of analyzing longer text instances. To mitigate this problem and to improve classification for longer texts, researchers have sought to resolve the underlying causes of the computational cost and have proposed optimizations for the attention mechanism, which is the key element of every transformer model. In our study, we are not pursuing the ultimate goal of long text classification, i.e., the ability to analyze entire text instances at one time while preserving high performance at a reasonable computational cost. Instead, we propose a text truncation method called Text Guide, in which the original text length is reduced to a predefined limit in a manner that improves performance over naive and semi-naive approaches while preserving low computational costs. Text Guide benefits from the concept of feature importance, a notion from the explainable artificial intelligence domain. We demonstrate that Text Guide can be used to improve the performance of recent language models specifically designed for long text classification, such as Longformer. Moreover, we discovered that parameter optimization is the key to Text Guide performance and must be conducted before the method is deployed. Future experiments may reveal additional benefits provided by this new method.
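To make the contrast with naive truncation concrete, here is a minimal sketch of selecting tokens by importance instead of simply keeping the first tokens. It is a simplified illustration, not the Text Guide algorithm itself: the function names, the per-token selection rule, and the `importance` scores are assumptions (in practice such scores would come from a feature-importance method applied to a trained model).

```python
from typing import Dict, List


def naive_truncate(tokens: List[str], limit: int) -> List[str]:
    """Keep only the first `limit` tokens."""
    return tokens[:limit]


def importance_truncate(
    tokens: List[str], limit: int, importance: Dict[str, float]
) -> List[str]:
    """Keep the `limit` most important tokens, preserving their original order.

    `importance` maps a token to a score; unknown tokens score 0.0.
    Ties are broken in favor of tokens that appear earlier in the text.
    """
    if len(tokens) <= limit:
        return list(tokens)
    ranked = sorted(
        range(len(tokens)),
        key=lambda i: (-importance.get(tokens[i], 0.0), i),
    )
    keep = sorted(ranked[:limit])  # restore reading order
    return [tokens[i] for i in keep]


tokens = "the meeting ran long but the refund was never issued to the customer".split()
# Hypothetical importance scores, invented for this example.
importance = {"refund": 1.0, "never": 0.8, "issued": 0.7, "customer": 0.5}

print(naive_truncate(tokens, 4))
print(importance_truncate(tokens, 4, importance))
```

With a limit of four tokens, the naive version keeps the opening words while the importance-based version keeps the words that carry the complaint, which is the intuition behind preferring informed selection when a model's input length is capped.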

experiment, text guide, transformer model, (15 more...)

2104.07225

Country:

North America > United States > Florida > Orange County > Orlando (0.28)
Europe > Poland > Masovia Province > Warsaw (0.04)
Asia > Middle East > Saudi Arabia (0.04)
(3 more...)

Genre: Research Report > New Finding (0.89)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Distributed Word Representation in Tsetlin Machine

Yadav, Rohan Kumar, Jiao, Lei, Granmo, Ole-Christoffer, Goodwin, Morten

arXiv.org Artificial Intelligence, Apr-14-2021

Tsetlin Machine (TM) is an interpretable pattern recognition algorithm based on propositional logic. The algorithm has demonstrated competitive performance in many Natural Language Processing (NLP) tasks, including sentiment analysis, text classification, and Word Sense Disambiguation (WSD). To obtain human-level interpretability, legacy TM employs Boolean input features such as bag-of-words (BOW). However, the BOW representation makes it difficult to use any pre-trained information, for instance, word2vec and GloVe word representations. This restriction has constrained the performance of TM compared to deep neural networks (DNNs) in NLP. To reduce the performance gap, in this paper, we propose a novel way of using pre-trained word representations for TM. The approach significantly enhances the TM performance and maintains interpretability at the same time. We achieve this by extracting semantically related words from pre-trained word representations as input features to the TM. Our experiments show that the accuracy of the proposed approach is significantly higher than the previous BOW-based TM, reaching the level of DNN-based models.
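The idea of enriching Boolean bag-of-words features with semantically related words can be sketched as follows. This is an assumption-laden illustration rather than the paper's procedure: the two-dimensional toy vectors stand in for pre-trained embeddings such as word2vec or GloVe, and the cosine-similarity threshold and function names are invented for the example.

```python
import math
from typing import Dict, List, Set


def cosine(u: List[float], v: List[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero length)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def related_words(
    word: str, embeddings: Dict[str, List[float]], threshold: float
) -> Set[str]:
    """Return other words whose embedding is at least `threshold` similar."""
    if word not in embeddings:
        return set()
    return {
        other
        for other, vec in embeddings.items()
        if other != word and cosine(embeddings[word], vec) >= threshold
    }


def boolean_features(
    tokens: List[str],
    vocabulary: List[str],
    embeddings: Dict[str, List[float]],
    threshold: float = 0.9,
) -> List[int]:
    """Boolean bag-of-words vector, with related words switched on as well."""
    active = set(tokens)
    for token in tokens:
        active |= related_words(token, embeddings, threshold)
    return [1 if word in active else 0 for word in vocabulary]


# Toy 2-d vectors standing in for pre-trained embeddings (values are made up).
embeddings = {
    "good": [1.0, 0.1],
    "great": [0.9, 0.2],
    "bad": [-1.0, 0.1],
    "movie": [0.0, 1.0],
}
vocabulary = ["good", "great", "bad", "movie"]

print(boolean_features(["good", "movie"], vocabulary, embeddings))
```

The input stays Boolean, so each feature still corresponds to a readable word, while "great" is activated by its similarity to "good" even though it does not occur in the text.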

classification, representation, word representation, (13 more...)

2104.06901

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
Asia > Middle East > Qatar > Ad-Dawhah > Doha (0.04)
(12 more...)

Genre: Research Report > New Finding (0.68)

Industry: Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
(2 more...)

Continual Learning for Text Classification with Information Disentanglement Based Regularization

Huang, Yufan, Zhang, Yanzhe, Chen, Jiaao, Wang, Xuezhi, Yang, Diyi

arXiv.org Artificial Intelligence, Apr-12-2021

Continual learning has become increasingly important as it enables NLP models to constantly learn and gain knowledge over time. Previous continual learning methods are mainly designed to preserve knowledge from previous tasks, without much emphasis on how to well generalize models to new tasks. In this work, we propose an information disentanglement based regularization method for continual learning on text classification. Our proposed method first disentangles text hidden spaces into representations that are generic to all tasks and representations specific to each individual task, and further regularizes these representations differently to better constrain the knowledge required to generalize. We also introduce two simple auxiliary tasks: next sentence prediction and task-id prediction, for learning better generic and specific representation spaces. Experiments conducted on large-scale benchmarks demonstrate the effectiveness of our method in continual text classification tasks with various sequences and lengths over state-of-the-art baselines. We have publicly released our code at https://github.com/GT-SALT/IDBR.

computational linguistic, proceedings, representation, (13 more...)

2104.05489

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Italy > Tuscany > Florence (0.04)
North America > United States > California > Alameda County > Oakland (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Classifying the Unstructured IT Service Desk Tickets Using Ensemble of Classifiers

C, Ramya, P, Paramesh S., S, Shreedhara K

arXiv.org Artificial Intelligence, Mar-30-2021

Manual classification of IT service desk tickets may result in tickets being routed to the wrong resolution group. Incorrect assignment leads to reassignment of tickets, unnecessary resource utilization, and delayed resolution. Traditional machine learning algorithms can be used to classify IT service desk tickets automatically. Service desk ticket classifier models can be trained by mining historical unstructured ticket descriptions and their corresponding labels. The model can then be used to classify a new service desk ticket based on its description. The performance of traditional classifier systems can be further improved by using ensemble classification techniques. This paper presents the three most popular ensemble methods, i.e., Bagging, Boosting, and Voting, for combining the predictions from different models to further improve the accuracy of the ticket classifier system. The performance of the ensemble classifier system is checked against the individual base classifiers using various performance metrics. The ensemble of classifiers performed well in comparison with the corresponding base classifiers. The advantages of building such an automated ticket classifier system are a simplified user interface, faster resolution time, improved productivity, customer satisfaction, and growth in business. Real-world service desk ticket data from a large enterprise IT infrastructure is used for our research.
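Of the three ensemble methods named in this abstract, voting is the simplest to show. The sketch below is a minimal hard-voting example under stated assumptions: the three keyword rules are hypothetical stand-ins for trained base classifiers, and the category names are invented for illustration.

```python
from collections import Counter
from typing import Callable, List

Classifier = Callable[[str], str]


def majority_vote(classifiers: List[Classifier], ticket: str) -> str:
    """Return the label predicted by the largest number of base classifiers."""
    votes = [classify(ticket) for classify in classifiers]
    return Counter(votes).most_common(1)[0][0]


# Hypothetical base classifiers: simple keyword rules standing in for
# trained models.
def by_vpn(ticket: str) -> str:
    return "network" if "vpn" in ticket.lower() else "other"


def by_connect(ticket: str) -> str:
    return "network" if "connect" in ticket.lower() else "other"


def by_device(ticket: str) -> str:
    return "hardware" if "laptop" in ticket.lower() else "other"


base_classifiers = [by_vpn, by_connect, by_device]
prediction = majority_vote(base_classifiers, "Laptop cannot connect to VPN")
print(prediction)
```

Two of the three rules vote for "network", so the ensemble overrides the single "hardware" vote. A production system would also need a deliberate tie-breaking rule, which this sketch does not define.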

classifier, classifier model, ticket, (13 more...)

2103.15822

Country:

Asia > India > Karnataka (0.05)
Asia > Singapore (0.04)

Genre: Research Report (0.83)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.97)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.95)
(2 more...)

How I achieved 90% accuracy on a text classification problem with ZERO preprocessing

#artificialintelligence, Mar-27-2021, 17:45:22 GMT

I chose to use the AG News benchmark dataset. I retrieved the training and test sets from John Snow Labs (a must-see reference for all things NLP). This dataset is divided into four balanced categories for a total of 120,000 rows, as seen below. The dataset is formatted into two columns, category and description. Because I want this to be a succinct post, I will refer you to my previous article to find out how to use Spark NLP in Colab.

accuracy, bert sentence, text classification problem, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Grey-box Adversarial Attack And Defence For Sentiment Classification

Xu, Ying, Zhong, Xu, Yepes, Antonio Jimeno, Lau, Jey Han

arXiv.org Artificial Intelligence, Mar-22-2021

We introduce a grey-box adversarial attack and defence framework for sentiment classification. We address the issues of differentiability, label preservation and input reconstruction for adversarial attack and defence in one unified framework. Our results show that once trained, the attacking model is capable of generating high-quality adversarial examples substantially faster (one order of magnitude less in time) than state-of-the-art attacking methods. These examples also preserve the original sentiment according to human evaluation. Additionally, our framework produces an improved classifier that is robust in defending against multiple adversarial attacking methods. Code is available at: https://github.com/ibm-aur-nlp/adv-def-text-dist.

adversarial example, classifier, target model, (13 more...)

2103.11576

Country:

Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States (0.04)
North America > Canada (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report > New Finding (0.86)

Industry:

Information Technology > Security & Privacy (0.82)
Government > Military (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.86)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.71)

Text Classification using Transformers

#artificialintelligence, Mar-18-2021, 05:55:44 GMT

In this part, we will try to understand the Encoder-Decoder architecture of the Multi-Head Self-Attention Transformer network with some code in PyTorch. There won't be any theory involved (a better theoretical treatment can be found here), just the bare bones of the network and how one can write it from scratch in PyTorch. The Transformer architecture is divided into two parts: the Encoder and the Decoder. Each of these is built from several smaller components. Let's start with the Encoder.

decoder part, encoder part, text classification, (5 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)

An Amharic News Text classification Dataset

Azime, Israel Abebe, Mohammed, Nebil

arXiv.org Artificial Intelligence, Mar-10-2021

In NLP, text classification is one of the primary problems we try to solve, and its uses in language analysis are indisputable. The lack of labeled training data makes these tasks harder in low-resource languages like Amharic. Collecting, labeling, and annotating this kind of data, and making it useful, will encourage junior researchers, schools, and machine learning practitioners to implement existing classification models in their language. In this short paper, we introduce an Amharic text classification dataset that consists of more than 50k news articles categorized into 6 classes. The dataset is made available with simple baseline performances to encourage further studies and experiments aimed at better performance.

classification, dataset, text classification, (9 more...)

2103.05639

Country:

Asia > Middle East > Israel (0.05)
Africa > Ethiopia > Addis Ababa > Addis Ababa (0.05)

Genre: Research Report (0.40)

Industry: Media > News (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.32)