AITopics | Text Classification

Collaborating Authors

Text Classification

"A text classifier is an automated means of determining some metadata about a document. Text classifiers are used for such diverse needs as spam filtering, suggesting categories for indexing a document created in a content management system, or automatically sorting help desk requests."
– John Graham-Cumming, Naive Bayesian Text Classification. Dr. Dobb's. May 1 2005.

News Overviews Instructional Materials AI-Alerts Classics

Azimuth: Systematic Error Analysis for Text Classification

Gauthier-Melançon, Gabrielle, Ayala, Orlando Marquez, Brin, Lindsay, Tyler, Chris, Branchaud-Charron, Frédéric, Marinier, Joseph, Grande, Karine, Le, Di

arXiv.org Artificial IntelligenceDec-18-2022

We present Azimuth, an open-source and easy-to-use tool to perform error analysis for text classification. Compared to other stages of the ML development cycle, such as model training and hyper-parameter tuning, the process and tooling for the error analysis stage are less mature. However, this stage is critical for the development of reliable and trustworthy AI systems. To make error analysis more systematic, we propose an approach comprising dataset analysis and model quality assessment, which Azimuth facilitates. We aim to help AI practitioners discover and address areas where the model does not generalize by leveraging and integrating a range of ML techniques, such as saliency maps, similarity, uncertainty, and behavioral analyses, all in one tool. Our code and documentation are available at github.com/servicenow/azimuth.

machine learning, natural language, text classification, (20 more...)

arXiv.org Artificial Intelligence

2212.08216

Country:

Asia > China > Hong Kong (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Dominican Republic (0.04)
Europe > Italy (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.66)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.61)

Add feedback

Utilizing distilBert transformer model for sentiment classification of COVID-19's Persian open-text responses

Masoumi, Fatemeh Sadat, Bahrani, Mohammad

arXiv.org Artificial IntelligenceDec-16-2022

The COVID-19 pandemic has caused drastic alternations in human's life in all aspects. The government's laws in this regard affected the lifestyle of all people. Due to this fact studying about the sentiment of individuals is important to be aware of the future impacts of the coming pandemics. To contribute to this aim, we proposed a NLP (Natural Language Processing) model to analyze open-text answers in a survey in Persian and detect positive and negative feelings of the people in Iran. In this study, a distilBert transformer model was applied to take on this task. We deployed three approaches to perform comparison, and our best model could gain accuracy: 0.824, Precision: 0.824, Recall: 0.798 and F1score: 0.804.

machine learning, natural language, sentiment, (17 more...)

arXiv.org Artificial Intelligence

2212.08407

Country:

Asia > Middle East > Iran (0.24)
Asia > China (0.15)
Asia > Pakistan (0.04)
(2 more...)

Genre: Research Report (0.71)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.50)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.50)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.30)

Add feedback

Improve Text Classification Accuracy with Intent Information

Xie, Yifeng

arXiv.org Artificial IntelligenceDec-15-2022

In addition, existing text classification approaches only consider the utterances in the coarse granularity level, which In recent years, goal-oriented dialogue systems have been may less the possibility of the model to explore relationship widely applied in intelligent voice assistant, e.g., Apple Siri, between token-level information and label information. For Amazon Alexa, where intent classification technology plays a example, in Figure 1, if there is a token "When" in the crucial part. Given input utterance in natural language, the intent input sentence, then the "NUM" intent is more likely to be classification module aims to detect the user's intent [10], recognized as high correlation with it, while the "LOC" intent [11], [14], [23]. Previous works have been proposed for better does not.

machine learning, natural language, text classification, (18 more...)

arXiv.org Artificial Intelligence

2212.07649

Country:

North America > United States > New York (0.04)
Asia > China (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.89)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)
(2 more...)

Add feedback

FastClass: A Time-Efficient Approach to Weakly-Supervised Text Classification

Xia, Tingyu, Wang, Yue, Tian, Yuan, Chang, Yi

arXiv.org Artificial IntelligenceDec-14-2022

Weakly-supervised text classification aims to train a classifier using only class descriptions and unlabeled data. Recent research shows that keyword-driven methods can achieve state-of-the-art performance on various tasks. However, these methods not only rely on carefully-crafted class descriptions to obtain class-specific keywords but also require substantial amount of unlabeled data and takes a long time to train. This paper proposes FastClass, an efficient weakly-supervised classification approach. It uses dense text representation to retrieve class-relevant documents from external unlabeled corpus and selects an optimal subset to train a classifier. Compared to keyword-driven methods, our approach is less reliant on initial class descriptions as it no longer needs to expand each class description into a set of class-specific keywords. Experiments on a wide range of classification tasks show that the proposed approach frequently outperforms keyword-driven models in terms of classification accuracy and often enjoys orders-of-magnitude faster training speed.

classification, information retrieval, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2212.05506

Country:

North America > United States > North Carolina (0.04)
Asia > China > Jilin Province (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
(2 more...)

Add feedback

Structured Prompting: Scaling In-Context Learning to 1,000 Examples

Hao, Yaru, Sun, Yutao, Dong, Li, Han, Zhixiong, Gu, Yuxian, Wei, Furu

arXiv.org Artificial IntelligenceDec-13-2022

Large language models have exhibited intriguing in-context learning capability, achieving promising zero- and few-shot performance without updating the parameters. However, conventional in-context learning is usually restricted by length constraints, rendering it ineffective to absorb supervision from a large number of examples. In order to go beyond few shots, we introduce structured prompting that breaks the length limit and scales in-context learning to thousands of examples. Specifically, demonstration examples are separately encoded with well-designed position embeddings, and then they are jointly attended by the test example using a rescaled attention mechanism. So we can scale the number of exemplars with linear complexity instead of quadratic complexity with respect to length. Experimental results on a diverse set of tasks show that our approach improves end-task performance and reduces evaluation variance over conventional in-context learning as the number of demonstration examples increases. Code has been released at https://aka.ms/structured-prompting.

in-context learning, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2212.06713

Country:

North America > United States > Washington > King County > Seattle (0.28)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(6 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.47)

Add feedback

Moto: Enhancing Embedding with Multiple Joint Factors for Chinese Text Classification

Tang, Xunzhu, Zhu, Rujie, Sun, Tiezhu, Wang, Shi

arXiv.org Artificial IntelligenceDec-9-2022

Recently, language representation techniques have achieved great performances in text classification. However, most existing representation models are specifically designed for English materials, which may fail in Chinese because of the huge difference between these two languages. Actually, few existing methods for Chinese text classification process texts at a single level. However, as a special kind of hieroglyphics, radicals of Chinese characters are good semantic carriers. In addition, Pinyin codes carry the semantic of tones, and Wubi reflects the stroke structure information, \textit{etc}. Unfortunately, previous researches neglected to find an effective way to distill the useful parts of these four factors and to fuse them. In our works, we propose a novel model called Moto: Enhancing Embedding with \textbf{M}ultiple J\textbf{o}int Fac\textbf{to}rs. Specifically, we design an attention mechanism to distill the useful parts by fusing the four-level information above more effectively. We conduct extensive experiments on four popular tasks. The empirical results show that our Moto achieves SOTA 0.8316 ($F_1$-score, 2.11\% improvement) on Chinese news titles, 96.38 (1.24\% improvement) on Fudan Corpus and 0.9633 (3.26\% improvement) on THUCNews.

machine learning, natural language, text classification, (16 more...)

arXiv.org Artificial Intelligence

2212.08105

Country:

North America > United States > Florida > Orange County > Orlando (0.14)
Asia > China > Hubei Province > Wuhan (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.83)

Add feedback

G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks

Wan, Zhongwei, Yin, Yichun, Zhang, Wei, Shi, Jiaxin, Shang, Lifeng, Chen, Guangyong, Jiang, Xin, Liu, Qun

arXiv.org Artificial IntelligenceDec-7-2022

Recently, domain-specific PLMs have been proposed to boost the task performance of specific domains (e.g., biomedical and computer science) by continuing to pre-train general PLMs with domain-specific corpora. However, this Domain-Adaptive Pre-Training (DAPT; Gururangan et al. (2020)) tends to forget the previous general knowledge acquired by general PLMs, which leads to a catastrophic forgetting phenomenon and sub-optimal performance. To alleviate this problem, we propose a new framework of General Memory Augmented Pre-trained Language Model (G-MAP), which augments the domain-specific PLM by a memory representation built from the frozen general PLM without losing any general knowledge. Specifically, we propose a new memory-augmented layer, and based on it, different augmented strategies are explored to build the memory representation and then adaptively fuse it into the domain-specific PLM. We demonstrate the effectiveness of G-MAP on various domains (biomedical and computer science publications, news, and reviews) and different kinds (text classification, QA, NER) of tasks, and the extensive results show that the proposed G-MAP can achieve SOTA results on all tasks.

machine learning, natural language, plm, (17 more...)

arXiv.org Artificial Intelligence

2212.03613

Country:

Asia > Middle East > Jordan (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.88)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.67)

Add feedback

Learning Label Modular Prompts for Text Classification in the Wild

Chen, Hailin, Saha, Amrita, Joty, Shafiq, Hoi, Steven C. H.

arXiv.org Artificial IntelligenceDec-5-2022

Machine learning models usually assume i.i.d data during training and testing, but data and tasks in real world often change over time. To emulate the transient nature of real world, we propose a challenging but practical task: text classification in-the-wild, which introduces different non-stationary training/testing stages. Decomposing a complex task into modular components can enable robust generalisation under such non-stationary environment. However, current modular approaches in NLP do not take advantage of recent advances in parameter efficient tuning of pretrained language models. To close this gap, we propose MODULARPROMPT, a label-modular prompt tuning framework for text classification tasks. In MODULARPROMPT, the input prompt consists of a sequence of soft label prompts, each encoding modular knowledge related to the corresponding class label. In two of most formidable settings, MODULARPROMPT outperforms relevant baselines by a large margin demonstrating strong generalisation ability. We also conduct comprehensive analysis to validate whether the learned prompts satisfy properties of a modular representation.

machine learning, natural language, text classification, (16 more...)

arXiv.org Artificial Intelligence

2211.17142

Country:

Europe > Finland > Pirkanmaa > Tampere (0.04)
Europe > Sweden (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
(4 more...)

Genre: Research Report (0.64)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.81)

Add feedback

Meta Learning for Few-Shot Medical Text Classification

Sharma, Pankaj, Qureshi, Imran, Tran, Minh

arXiv.org Artificial IntelligenceDec-3-2022

Medical professionals frequently work in a data constrained setting to provide insights across a unique demographic. A few medical observations, for instance, informs the diagnosis and treatment of a patient. This suggests a unique setting for meta-learning, a method to learn models quickly on new tasks, to provide insights unattainable by other methods. We investigate the use of meta-learning and robustness techniques on a broad corpus of benchmark text and medical data. To do this, we developed new data pipelines, combined language models with meta-learning approaches, and extended existing meta-learning algorithms to minimize worst case loss. We find that meta-learning on text is a suitable framework for text-based data, providing better data efficiency and comparable performance to few-shot language models and can be successfully applied to medical note data. Furthermore, meta-learning models coupled with DRO can improve worst case loss across disease codes.

machine learning, natural language, text classification, (19 more...)

arXiv.org Artificial Intelligence

2212.01552

Country:

Asia > Middle East > Israel (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)

Genre:

Research Report (1.00)
Instructional Material > Online (0.40)
Instructional Material > Course Syllabus & Notes (0.40)

Industry:

Health & Medicine > Health Care Providers & Services (0.70)
Health & Medicine > Health Care Technology > Medical Record (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)

Add feedback

Generalised Spherical Text Embedding

Banerjee, Souvik, Mishra, Bamdev, Jawanpuria, Pratik, Shrivastava, Manish

arXiv.org Artificial IntelligenceNov-30-2022

This paper aims to provide an unsupervised modelling approach that allows for a more flexible representation of text embeddings. It jointly encodes the words and the paragraphs as individual matrices of arbitrary column dimension with unit Frobenius norm. The representation is also linguistically motivated with the introduction of a novel similarity metric. The proposed modelling and the novel similarity metric exploits the matrix structure of embeddings. We then go on to show that the same matrices can be reshaped into vectors of unit norm and transform our problem into an optimization problem over the spherical manifold. We exploit manifold optimization to efficiently train the matrix embeddings. We also quantitatively verify the quality of our text embeddings by showing that they demonstrate improved results in document classification, document clustering, and semantic textual similarity benchmark tests.

machine learning, natural language, text classification, (19 more...)

arXiv.org Artificial Intelligence

2211.16801

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
(4 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.49)
(2 more...)

Add feedback