Goto

Collaborating Authors

 Text Classification


Fine-tuned Language Models for Text Classification

arXiv.org Machine Learning

Transfer learning has revolutionized computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Fine-tuned Language Models (FitLaM), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a state-of-the-art language model. Our method significantly outperforms the state-of-the-art on five text classification tasks, reducing the error by 18-24% on the majority of datasets. We open-source our pretrained models and code to enable adoption by the community.


Email Spam Filtering : A python implementation with scikit-learn

@machinelearnbot

This article was written by ML bot2 on Machine Learning in Action. Text mining (deriving information from text) is a wide field which has gained popularity with the huge text data being generated. Automation of a number of applications like sentiment analysis, document classification, topic classification, text summarization, machine translation, etc has been done using machine learning models. Spam filtering is a beginner's example of document classification task which involves classifying an email as spam or non-spam (a.k.a. Spam box in your Gmail account is the best example of this.


Automated Text Classification Using Machine Learning

#artificialintelligence

Digitization has changed the way we process and analyze information. There is an exponential increase in online availability of information. From web pages to emails, science journals, e-books, learning content, news and social media are all full of textual data. The idea is to create, analyze and report information fast. This is when automated text classification steps up.


A framework for automated rating of online reviews against the underlying topics

@machinelearnbot

Even though the most online review systems offer star rating in addition to free text reviews, this only applies to the overall review. However, different users may have different preferences in relation to different aspects of a product or a service and may struggle to extract relevant information from a massive amount of consumer reviews available online. In this paper, we present a framework for extracting prevalent topics from online reviews and automatically rating them on a 5-star scale. It consists of five modules, including linguistic pre-processing, topic modelling, text classification, sentiment analysis, and rating. Topic modelling is used to extract prevalent topics, which are then used to classify individual sentences against these topics.


Aggressive Sampling for Multi-class to Binary Reduction with Applications to Text Classification

Neural Information Processing Systems

We address the problem of multi-class classification in the case where the number of classes is very large. We propose a double sampling strategy on top of a multi-class to binary reduction strategy, which transforms the original multi-class problem into a binary classification problem over pairs of examples. The aim of the sampling strategy is to overcome the curse of long-tailed class distributions exhibited in majority of large-scale multi-class classification problems and to reduce the number of pairs of examples in the expanded data. We show that this strategy does not alter the consistency of the empirical risk minimization principle defined over the double sample reduction. Experiments are carried out on DMOZ and Wikipedia collections with 10,000 to 100,000 classes where we show the efficiency of the proposed approach in terms of training and prediction time, memory consumption, and predictive performance with respect to state-of-the-art approaches.


Tensorflow Docker Production ready AI product – #WeCoCreate – Medium

#artificialintelligence

Everyone is talking about training the Deep Learning models and fine tuning them but very few talks about the deployment and the scalability aspects. In BotSupply, we focus not only on building accurate Machine Learning models, but also on delivering them to the clients with the greater efficiency. In this article, we will learn to deploy a sentiment analysis model trained on "Character-level Convolutional Networks for Text Classification" (Xiang Zhang, Junbo Zhao, Yann LeCun) which uses character-level ConvNet networks for text classification. Check out his great blog post on CNN classification. As explained in the above blog about the training process, I am pre-assuming that you have already trained your sentiment analysis model.


[D] Max-over-time pooling vs no max-pooling for text classification? • r/MachineLearning

@machinelearnbot

Kim 2014 and Collobert 2011 argue that max-over-time pooling helps getting the words from a sentence that are most important to the semantics. Then I read a blog post from the Googler Lakshmanan V on text classification. The author argues that spatial invariance isn't wanted because it's important where words are placed in a sentence. Thus he doesn't recommend maxpool. Are there empirical studies that compares the two approaches?


TensorFlow for R

@machinelearnbot

You'll work with the IMDB dataset: a set of 50,000 highly polarized reviews from the Internet Movie Database. They're split into 25,000 reviews for training and 25,000 reviews for testing, each set consisting of 50% negative and 50% positive reviews. Because you should never test a machine-learning model on the same data that you used to train it! Just because a model performs well on its training data doesn't mean it will perform well on data it has never seen; and what you care about is your model's performance on new data (because you already know the labels of your training data – obviously you don't need your model to predict those). For instance, it's possible that your model could end up merely memorizing a mapping between your training samples and their targets, which would be useless for the task of predicting targets for data the model has never seen before.


On Extending Neural Networks with Loss Ensembles for Text Classification

arXiv.org Machine Learning

Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through the gradient propagation optimization method of the neural network. The approach is evaluated on several text classification datasets. We also evaluate its performance in various environments with several degrees of label noise. Experimental results indicate an improvement of the results and strong resilience against label noise in comparison with other methods.


Advanced Topics: Classification with Spotfire

@machinelearnbot

How can I predict my customer base? In this webinar, we'll answer real data science questions like this using Spotfire and TERR to make smarter decisions. For our next webinar, we'll be managing a hotel's marketing group, using classification methods inside of Spotfire. This is the fourth step in our five-part webinar series called the Building Blocks of Data Science. In this series, we will explore solving real data science questions using Spotfire and TERR.