AITopics | Text Classification


Text Classification

"A text classifier is an automated means of determining some metadata about a document. Text classifiers are used for such diverse needs as spam filtering, suggesting categories for indexing a document created in a content management system, or automatically sorting help desk requests."
– John Graham-Cumming, Naive Bayesian Text Classification. Dr. Dobb's. May 1 2005.
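As a rough illustration of the spam-filtering use case named in the quote, here is a minimal sketch of a naive Bayes text classifier using only the Python standard library. The class name `NaiveBayes`, the whitespace tokenizer, the add-one smoothing, and the four training messages are all assumptions made for this example; they are not taken from the quoted article.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Lowercase whitespace tokenization; a real filter would do much more.
    return text.lower().split()


class NaiveBayes:
    """Hypothetical multinomial naive Bayes classifier with add-one smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter()              # label -> number of documents
        self.vocab = set()

    def train(self, text, label):
        self.doc_counts[label] += 1
        for word in tokenize(text):
            self.word_counts[label][word] += 1
            self.vocab.add(word)

    def classify(self, text):
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, float("-inf")
        for label, n_docs in self.doc_counts.items():
            # Start from the log prior of the label.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for word in tokenize(text):
                # Add-one (Laplace) smoothing avoids log(0) for unseen words.
                count = self.word_counts[label][word]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy training data, invented for this sketch.
clf = NaiveBayes()
clf.train("win money now", "spam")
clf.train("cheap money offer", "spam")
clf.train("meeting agenda for monday", "ham")
clf.train("project status meeting", "ham")

print(clf.classify("cheap money"))
print(clf.classify("monday project meeting"))
```

Working in log space keeps the product of many small word probabilities from underflowing, which is a common choice for this kind of classifier.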


[1607.01759] Bag of Tricks for Efficient Text Classification

#artificialintelligence Jul-7-2016, 21:25:51 GMT


artificial intelligence, efficient text classification, natural language, (2 more...)

#artificialintelligence

Genre: Research Report (0.90)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)


Implementing a CNN for Text Classification in TensorFlow

#artificialintelligence Jul-6-2016, 19:45:45 GMT

Another TensorFlow feature you typically want to use is checkpointing: saving the parameters of your model so you can restore them later. Checkpoints can be used to continue training at a later point, or to pick the best parameter settings using early stopping. Checkpoints are created using a Saver object. Before we can train our model we also need to initialize the variables in our graph. The initialize_all_variables function is a convenience function that runs all of the initializers we've defined for our variables. You can also call the initializers of your variables manually. That's useful if you want to initialize your embeddings with pre-trained values, for example. Let's now define a function for a single training step, evaluating the model on a batch of data and updating the model parameters.
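The idea of checkpointing combined with early stopping can be sketched without TensorFlow. The example below is a plain-Python illustration under stated assumptions, not the tutorial's code and not the TensorFlow API: `save_checkpoint`, `load_checkpoint`, `train_step`, and `loss` are hypothetical helpers, the "model" is a single weight `w`, and the training and validation targets are invented so that validation loss first improves and then worsens.

```python
import json
import os
import tempfile


def save_checkpoint(path, params, step):
    # Persist the parameters and the step they were produced at.
    with open(path, "w") as f:
        json.dump({"step": step, "params": params}, f)


def load_checkpoint(path):
    with open(path) as f:
        state = json.load(f)
    return state["params"], state["step"]


def loss(params, x, y):
    # Squared error of a one-weight linear model.
    return (params["w"] * x - y) ** 2


def train_step(params, x, y, lr=0.1):
    # One gradient-descent update on the squared error above.
    grad = 2 * (params["w"] * x - y) * x
    return {"w": params["w"] - lr * grad}


params = {"w": 0.0}  # explicit initialization before training
ckpt_path = os.path.join(tempfile.mkdtemp(), "model.json")
best_val, patience, bad_steps = float("inf"), 3, 0

for step in range(1, 51):
    params = train_step(params, x=1.0, y=2.0)  # training target (assumed)
    val = loss(params, x=1.0, y=1.5)           # validation target (assumed)
    if val < best_val:
        # Validation improved: checkpoint the current parameters.
        best_val, bad_steps = val, 0
        save_checkpoint(ckpt_path, params, step)
    else:
        bad_steps += 1
        if bad_steps >= patience:
            break  # early stopping: no improvement for `patience` steps

# Restore the best parameters seen during training.
best_params, best_step = load_checkpoint(ckpt_path)
print(best_step, best_params)
```

Only the best-scoring parameters are kept on disk, so restoring the checkpoint after the loop recovers the model from before validation loss began to rise.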

machine learning, natural language, text classification, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.41)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.31)


Text Classification and Sentiment Analysis

#artificialintelligence Jun-28-2016, 21:22:16 GMT

For a more technical explanation, see the two articles linked in the original post. The post also links to a good explanation and a list of the most commonly used kernel functions.

machine learning, natural language, text classification, (16 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.68)
(3 more...)


Interactive Semantic Featuring for Text Classification

Jandot, Camille | Simard, Patrice | Chickering, Max | Grangier, David | Suh, Jina

arXiv.org Machine Learning Jun-23-2016

In text classification, dictionaries can be used to define human-comprehensible features. We propose an improvement to dictionary features called smoothed dictionary features. These features recognize document contexts instead of n-grams. We describe a principled methodology to solicit dictionary features from a teacher, and present results showing that models built using these human-comprehensible features are competitive with models trained with Bag of Words features.

machine learning, natural language, text classification, (16 more...)

arXiv.org Machine Learning

1606.07545

Country: North America > United States > Wisconsin (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Education (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.72)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.49)


Sentiment classification on node level for RNTN and SVN • /r/MachineLearning

@machinelearnbot Jun-13-2016, 17:10:55 GMT

I have a question regarding this paper (http://nlp.stanford.edu/ In the paper there are some results on page 7 in Table 1. There are results for All and Root. For the All results they use the results of all nodes of the tree. For the Root results they use the results at the sentence level.

artificial intelligence, sentiment classification, text classification, (4 more...)

@machinelearnbot

Country: North America > United States > California > Santa Clara County > Palo Alto (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.40)


Supervised Learning for Document Classification with Scikit-Learn - QuantStart

#artificialintelligence Jun-1-2016, 07:11:56 GMT

This is the first article in what will become a set of tutorials on how to carry out natural language document classification, for the purposes of sentiment analysis and, ultimately, automated trade filter or signal generation. This particular article will make use of Support Vector Machines (SVM) to classify text documents into mutually exclusive groups. Since this is the first article written in 2015, I feel it is now time to move on from Python 2.7.x and make use of the latest 3.4.x. Hence all code in this article will be written with 3.4.x in mind. There are a significant number of steps to carry out between viewing a text document on a web site, say, and using its content as an input to an automated trading strategy to generate trade filters or signals. In this particular article we will avoid discussion of how to download multiple articles from external sources and instead make use of a given dataset that already comes with its own labels. This will allow us to concentrate on the implementation of the "classification pipeline", rather than spend a substantial amount of time obtaining and tagging documents. In subsequent articles in this series we will make use of Python libraries, such as Scrapy and BeautifulSoup, to automatically obtain many web-based articles and effectively extract their text-based data from the HTML.

classifier, machine learning, natural language, (18 more...)

#artificialintelligence

Country:

Asia > Thailand (0.04)
Asia > Japan (0.04)

Industry: Banking & Finance > Trading (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.59)


Text Analysis 101; A Basic Understanding for Business Users: Document Classification

@machinelearnbot May-28-2016, 16:39:34 GMT

This blog was originally posted as part of our Text Analysis 101 blog series. It aims to explain how the classification of text works as part of Natural Language Processing. The automatic classification of documents is an example of how Machine Learning (ML) and Natural Language Processing (NLP) can be leveraged to enable machines to better understand human language. By classifying text, we are aiming to assign one or more classes or categories to a document or piece of text, making it easier to manage and sort the documents. Manually categorizing and grouping text sources can be extremely laborious and time-consuming, especially for publishers, news sites, blogs or anyone who deals with a lot of content.

category, natural language, text classification, (15 more...)

@machinelearnbot

Country: North America > United States > New York (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)


100 Machine Learning videos you can't find in Google • /r/MachineLearning

#artificialintelligence May-14-2016, 00:16:17 GMT

Serious answer: I tend to dive deep into a particular algorithm...learning the math better, getting used to different applications of it, etc. So that's where I usually spend my time - along with the advice /u/Jigsus offered...focusing my learning around the kinds of needs I'm working on problem-/data-wise. Sounds like survival analysis, so I try to find as much material focused around that. On the flip side, I haven't done anything like sentiment analysis, so I know next to nothing about Naive Bayes text classification. I tend to read over a rather wide selection of ML and statistics blogs, so I'm not entirely unclear about such things, it's just that I don't spend a copious amount of time other than playing with a toy dataset now and then.

machine learning video, natural language, text classification, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.66)


Bulletin April/May 2013

#artificialintelligence May-8-2016, 10:35:46 GMT

Specifically, the assignment of meaningful tags (annotations) to each unique data granule is best achieved through collaborative participation of data providers, curators and end users to augment and validate the results derived from machine learning (data mining) classification algorithms. The annotations provide curation, provenance and semantic (scientifically meaningful) metadata about the data source and the data object being studied. The design and specification of a unique, meaningful, searchable and scientifically impactful set of tags can be achieved through collaborative (human-plus-machine) annotation efforts and through discovery informatics research. These steps will produce a searchable classification and indexing scheme for the curation, classification, discovery, reuse, interoperability, integration and understanding of digital repositories.

annotation, artificial intelligence, data management, (23 more...)

#artificialintelligence

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.58)


Sentiment Classification Using Negation as a Proxy for Negative Sentiment

Ohana, Bruno (Dublin Institute of Technology) | Tierney, Brendan (Dublin Institute of Technology) | Delany, Sarah Jane (Dublin Institute of Technology)

AAAI Conferences May-8-2016

We explore the relationship between negated text and negative sentiment in the task of sentiment classification. We propose a novel adjustment factor based on negation occurrences as a proxy for negative sentiment that can be applied to lexicon-based classifiers equipped with a negation detection pre-processing step. We performed an experiment on a multi-domain customer reviews dataset obtaining accuracy improvements over a baseline, and we further improved our results using out-of-domain data to calibrate the adjustment factor. We see future work possibilities in exploring negation detection refinements, and expanding the experiment to a broader spectrum of opinionated discourse, beyond that of customer reviews.
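To make the general idea concrete, here is a minimal sketch of a lexicon-based classifier with a simple negation-detection step and a penalty proportional to the number of negations. It is not the authors' method: the abstract does not give their formula, so the lexicon, the negator list, the one-token negation scope, and the factor `ALPHA` are all hypothetical values chosen for illustration.

```python
# Hypothetical polarity lexicon and negator list, invented for this sketch.
LEXICON = {"good": 1.0, "great": 2.0, "bad": -1.0, "terrible": -2.0}
NEGATORS = {"not", "no", "never"}
ALPHA = 0.5  # hypothetical adjustment factor applied per negation


def score(text, alpha=ALPHA):
    total, negations, negated = 0.0, 0, False
    for token in text.lower().split():
        if token in NEGATORS:
            # Negation detection: count the negator and flip the next token.
            negations += 1
            negated = True
            continue
        polarity = LEXICON.get(token, 0.0)
        total += -polarity if negated else polarity
        negated = False  # assumed scope: only the token right after a negator
    # Treat each negation occurrence as weak evidence of negative sentiment.
    return total - alpha * negations


def classify(text):
    return "positive" if score(text) >= 0 else "negative"


print(score("not good"), classify("not good"))
print(score("great phone"), classify("great phone"))
```

In a sketch like this, `alpha` is the tunable part; the abstract's mention of calibrating on out-of-domain data suggests such a factor would be fitted on held-out reviews rather than fixed by hand.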

negation, negative sentiment, sentiment classification, (1 more...)

AAAI Conferences

The Twenty-Ninth International Flairs Conference

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.60)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.60)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.60)
