Online Learning for Latent Dirichlet Allocation

Neural Information Processing Systems

We develop an online variational Bayes (VB) algorithm for Latent Dirichlet Allocation (LDA). Online LDA is based on online stochastic optimization with a natural gradient step, which we show converges to a local optimum of the VB objective function. It can handily analyze massive document collections, including those arriving in a stream. We study the performance of online LDA in several ways, including by fitting a 100-topic topic model to 3.3M articles from Wikipedia in a single pass. We demonstrate that online LDA finds topic models as good or better than those found with batch VB, and in a fraction of the time.


Learning Machine Learning

#artificialintelligence

Machine learning is a hot topic for developers, but where can one learn about how to use the technology? A lot depends on your current background and your long-term goals. I have already written about the basic differences between machine-learning techniques, but this was done at a relatively high level. Getting into the details can range from learning about machine-learning methodologies at an abstract level to examining deep-learning frameworks used to develop applications. Here, we'll take a more detailed look at some of the online resources available to you, and include links to websites with much more information about machine-learning classes, frameworks, and resources.


A framework for automated rating of online reviews against the underlying topics

@machinelearnbot

Even though the most online review systems offer star rating in addition to free text reviews, this only applies to the overall review. However, different users may have different preferences in relation to different aspects of a product or a service and may struggle to extract relevant information from a massive amount of consumer reviews available online. In this paper, we present a framework for extracting prevalent topics from online reviews and automatically rating them on a 5-star scale. It consists of five modules, including linguistic pre-processing, topic modelling, text classification, sentiment analysis, and rating. Topic modelling is used to extract prevalent topics, which are then used to classify individual sentences against these topics.


TACAM: Topic And Context Aware Argument Mining

arXiv.org Machine Learning

In this work we address the problem of argument search. The purpose of argument search is the distillation of pro and contra arguments for requested topics from large text corpora. In previous works, the usual approach is to use a standard search engine to extract text parts which are relevant to the given topic and subsequently use an argument recognition algorithm to select arguments from them. The main challenge in the argument recognition task, which is also known as argument mining, is that often sentences containing arguments are structurally similar to purely informative sentences without any stance about the topic. In fact, they only differ semantically. Most approaches use topic or search term information only for the first search step and therefore assume that arguments can be classified independently of a topic. We argue that topic information is crucial for argument mining, since the topic defines the semantic context of an argument. Precisely, we propose different models for the classification of arguments, which take information about a topic of an argument into account. Moreover, to enrich the context of a topic and to let models understand the context of the potential argument better, we integrate information from different external sources such as Knowledge Graphs or pre-trained NLP models. Our evaluation shows that considering topic information, especially in connection with external information, provides a significant performance boost for the argument mining task.


The Evolving State of AI-Supplemented Computer-Assisted Instruction

#artificialintelligence

The traditional CAI Computer-Assisted Instruction) system depends on the instructors who provide the course material and decide the criteria of evaluation for the students. The advanced versions we have these days have a'reactive learning environment' – where students are actively engaged in their online learning programs. The latter systems employ AI (Artificial Intelligence) tools and techniques to take students' interests and performance factors into account and proceed with tutorial dialogues accordingly. Hence, they are known as AICAI (Artificial Intelligence Computer-Assisted Instruction) System), or simply ICAI for intelligent CAI. Such AICAIs include a domain expert component (which knows all about the topic that is being taught), a student model that can analyze the responses of the learners and decode their knowledge levels as well as misconceptions, and a component which contains information on appropriate teaching strategies in different scenarios.