University of Illinois
Dual-Clustering Maximum Entropy with Application to Classification and Word Embedding
Wang, Xiaolong (University of Illinois ) | Wang, Jingjing (University of Illinois) | Zhai, Chengxiang (University of Illinois)
Maximum Entropy (ME), as a general-purpose machine learning model, has been successfully applied to various fields such as text mining and natural language processing. It has been used as a classification technique and recently also applied to learn word embedding. ME establishes a distribution of the exponential form over items (classes/words). When training such a model, learning efficiency is guaranteed by globally updating the entire set of model parameters associated with all items at each training instance. This creates a significant computational challenge when the number of items is large. To achieve learning efficiency with affordable computational cost, we propose an approach named Dual-Clustering Maximum Entropy (DCME). Exploiting the primal-dual form of ME, it conducts clustering in the dual space and approximates each dual distribution by the corresponding cluster center. This naturally enables a hybrid online-offline optimization algorithm whose time complexity per instance only scales as the product of the feature/word vector dimensionality and the cluster number. Experimental studies on text classification and word embedding learning demonstrate that DCME effectively strikes a balance between training speed and model quality, substantially outperforming state-of-the-art methods.
Incidental Supervision: Moving beyond Supervised Learning
Roth, Dan (University of Illinois)
Machine Learning and Inference methods have become ubiquitous in our attempt to induce more abstract representations of natural language text, visual scenes, and other messy, naturally occurring data, and support decisions that depend on it. However, learning models for these tasks is difficult partly because generating the necessary supervision signals for it is costly and does not scale. This paper describes several learning paradigms that are designed to alleviate the supervision bottleneck. It will illustrate their benefit in the context of multiple problems, all pertaining to inducing various levels of semantic representations from text. In particular, we discuss (i) esponse Driven Learning of models, a learning protocol that supports inducing meaning representations simply by observing the model's behavior in its environment, (ii) the exploitation of Incidental Supervision signals that exist in the data, independently of the task at hand, to learn models that identify and classify semantic predicates, and (iii) the use of weak supervision to combine simple models to support global decisions where joint supervision is not available.
Surpassing Humans and Computers with JELLYBEAN: Crowd-Vision-Hybrid Counting Algorithms
Sarma, Akash Das (Stanford University) | Jain, Ayush (University of Illinois) | Nandi, Arnab (The Ohio State University) | Parameswaran, Aditya (University of Illinois) | Widom, Jennifer (Stanford University)
Counting objects is a fundamental image processisng primitive, and has many scientific, health, surveillance, security, and military applications. Existing supervised computer vision techniques typically require large quantities of labeled training data, and even with that, fail to return accurate results in all but the most stylized settings. Using vanilla crowdsourcing, on the other hand, can lead to significant errors, especially on images with many objects. In this paper, we present our JellyBean suite of algorithms, that combines the best of crowds and computer vision to count objects in images, and uses judicious decomposition of images to greatly improve accuracy at low cost. Our algorithms have several desirable properties: (i) they are theoretically optimal or near-optimal , in that they ask as few questions as possible to humans (under certain intuitively reasonable assumptions that we justify in our paper experimentally); (ii) they operate under stand-alone or hybrid modes, in that they can either work independent of computer vision algorithms, or work in concert with them, depending on whether the computer vision techniques are available or useful for the given setting; (iii) they perform very well in practice, returning accurate counts on images that no individual worker or computer vision algorithm can count correctly, while not incurring a high cost.
Structure and Intractability of Optimal Multi-Robot Path Planning on Graphs
Yu, Jingjin (University of Illinois) | LaValle, Steven M. (University of Illinois)
In this paper, we study the structure and computational complexity of optimal multi-robot path planning problems on graphs. Our results encompass three formulations of the discrete multi-robot path planning problem, including a variant that allows synchronous rotations of robots along fully occupied, disjoint cycles on the graph. Allowing rotation of robots provides a more natural model for multi-robot path planning because robots can communicate. Our optimality objectives are to minimize the total arrival time, the makespan (last arrival time), and the total distance. On the structure side, we show that, in general, these objectives demonstrate a pairwise Pareto optimal structure and cannot be simultaneously optimized. On the computational complexity side, we extend previous work and show that, regardless of the underlying multi-robot path planning problem, these objectives are all intractable to compute. In particular, our NP-hardness proof for the time optimal versions, based on a minimal and direct reduction from the 3-satisfiability problem, shows that these problems remain NP-hard even when there are only two groups of robots (i.e. robots within each group are interchangeable).
On the Identification of Humor Markers in Computer-Mediated Communication
Adams, Audrey Claire (University of Illinois)
This study presents a quantitative analysis of humor markers in computer-mediated communication (CMC). The data for this analysis consists of naturally occurring asynchronous CMC interactions from a public fan forum. Posts were tagged and coded as either humorous or non-humorous, and each individual humorous unit was coded as being one of 8 specific forms of humor. Next, each post was tagged and coded for the use of linguistic markers in the following categories: Punctuation, formatting, emoticons, laughter, and explicit. Descriptive and inferential statistics determined the following in the present data set: 1) Markers from each of the 5 marker categories occurred significantly more in humorous than non- humorous turns (p > 0.001); 2) Each of the 8 forms of humor present in the data were tested for the use of each marker-type, which suggests the existence of correlations between the iconic use of formatting in hyperbole (p > 0.001), the use of laughter in jocularity (p = 0.019) and insult (p = 0.024), and the use of emoticon in jocularity (p = 0.031); and 3) Humorous units which used humor markers gained significantly more humor response than unmarked humorous units (p > 0.001). These results provide a better understanding of features potentially related to the automated identification of humor.
Can a Robot Learn Language as a Child Does?
Levinson, Stephen Eliot (University of Illinois) | Niehaus, Logan (Beckman Institute) | Majure, Lydia (Beckman Institute) | Silver, Aaron (Beckman Institute) | Wendt, Luke (Beckman Institute)
This paper gives a brief retrospective of a research project begun in 1987 and continuing to the present on the topic of language acquisition by an autonomous humanoid robot. We recount the motivations for, theoretical bases of and experimental results on this subject. Important results include novel models and algorithms resulting in interesting linguistic function of our robots.
Toward Social Causality: An Analysis of Interpersonal Relationships in Online Blogs and Forums
Girju, Roxana (University of Illinois)
In this paper we present encouraging preliminary results into the problem of social causality (causal reasoning used by intelligent agents in a social environment) in online social interactions based on a model of reciprocity. At every level, social relationships are guided by the shared understanding that most actions call for appropriate reactions, and that inappropriate reactions require management. Thus, we present an analysis of interpersonal relationships in English reciprocal contexts. Specifically, we rely here on a large and recently built database of 10,882 reciprocal relation instances in online media. The resource is analyzed along a set of novel and important dimensions: symmetry, affective value, gender}, and {\em intentionality of action which are highly interconnected. At a larger level, we automatically generate {\em chains of causal relations} between verbs indicating interpersonal relationships. Statistics along these dimensions give insights into people's behavior, judgments, and thus their social interactions.
Autonomous and Semiautonomous Control Simulator
Burns, Chad Raymond (University of Illinois) | Zearing, Joseph (University of Illinois) | Wang, Ranxiao Frances (University of Illinois) | Stipanovic, Dusan (University of Illinois)
This paper presents a simulator that is being developed to study the performance of certain types of vehicle navigation. The performance metric looks at a likelihood of accomplishing a task and the cost of the strategy – measuring both robustness and efficiency. We present results involving only autonomous control strategies, yet the simulator will be used to compare human performance in completing the same task.