Goto

Collaborating Authors

 true quality




Counterbalancing Learning and Strategic Incentives in Allocation Markets

Neural Information Processing Systems

Motivated by the high discard rate of donated organs in the United States, we study an allocation problem in the presence of learning and strategic incentives. We consider a setting where a benevolent social planner decides whether and how to allocate a single indivisible object to a queue of strategic agents. The object has a common true quality, good or bad, which is ex-ante unknown to everyone. Each agent holds an informative, yet noisy, private signal about the quality.


Learning-Theoretic Foundations of Algorithm Configuration for Combinatorial Partitioning Problems

arXiv.org Artificial Intelligence

Max-cut, clustering, and many other partitioning problems that are of significant importance to machine learning and other scientific fields are NP-hard, a reality that has motivated researchers to develop a wealth of approximation algorithms and heuristics. Although the best algorithm to use typically depends on the specific application domain, a worst-case analysis is often used to compare algorithms. This may be misleading if worst-case instances occur infrequently, and thus there is a demand for optimization methods which return the algorithm configuration best suited for the given application's typical inputs. We address this problem for clustering, max-cut, and other partitioning problems, such as integer quadratic programming, by designing computationally efficient and sample efficient learning algorithms which receive samples from an application-specific distribution over problem instances and learn a partitioning algorithm with high expected performance. Our algorithms learn over common integer quadratic programming and clustering algorithm families: SDP rounding algorithms and agglomerative clustering algorithms with dynamic programming. For our sample complexity analysis, we provide tight bounds on the pseudodimension of these algorithm classes, and show that surprisingly, even for classes of algorithms parameterized by a single parameter, the pseudo-dimension is superconstant. In this way, our work both contributes to the foundations of algorithm configuration and pushes the boundaries of learning theory, since the algorithm classes we analyze consist of multi-stage optimization procedures and are significantly more complex than classes typically studied in learning theory.


Ranked Voting on Social Networks

AAAI Conferences

They pinpoint families of voting rules that exhibit robustness: they are accurate in the limit with respect to a wide Classic social choice theory assumes that votes are range of noise models, which govern the way noisy votes are independent (but possibly conditioned on an underlying generated, given the ground truth [Caragiannis et al., 2013; objective ground truth). This assumption 2014]. is unrealistic in settings where the voters are connected While these results are promising, they rely on a crucial via an underlying social network structure, modeling assumption: votes are independent. This assumption as social interactions lead to correlated votes. We is clearly satisfied in some settings -- when votes are establish a general framework -- based on random submitted by computer Go programs [Jiang et al., 2014], say.


Star Quality: Aggregating Reviews to Rank Products and Merchants

AAAI Conferences

Given a set of reviews of products or merchants from a wide range of authors and several reviews websites, how can we measure the true quality of the product or merchant?  How do we remove the bias of individual authors or sources?  How do we compare reviews obtained from different websites, where ratings may be on different scales (1-5 stars, A/B/C, etc.)?  How do we filter out unreliable reviews to use only the ones with ``star quality''?  Taking into account these considerations, we analyze data sets from a variety of different reviews sites (the first paper, to our knowledge, to do this). These data sets include 8 million product reviews and 1.5 million merchant reviews. We explore statistic- and heuristic- based models for estimating the true quality of a product or merchant, and compare the performance of these estimators on the task of ranking pairs of objects.  We also apply the same models to the task of using Netflix ratings data to rank pairs of movies, and discover that the performance of the different models is surprisingly similar on this data set.