Goto

Collaborating Authors

 Overview


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper considers weighted majority algorithm and establishes consistency (error rate of the aggregator tending to zero) results under two settings: (1) when the competence level (risk of each expert) is known in advance and (2) when it is estimated. For case (2), frequentist and Bayesian methods for estimating the competence level are provided. For case (1), consistency is established in terms of providing upper and lower bounds on the error rate of the aggregator, which involve standard calculations ( apart from the fact that upper bound is established by invoking a result by Kearns and Saul, instead of Hoeffding's inequality). For case (2) under the frequentist setting, an independent set of labeled inputs is used to estimate the competence level of each expert.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. Paper ID: 39 Title: Robust Classification Under Sample Selection Bias NOTE: Due to the short reviewing time and out of fairness to the other papers I was reviewing, I DID NOT do more than glance over the supplementary material. In most cases, I therefore did not verify that results claimed by the authors are correct, but instead checked that they are plausible. Summary: This paper is about how to adjust a classifier when the training set is not representative of the test set; a canonical example is active learning, but the problem can also appear in recalibration. The goal is to find an effective method in the finite-sample regime, since most results are asymptotics.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper addresses the issue of object detection, in particular the challenge of obtaining bounding boxes on a scale similar to which category labels exist for object categorization. The authors side-step this challenge by proposing to adapt object classifiers for the detection task. Their algorithm is fairly simple and straightforward, which is not a bad thing in itself. Their experimental protocol uses 100 categories for training (with both category labels and bounding boxes), and tests on 100 left-out categories.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The paper studies the statistical consistency of plug in classifiers under non decomposable loss functions such as the F statistic which is a popular performance measure in machine learning. The problem studied in this paper is complex because non decomposable measures cannot, by definition, be expressed as an empirical expectation. Therefore, usual concentration inequalities are not applicable in this scenario. The authors present a general analysis for measures that can be expressed as a continuous function of the true positive rate and the true negative rate as well as the class probability.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper studies the worst case hardness of estimating the parameters of a binary pairwise undirected graphical model. By considering the specific case of the hard-core / independent set model and relying on the known result that approximating the partition function for this problem given the parameters (even for the unweighted case of all theta_i being 0) is hard, the authors show that the other direction -- namely, approximating the parameters given the node marginals -- is hard, in the sense that it does not admit an FPRAS. This is a strong theoretical paper, addressing a computational complexity problem that occurs very often in theory and practice, has been conjectured to be hard, but hadn't yet been shown formally to be hard (to the best of my and the authors' knowledge). The work is unlikely to have much impact in practice.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper proposes a denoising algorithm based on non-local image statistics and patch repetition by combining the advantages of NL-means and Exponentially Weighted Aggregation (EWA). The computation of the aggregated estimator is done using MCMC and results are comparable to state-of-the-art algorithms. Pluses: 1) the method seems simple and straightforward to implement. Minus: In terms of explaining *why* the method works, the text leaves something to be desired. E.g., in the second paragraph of section 7 (The proposed implementation proceeds in two identical iterations.)


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The paper presents a method for learning multiple tasks in parallel where at each round a sample is given per each task, but only a single task can have its sample annotated. The authors formulate their method using a trade-off between exploitation and exploration. Th1 provides an upper bound on the expected cumulative number of mistakes. The algorithm is compared to 2 different approaches for choosing the single sample/task to be annotated. It is well written and provides good theoretical as well as experimental results.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper's core finding is that combining an identity classification task as well as metric-learning-style verification task helps to learn better features for face classification/verification. The verification task here tries to decrease feature-space distance between instances of the same identity, and increase distance between those of different identities. This improvement is embedded in a state-of-the-art system for face verification, which uses convnets trained on many (400) different views to generate features, distilled into a small set of 25 using feature selection. Very good results are obtained and experiments performed using LFW as a test set. Overall, these are very good results obtained using a somewhat complex pipeline, and a good investigation into the contribution of each task in the loss for feature learning.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper proposes to learn bilingual word vector representations through an autoencoder. The novelty of this approach is to not rely on word-level alignments. It only requires aligned sentences. An autoencoder model is used to reconstruct the bag-of-words representation of aligned sentences, within and between languages.


Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The authors provide finite sample bounds on the excess risk of these classifiers. When taken to the limit these bounds reproduce the known consistency results for this class. However, they are superior in two ways: 1. They apply in the finite case 2. They apply to a broader set of metric spaces The presentation is very clear and the intuition is well described.