Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems 

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. In their paper "Information-based learning by agents in unbounded state spaces" the authors extend a previous model of information-based exploration as described in reference [11] to unbounded state spaces by introducing a Chinese restaurant process to model transition probabilities. Previous studies have used the Chinese restaurant process for reinforcement learning--for example, reference [2] cited by the authors. It would therefore be good if the authors could clarify the differences to previous studies that have used Chinese restaurant processes in reinforcement learning to clarify originality. L 130: verb is missing in the second part of the sentence To compute the information gain the authors need to compute relative entropies between the true state transition distribution and the estimated state transition distribution.