Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems 

Summary: A method for jointly training all parameters (decision splits and posterior probabilities) of decision trees is proposed. This is an important topic since current methods train trees in a greedy manner, layer by layer. The task is phrased as a single optimization problem which is then upper-bounded and approximated to obtain a tractable formulation. Quality - The paper is well written but there might be flaws in the reasoning. Clarity - The derivation is clean but experiments and their presentation could be improved.