field, with a "scientifically rigorous", "fair" and "extensive" evaluation (R1,4,5) and 3 of 4 Rs advocating acceptance

Neural Information Processing Systems 

We thank the reviewers for their informative feedback, indicating improved results (All) and that the hypotheses are "intuitive" (see Section 6). Do partially joint models help? Still, it is interesting future work to try a joint network (see Discussion, p.8). That shows that local low-level features, beyond being correlated with the likelihood, dominate it. Overclaiming w.r.t. MSP-OE (R5): We agree and would modify the wording, e.g., to "slightly underperform".