A Supplementary Material

Neural Information Processing Systems 

Figure A.1: The median difference in GP log score between the forward and backward model, with Figure A.3 shows the distribution of Cyclic graphs occasionally returned by DiBS+ were discarded. We performed an additional experiment comparing the ability of the different methods to model the posterior distribution over DAGs as a function of their run-time. Figure A.4 shows the reverse K-L divergence between the "true" posterior (obtained by enumerating every possible structure and Figure A.4: Reverse K-L divergence between the true posterior and the BGe posterior (green), DiBS+ In figure A.5 we compare the number of score evaluations performed by the different methods when Figure A.5: Distribution of number of scores evaluated by the different methods. Figure A.9 shows the corresponding run-times needed to run