during learning, numerical precision reduction and for finding the Pareto optimal set of configurations apply directly

Neural Information Processing Systems 

We would like to thank the reviewers for their thoughtful comments and valuable suggestions. We will clarify this point in the paper. Our algorithms are agnostic to the leaf distributions used. Thanks for this valuable feedback, we will improve the pseudocode as you suggest. As such, there is memory overhead but no computational overhead.