Goto

Collaborating Authors

 moca




MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Neural Information Processing Systems

Human commonsense understanding of the physical and social world is organized around intuitive theories. These theories support making causal and moral judgments. When something bad happens, we naturally ask: who did what, and why? A rich literature in cognitive science has studied people's causal and moral intuitions. This work has revealed a number of factors that systematically influence people's judgments, such as the violation of norms and whether the harm is avoidable or inevitable.


A A unifying framework Data Distribution Model for Fast Weights Slow Weights Updates Evaluation Supervised Learning S, Q C f

Neural Information Processing Systems

For readability, we omit OSAKA pre-training. Replay-based methods store representative samples from the past, either in their original form (e.g., rehearsal Most prior-based methods rely on task boundaries. Since non-stationary data distributions breaks the i.i.d assumption for The update is computed from a parametric combination of the gradient of the current and previous task. Despite that, meta-continual learning is actively researched [61, 6]. Bayesian change-point detection scheme to identify whether a task has changed.





cc3f5463bc4d26bc38eadc8bcffbc654-AuthorFeedback.pdf

Neural Information Processing Systems

We thank all reviewers for their helpful comments. Our responses to each reviewer are below. The reviewer has three major critiques of the paper, which we address in order. NBA experiment, as there is no ground truth hazard rate. The reviewer's concerns center on the We acknowledge that our statement on "avoiding" negative