Methodological reflections for AI alignment research using human feedback
