Alignment-Aware Model Extraction Attacks on Large Language Models

Open in new window