MoGU: A Framework for Enhancing Safety of LLMs While Preserving Their Usability

Neural Information Processing Systems

Large Language Models (LLMs) are increasingly deployed in a wide range of applications. As their usage grows, so do concerns about their safety, particularly whether they keep their responses harmless when faced with malicious instructions.