Large Language Model


MoGU: A Framework for Enhancing Safety of LLMs While Preserving Their Usability

Neural Information Processing Systems

Large Language Models (LLMs) are increasingly deployed in various applications. As their usage grows, concerns about their safety are rising, especially about whether they maintain harmless responses when faced with malicious instructions.

How does Architecture Influence the Base Capabilities

Neural Information Processing Systems

Unlike existing work that focuses on the influence of scale on base capabilities, our work examines the influence of architecture on them. Specifically, we ask: How does architecture influence the base capabilities of pre-trained language models?