Confidence Regulation Neurons in Language Models
Alessandro Stolfo (ETH Zürich), Ben Wu

Neural Information Processing Systems 

Entropy neurons are characterized by an unusually high weight norm and influence the final layer normalization (LayerNorm) scale to effectively scale down the logits.
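The mechanism described above can be illustrated with a small numerical sketch. This is a toy model, not the authors' code: the dimensions, the random unembedding matrix `W_U`, the direction `v`, and the write strength `strength` are all hypothetical. It assumes that the neuron writes along a direction the unembedding ignores, so its only effect on the output is to raise the LayerNorm scale, which shrinks every logit by the same factor.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # LayerNorm without learned gain or bias: center, then divide by the std.
    centered = x - x.mean()
    return centered / np.sqrt(centered.var() + eps)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def entropy(p):
    return float(-(p * np.log(p + 1e-12)).sum())

rng = np.random.default_rng(0)
d_model, vocab = 16, 50  # hypothetical toy sizes

# Hypothetical write direction of the neuron: unit norm and zero mean,
# so the centering step of LayerNorm leaves it unchanged.
v = rng.normal(size=d_model)
v -= v.mean()
v /= np.linalg.norm(v)

# Toy unembedding matrix, made blind to v by projecting v out of every
# column (assumption: the neuron writes where the unembedding cannot read).
W_U = rng.normal(size=(d_model, vocab))
W_U -= np.outer(v, v @ W_U)

x = rng.normal(size=d_model)  # residual stream before the neuron fires
strength = 10.0               # hypothetical large write, standing in for a high-norm neuron
x_after = x + strength * v

logits_before = layer_norm(x) @ W_U
logits_after = layer_norm(x_after) @ W_U

# Factor by which the larger LayerNorm denominator shrinks the logits.
eps = 1e-5
ratio = np.sqrt(x.var() + eps) / np.sqrt(x_after.var() + eps)

entropy_before = entropy(softmax(logits_before))
entropy_after = entropy(softmax(logits_after))

print("logit scale factor:", ratio)
print("entropy before:", entropy_before)
print("entropy after: ", entropy_after)
```

In this sketch the top-ranked token is unchanged, because all logits are multiplied by the same positive factor; only the sharpness of the distribution, and therefore the model's confidence, changes.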
