Bayesian Attention Mechanism: A Probabilistic Framework for Positional Encoding and Context Length Extrapolation