The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity