The Medium Is Not the Message: Deconfounding Document Embeddings via Linear Concept Erasure