TriangleMix: Accelerating Prefilling via Decoding-time Contribution Sparsity