Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward