Modeling Real-Time Interactive Conversations as Timed Diarized Transcripts