Towards human-like spoken dialogue generation between AI agents from written dialogue