ToDo: Token Downsampling for Efficient Generation of High-Resolution Images