Describe Anything: Detailed Localized Image and Video Captioning