How Much Does Audio Matter to Recognize Egocentric Object Interactions?