Aurelia: Test-time Reasoning Distillation in Audio-Visual LLMs