A biological plausible audio-visual integration model for continual lifelong learning