RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning (SUPPLEMENTARY MATERIAL)
Neural Information Processing Systems
We exploit categorical image annotations available in many captioning datasets. The influence of the people category is clearly visible. Figure 2: RATT ablation on the MS-COCO validation set using different attention masks. Evaluation is the same as MS-COCO (figure 4). In figures 6 and 7, we give a comparison of performance for all considered approaches on the MS-COCO validation set. These learning curves and heatmaps allow us to appreciate the ability of RATT to remember old tasks.
Nov-15-2025, 06:37:45 GMT