Supplementary Material for Bootstrapping Neural Processes Juho Lee
Neural Information Processing Systems
A deterministic path uses self-attention and cross-attention to summarize contexts. The task size n and the context size |c| were drawn as |c| ~ Unif(3, 47) and n - |c| ~ Unif(3, 50 - |c|). For Student-t noise, we added ε ~ γ · T(2.1) to the curves generated from …

Training and testing. We trained all models for 100,000 steps, with each step computing updates from a batch of 100 tasks. The task size n and the context size |c| were drawn as |c| ~ Unif(3, 200) and n - |c| ~ Unif(3, 200 - |c|). Testing was done on 3,000 batches of 16 tasks each (48,000 tasks in total).
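The size sampling and the Student-t noise described above can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions, not the authors' code: the function names are hypothetical, the uniform draws are taken to be over integers with inclusive endpoints, and the scale `gamma=0.1` is a placeholder because the excerpt does not say how γ is chosen.

```python
import math
import random


def sample_task_sizes(rng, min_points=3, max_context=47, max_total=50):
    """Draw (num_context, num_target) with
    |c| ~ Unif(min_points, max_context) and
    n - |c| ~ Unif(min_points, max_total - |c|).
    Integer draws with inclusive endpoints are an assumption."""
    if max_context > max_total - min_points:
        raise ValueError("max_context leaves no room for the minimum number of targets")
    num_context = rng.randint(min_points, max_context)
    num_target = rng.randint(min_points, max_total - num_context)
    return num_context, num_target


def student_t_noise(rng, gamma, df=2.1):
    """Draw gamma * T(df) via T = Z / sqrt(V / df), with V ~ chi-square(df)."""
    z = rng.gauss(0.0, 1.0)
    # chi-square(df) is Gamma(shape=df/2, scale=2); gammavariate takes (shape, scale).
    v = rng.gammavariate(df / 2.0, 2.0)
    return gamma * z / math.sqrt(v / df)


rng = random.Random(0)  # fixed seed only for reproducibility of the sketch

# Sizes for the first setting in the text: |c| in [3, 47], n at most 50.
sizes = [sample_task_sizes(rng) for _ in range(1000)]

# Heavy-tailed noise samples; gamma=0.1 is a placeholder value (assumption).
noise = [student_t_noise(rng, gamma=0.1) for _ in range(5)]

# Test-set size stated in the text: 3,000 batches of 16 tasks.
total_test_tasks = 3000 * 16
```

With 2.1 degrees of freedom the Student-t distribution has very heavy tails, so occasional large noise values are expected; that is why the sketch checks only that samples are finite rather than bounded.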