In the beginning, based on the Up-Down model, we have attempted to implement the Constant Prophet Attention
–Neural Information Processing Systems
We thank all the reviewers for the helpful comments. We will revise the paper to address your concerns. R1-Q1: The implementation seems straight-forward and the ablation analysis on the loss function. Thus we kept using L1 norm in the rest of experiments. We will conduct a systematic comparison between various loss functions in the next revision.
Neural Information Processing Systems
Oct-2-2025, 04:23:20 GMT
- Technology: