f1c1592588411002af340cbaedd6fc33-Supplemental.pdf
–Neural Information Processing Systems
Figure 2: These two graphs cannot be distinguished by 1-WL-test. The COMBINE step takes the result of AGGREGATE and the previous representation of current node asinput. Wereduce theFFN inner-layer dimension of4din [47] tod, which does not appreciably hurt the performance but significantly save the parameters. The embedding dropout ratio is set to 0.1 by default in many previous Transformer works[11,34]. The rest of hyper-parameters remain unchanged. Table 8 summarizes the hyper-parameters used for fine-tuning Graphormer on OGBGMolPCBA.
Neural Information Processing Systems
Feb-11-2026, 21:21:25 GMT
- Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.05)
- Technology: