Goto

Collaborating Authors

 Asia





Towards Global Optimal Visual In-Context Learning Prompt Selection

Neural Information Processing Systems

Visual In-Context Learning (VICL) is a prevailing way to transfer visual foundation models to new tasks by leveraging contextual information contained in in-context examples to enhance learning and prediction of query samples.





TableRAG: Million-Token Table Understanding with Language Models Si-An Chen

Neural Information Processing Systems

This enables more efficient data encoding and precise retrieval, significantly reducing prompt lengths and mitigating information loss. We have developed two new million-token benchmarks from the Arcade and BIRD-SQL datasets to thoroughly evaluate TableRAG's effectiveness at scale.


Scattering Vision Transformer: Spectral Mixing Matters-Supplementary

Neural Information Processing Systems

SVT incorporates the scattering network utilizing the DTCWT for image decomposition into low and high-frequency components. Our primary focus is to analyze the low-frequency and high-frequency filter components to emphasize SVT's exceptional directional orientation capabilities.