Large Language Model-guided Document Selection