Collage: Decomposable Rapid Prototyping for Information Extraction on Scientific PDFs