Supplementary material for "FETA: Towards Specializing Foundation Models for Expert Task Applications" Sivan Harary

Neural Information Processing Systems 

The downloaded documents were processed by the DeepSearch tool (https://ds4sd.github.io/). We employ a dilation technique in which we increase the length of each box's horizontal edges. This creates some overlaps between neighboring boxes. We created manual annotations for part of the Car Manuals dataset. The steps are shown on an example page from the Car Manuals dataset. In this test we consider only the manually annotated documents.
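The dilation step described above can be illustrated with a minimal sketch. It assumes boxes are axis-aligned and stored as (x0, y0, x1, y1) tuples, and that each box is widened by a fixed padding on both sides. The padding value, the box representation, and the helper names `dilate_horizontally` and `boxes_overlap` are hypothetical and are not taken from the paper.

```python
from typing import Tuple

# Assumed box layout: (x0, y0, x1, y1), with x0 < x1 and y0 < y1.
Box = Tuple[float, float, float, float]


def dilate_horizontally(box: Box, pad: float = 2.0) -> Box:
    """Lengthen the horizontal (top and bottom) edges of a box.

    The box is widened by `pad` on the left and on the right; its height
    is unchanged. The default `pad` is an illustrative assumption.
    """
    x0, y0, x1, y1 = box
    return (x0 - pad, y0, x1 + pad, y1)


def boxes_overlap(a: Box, b: Box) -> bool:
    """Return True if two axis-aligned boxes share a region of positive area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


# Two neighboring boxes on the same text line, separated by a small gap.
left = (0.0, 0.0, 10.0, 5.0)
right = (12.0, 0.0, 20.0, 5.0)

overlap_before = boxes_overlap(left, right)
overlap_after = boxes_overlap(dilate_horizontally(left), dilate_horizontally(right))
```

In this sketch the two boxes are disjoint before dilation and overlap afterwards, which is the effect the text describes for neighboring boxes.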