DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives