Semi-automatic staging area for high-quality structured data extraction from scientific literature