FLAWS: A Benchmark for Error Identification and Localization in Scientific Papers