Benchmarking large language models for materials synthesis: the case of atomic layer deposition