Detecting automatically the layout of clinical documents to enhance the performances of downstream natural language processing