ReaderLM-v2: Small Language Model for HTML to Markdown and JSON