Omni-DNA: A Genomic Model Supporting Sequence Understanding, Long-context, and Textual Annotation
–Neural Information Processing Systems
The interpretation of genomic sequences is crucial for understanding biological processes. To handle the growing volume of DNA sequence data, Genomic Foundation Models (GFMs) have been developed by adapting architectures and training paradigms from Large Language Models (LLMs). Despite their remarkable performance in DNA sequence classification tasks, there remains a lack of systematic understanding regarding the training and task-adaptation processes of GFMs. Moreover, existing GFMs cannot achieve state-of-the-art performance on both short and long-context tasks and lacks multimodal abilities.
Neural Information Processing Systems
Jun-14-2026, 03:08:22 GMT
- Industry: