Omni-DNA: A Genomic Model Supporting Sequence Understanding, Long-context, and Textual Annotation

Jun-14-2026, 03:08:22 GMT–Neural Information Processing Systems

The interpretation of genomic sequences is crucial for understanding biological processes. To handle the growing volume of DNA sequence data, Genomic Foundation Models (GFMs) have been developed by adapting architectures and training paradigms from Large Language Models (LLMs). Despite their remarkable performance in DNA sequence classification tasks, there remains a lack of systematic understanding regarding the training and task-adaptation processes of GFMs. Moreover, existing GFMs cannot achieve state-of-the-art performance on both short and long-context tasks and lacks multimodal abilities.

bioinformatics, large language model, natural language, (7 more...)

Neural Information Processing Systems

Jun-14-2026, 03:08:22 GMT

Conferences Web Page

Add feedback

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology
  - Biomedical Informatics > Translational Bioinformatics (1.00)
  - Artificial Intelligence > Natural Language
    - Large Language Model (0.75)