Mol-MoE: Training Preference-Guided Routers for Molecule Generation

Calanzone, Diego, D'Oro, Pierluca, Bacon, Pierre-Luc

Feb-8-2025–arXiv.org Artificial Intelligence

Recent advances in language models have enabled framing molecule generation as sequence modeling. However, existing approaches often rely on single-objective reinforcement learning, limiting their applicability to real-world drug design, where multiple competing properties must be optimized. Traditional multi-objective reinforcement learning (MORL) methods require costly retraining for each new objective combination, making rapid exploration of trade-offs impractical. To overcome these limitations, we introduce Mol-MoE, a mixture-of-experts (MoE) architecture that enables efficient test-time steering of molecule generation without retraining. Central to our approach is a preference-based router training objective that incentivizes the router to combine experts in a way that aligns with user-specified trade-offs. This provides improved flexibility in exploring the chemical property space at test time, facilitating rapid trade-off exploration. Benchmarking against state-of-the-art methods, we show that Mol-MoE achieves superior sample quality and steerability.

machine learning, natural language, training preference-guided router, (19 more...)

arXiv.org Artificial Intelligence

Feb-8-2025

arXiv.org PDF

Add feedback

Country:
- North America
  - Canada > Quebec (0.14)
  - United States > California (0.14)

Genre:
- Research Report > Promising Solution (0.34)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.48)
  - Natural Language (1.00)
  - Representation & Reasoning (1.00)