Spectral Manifold Harmonization for Graph Imbalanced Regression
Nogueira, Brenda, Gomes, Gabe, Jiang, Meng, Chawla, Nitesh V., Moniz, Nuno
–arXiv.org Artificial Intelligence
Graph-structured data is ubiquitous in scientific domains, where models often face imbalanced learning settings. In imbalanced regression, domain preferences focus on specific target value ranges that represent the most scientifically valuable cases; however, we observe a significant lack of research regarding this challenge. In this paper, we present Spectral Manifold Harmonization (SMH), a novel approach to address imbalanced regression challenges on graph-structured data by generating synthetic graph samples that preserve topological properties while focusing on the most relevant target distribution regions. Conventional methods fail in this context because they either ignore graph topology in case generation or do not target specific domain ranges, resulting in models biased toward average target values. Experimental results demonstrate the potential of SMH on chemistry and drug discovery benchmark datasets, showing consistent improvements in predictive performance for target domain ranges. Code is available at https://github.com/brendacnogueira/smh-graph-imbalance.git.
arXiv.org Artificial Intelligence
Jul-15-2025
- Country:
- Asia
- China (0.04)
- Singapore > Central Region
- Singapore (0.04)
- North America
- Canada > Ontario
- Toronto (0.06)
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Indiana > St. Joseph County
- Notre Dame (0.05)
- New York > New York County
- New York City (0.05)
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- California > Los Angeles County
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (0.89)
- Industry:
- Technology: