Continuous Mean-Covariance Bandits

Apr-24-2026, 12:49:54 GMT–Neural Information Processing Systems

Existing risk-aware multi-armed bandit models typically focus on risk measures of individual options, such as variance. As a result, they cannot be directly applied to important real-world online decision-making problems with correlated options. In this paper, we propose a novel Continuous Mean-Covariance Bandit (CMCB) model that explicitly takes option correlation into account. Specifically, in CMCB, a learner sequentially chooses weight vectors over the given options and observes random feedback according to its decisions. The learner's objective is to achieve the best trade-off between reward and risk, where risk is measured by the covariance among options.
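To make the reward-risk trade-off concrete, here is a minimal sketch of how a weight vector might be scored. The abstract does not state the exact objective, so the form used here (expected reward minus a risk-aversion weight times the variance of the weighted combination) is an assumption, as are the names `theta`, `sigma`, and `rho` and all of the numbers.

```python
# Illustrative sketch only: the abstract does not give the objective's exact form.
# Assumed here: utility(w) = w . theta - rho * w^T Sigma w, with hypothetical
# names theta (expected rewards), sigma (option covariance), rho (risk aversion).

def mean_covariance_utility(w, theta, sigma, rho):
    """Expected reward of weight vector w minus rho times its variance."""
    n = len(w)
    expected_reward = sum(w[i] * theta[i] for i in range(n))
    # Quadratic form w^T Sigma w: off-diagonal terms carry the correlation.
    risk = sum(w[i] * sigma[i][j] * w[j] for i in range(n) for j in range(n))
    return expected_reward - rho * risk


# Two negatively correlated options (made-up numbers, not from the paper).
theta = [0.10, 0.08]
sigma = [[0.04, -0.03],
         [-0.03, 0.04]]
rho = 2.0

all_in_first = mean_covariance_utility([1.0, 0.0], theta, sigma, rho)
even_split = mean_covariance_utility([0.5, 0.5], theta, sigma, rho)
```

Under these assumed numbers, the even split scores higher than putting all weight on the option with the larger expected reward, because the negative covariance lowers the variance of the combination. A risk measure that looks only at each option's own variance would not capture this effect.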

artificial intelligence, data mining, machine learning

Country:
- Asia > China (0.28)

Genre:
- Research Report (0.46)

Industry:
- Banking & Finance > Trading (0.92)
- Health & Medicine (0.69)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Data Science > Data Mining
    - Big Data (0.49)
