Online Learning for Non-monotone Submodular Maximization: From Full Information to Bandit Feedback

Zhang, Qixin, Deng, Zengde, Chen, Zaiyi, Zhou, Kuangqi, Hu, Haoyuan, Yang, Yu

Aug-16-2022–arXiv.org Artificial Intelligence

In this paper, we revisit the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set, which finds wide real-world applications in the domain of machine learning, economics, and operations research. At first, we present the Meta-MFW algorithm achieving a $1/e$-regret of $O(\sqrt{T})$ at the cost of $T^{3/2}$ stochastic gradient evaluations per round. As far as we know, Meta-MFW is the first algorithm to obtain $1/e$-regret of $O(\sqrt{T})$ for the online non-monotone continuous DR-submodular maximization problem over a down-closed convex set. Furthermore, in sharp contrast with ODC algorithm \citep{thang2021online}, Meta-MFW relies on the simple online linear oracle without discretization, lifting, or rounding operations. Considering the practical restrictions, we then propose the Mono-MFW algorithm, which reduces the per-function stochastic gradient evaluations from $T^{3/2}$ to 1 and achieves a $1/e$-regret bound of $O(T^{4/5})$. Next, we extend Mono-MFW to the bandit setting and propose the Bandit-MFW algorithm which attains a $1/e$-regret bound of $O(T^{8/9})$. To the best of our knowledge, Mono-MFW and Bandit-MFW are the first sublinear-regret algorithms to explore the one-shot and bandit setting for online non-monotone continuous DR-submodular maximization problem over a down-closed convex set, respectively. Finally, we conduct numerical experiments on both synthetic and real-world datasets to verify the effectiveness of our methods.

algorithm, down-closed convex, maximization, (12 more...)

arXiv.org Artificial Intelligence

Aug-16-2022

arXiv.org PDF

Add feedback

Country:
- Asia
  - Singapore (0.04)
  - China > Hong Kong
    - Kowloon (0.04)

Genre:
- Research Report (0.64)

Industry:
- Education > Educational Setting > Online (0.40)

Technology:
- Information Technology
  - Enterprise Applications > Human Resources
    - Learning Management (0.40)
  - Artificial Intelligence > Machine Learning
    - Statistical Learning > Gradient Descent (0.55)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found