Learning Distributedand Fair Policiesfor Network Load Balancingas Markov Potential Game

Feb-19-2026, 10:52:13 GMT–Neural Information Processing Systems

At t 2 H inahorizonH ofthegireceiwi(t) 2 W, theworkload policy i 2 , where istheload t, a anactionai(t)= {aij(t)}Nj=1, accordingwi(t) are i(t). Q (o, a) r(o, a) Eo0[V (o0)] 2 , whereV (o0)= Ea0[Q (o0,a0) log (a0|o0)] and Q isthetargetQ network; theactorpolicy isupdatedwiththegradient r Eo[Ea [ log (a|o) Q (o, a)]].

artificial intelligence, latexit sha1, machine learning, (9 more...)

Neural Information Processing Systems

Feb-19-2026, 10:52:13 GMT

Conferences PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Europe
  - United Kingdom > England
    - Greater London > London (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)

Technology:
- Information Technology
  - Communications (0.69)
  - Artificial Intelligence > Machine Learning (0.69)

Duplicate Docs Excel Report

Title
b94d8b035e2183e47afef9e2f299ba47-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found