AITopics | Markov Models

67496dfa96afddab795530cc7c69b57a-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 12:55:50 GMT

Theoptimalbaseline, however, israrelyusedinpractice (Sutton & Barto (2018); foran exception, see (Peters & Schaal, 2008)). Equation (1) thentakesthefollowingform: r E R(x)= E (R(x) B)r log (x).

artificial intelligence, kl-control, machine learning, (15 more...)

Neural Information Processing Systems

Country:

Asia > Myanmar (0.04)
Oceania > Fiji > Western Division > Lautoka (0.04)
Oceania > Australia (0.04)
(17 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

66f09010d989c83faeeac2617464b6a4-Paper-Conference.pdf

Neural Information Processing SystemsFeb-9-2026, 12:36:44 GMT

experiment, fdr control, feature distribution, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Michigan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

Outline

Neural Information Processing SystemsFeb-9-2026, 11:46:49 GMT

We first prove the direction that efficiency ordering implies Loewner ordering. Next we want to showlimt (I γA)t = 0. Since we assume0 < γ < 2/ A 2, we have I γA 2 = maxi=1,2,,n|1 γλi(A)| < 1, where λi(A) > 0 is thei-the eigenvalue of the positivedefinite matrixA. For the original functionG: Rd V Rd, we define another functionΦ: Rd E Rd such thatΦ(θ,eij) = G(θ,j). This is true for periodic Markov chain, and is shown in the following lemma. Due to its random nature across each epoch, random shuffling is not a Markov chain on state space[n].

artificial intelligence, machine learning, sequence, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.55)

Add feedback

992f0fed0720dbb9d4e060d03ed531ba-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 11:45:57 GMT

Decision making in these systems has a long history, yet, if the state is not fully observed acting optimally in such systems is notoriously hard.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
North America > United States > Florida > Orange County > Orlando (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Add feedback

POMDPsin Continuous Timeand Discrete Spaces

Neural Information Processing SystemsFeb-9-2026, 11:45:50 GMT

Pn i=1 (ti t) andthe (x, tN(t)) (x, t), whichsets (x, tN(t)), see Otheroptimal 25, 36].

artificial intelligence, function 0, machine learning, (10 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
North America > United States > Rhode Island > Providence County > Providence (0.04)
(6 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)

Add feedback

Scaling up Continuous-Time Markov Chains Helps Resolve Underspecification

Neural Information Processing SystemsFeb-9-2026, 11:29:38 GMT

Modeling the time evolution of discrete sets of items (e.g., genetic mutations) is a fundamental problem in many biomedical applications. We approach this problem through the lens of continuous-time Markov chains, and show that the resulting learning task is generally underspecified in the usual setting of cross-sectional data. We explore a perhaps surprising remedy: including a number of additional independent items can help determine time order, and hence resolve underspecifi-cation. This is in sharp contrast to the common practice of limiting the analysis to a small subset of relevant items, which is followed largely due to poor scaling of existing methods. To put our theoretical insight into practice, we develop an approximate likelihood maximization method for learning continuous-time Markov chains, which can scale to hundreds of items and is orders of magnitude faster than previous methods. We demonstrate the effectiveness of our approach on synthetic and real cancer data.

artificial intelligence, machine learning, probability, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Costa Rica > Heredia Province > Heredia (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Spain (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

EmergentComplexityandZero-shotTransfervia UnsupervisedEnvironmentDesign

Neural Information Processing SystemsFeb-9-2026, 11:27:15 GMT

Awide range ofreinforcement learning (RL) problems --including robustness, transfer learning, unsupervised RL, and emergent complexity -- require specifying a distribution of tasks or environments in which a policy will be trained.

artificial intelligence, arxivpreprintarxiv, machine learning, (16 more...)

Neural Information Processing Systems

Country: