A Additional numerical experiments

Aug-14-2025, 22:18:55 GMT–Neural Information Processing Systems

In this section, we present additional numerical experiments. To add randomness to the environment, we let the state transitions be random. Under the optimal policy, the agent takes the special jump and reaches the terminal state. Under the target policy, the agent reaches the terminal state as quickly as possible while avoiding the special jump. We assume that the agent is unaware of both the attacker's presence and its manipulations.




