Value Iteration Algorithm Example

From Optimization to Control: Quasi-Policy Iteration

Abstract: Recent control algorithms for Markov decision processes (MDPs) have been designed using an implicit analogy with well-established optimization algorithms. In this paper, we adopt the ...

IEEE

Two-Stage Value Iteration for Multi-Leader Tracking under Interactive Nash Equilibrium in Discrete Time

Abstract: For the discrete-time multi-leader system, this paper proposes a two-stage value iteration to fit complex optimal solutions in Bellman equations of multi-leader and realize the tracking ...

GitHub

An AI algorithm for playing dice game

MyAgent class defines an AI which plays the dice game with the best strategy possible using the Value Iteration algorithm from the book[2]: (Sutton et al., 2018, p. 83). For storing utilities and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

From Optimization to Control: Quasi-Policy Iteration

Two-Stage Value Iteration for Multi-Leader Tracking under Interactive Nash Equilibrium in Discrete Time

An AI algorithm for playing dice game

Trending now