|
In a stationary MDP, if an agent has a 20% chance of dying at each timestep, what is the optimal value for the discount factor?
|
Main Tools: Graph Searching | Consistency for CSP | SLS for CSP | Deduction | Belief and Decision Networks | Decision Trees | Neural Networks | STRIPS to CSP |