Describe one possible scenario in which SARSA would use a different policy than Q-learning.
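
As a starting point, it may help to compare the values the two methods bootstrap from. The sketch below is illustrative only: the function names, action labels, and numbers are hypothetical, and it shows just the one-step update targets, not a full learning loop.

```python
def sarsa_target(reward, gamma, q_next, next_action):
    # On-policy: bootstrap from the action the agent will actually take next.
    return reward + gamma * q_next[next_action]


def q_learning_target(reward, gamma, q_next):
    # Off-policy: bootstrap from the highest-valued next action,
    # regardless of which action the agent actually takes.
    return reward + gamma * max(q_next.values())


# Hypothetical Q-values for the next state (illustrative numbers only).
q_next = {"left": 1.0, "right": 5.0}
reward, gamma = 0.0, 0.9

# Assume an exploratory step chose "left", which is not the greedy action.
sarsa = sarsa_target(reward, gamma, q_next, "left")
qlearn = q_learning_target(reward, gamma, q_next)
```

When the next action happens to be the greedy one, the two targets coincide; they differ only on steps where exploration selects a non-greedy action.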