|
What would the optimal policy be if the punishment in the top-left square was changed to each of the following values: 5, 0.0001, 0? For a punishment of 5: For a punishment of 0.0001: For a punishment of 0: |
Main Tools: Graph Searching | Consistency for CSP | SLS for CSP | Deduction | Belief and Decision Networks | Decision Trees | Neural Networks | STRIPS to CSP |