How is a sequential decision problem different from a one-off decision problem?
- In a one-off decision problem, even if there are multiple decisions to make they can be treated as a single macro decision. That macro decision is made before any action is carried out. With a sequential decision problem, the agent makes observations, decides on an action, carries out the actions, makes some more observations in the resulting world, then makes more decisions conditioned on the new observations, etc.
|