Given a set of states S, a set of actions A, and an experience ⟨s,a,r,s',a'⟩, what is the time complexity to update the value of Q(s,a) using SARSA?
  • Constant time: O(1)

Valid HTML 4.0 Transitional