Which of the following correctly states the difference between Q-learning and SARSA?

47. Which of the following correctly states the difference between Q-learning and SARSA?

  1. In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal
  2. In comparison to QL, SARSA directly learns the optimal policy, whereas QL learns a policy that is "near" the optimal.

Answer: A) In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal

Explanation:

In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal.

Comments and Discussions!

Load comments ↻






Copyright © 2024 www.includehelp.com. All rights reserved.