Is Q-learning an on-policy or an off-policy learning algorithm?

44. Is Q-learning an on-policy or an off-policy learning algorithm?

  A) On-policy
  B) Off-policy

Answer: B) Off-policy

Explanation:

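Q-learning is an off-policy learning algorithm. Its update target uses the maximum estimated value over the actions available in the next state, which corresponds to a greedy target policy, regardless of which action the agent's behavior policy (for example, epsilon-greedy) actually takes next. An on-policy method such as SARSA instead builds its target from the action the agent actually takes.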
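
The difference shows up in a single update step. The following is a minimal sketch, not a full agent: the function names, the list-of-lists Q-table, and all numeric values (alpha, gamma, reward, initial Q-values) are assumptions chosen for illustration.

```python
def q_learning_update(q, s, a, r, s_next, alpha, gamma):
    """Off-policy update: the target uses the best next action (max),
    whatever action the agent goes on to take."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def sarsa_update(q, s, a, r, s_next, a_next, alpha, gamma):
    """On-policy update: the target uses the next action actually taken."""
    target = r + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])


# Hypothetical 2-state, 2-action Q-table; q[state][action].
q_off = [[0.0, 0.0], [1.0, 5.0]]
q_on = [[0.0, 0.0], [1.0, 5.0]]

# Same transition for both: state 0, action 0, reward 1, next state 1.
# Assume the behavior policy then explores and picks action 0 in state 1,
# even though action 1 has the higher estimated value.
q_learning_update(q_off, s=0, a=0, r=1.0, s_next=1, alpha=0.5, gamma=0.9)
sarsa_update(q_on, s=0, a=0, r=1.0, s_next=1, a_next=0, alpha=0.5, gamma=0.9)

print(q_off[0][0])  # Q-learning bootstraps from max(q[1]) = 5.0
print(q_on[0][0])   # SARSA bootstraps from q[1][0] = 1.0
```

With these assumed numbers, Q-learning moves Q(0, 0) toward 1 + 0.9 * 5.0 and SARSA moves it toward 1 + 0.9 * 1.0, so the exploratory next action affects only the SARSA estimate.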

Copyright © 2024 www.includehelp.com. All rights reserved.