Among On-policy and off-policy, which of the following target policy is not equal to behavior policy?

42. Among On-policy and off-policy, which of the following target policy is not equal to behavior policy?

  1. On-policy
  2. Off-policy

Answer: B) Off-policy

Explanation:

In an off-policy learning algorithm target policy is not equal to behavior policy.

Comments and Discussions!

Load comments ↻






Copyright © 2024 www.includehelp.com. All rights reserved.