Among On-policy and off-policy, which of the following target policy is equal to behavior policy?

43. Among On-policy and off-policy, which of the following target policy is equal to behavior policy?

  1. On-policy
  2. Off-policy

Answer: A) On-policy

Explanation:

In the on-policy learning algorithm target policy is equal to behavior policy.

Comments and Discussions!

Load comments ↻






Copyright © 2024 www.includehelp.com. All rights reserved.