40. Which of the following describes a learning algorithm in which the same policy is both evaluated and improved?

  1. Behavior policy
  2. Target policy
  3. On-policy
  4. Off-policy

Answer: 3. On-policy

Explanation:

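In on-policy learning, the policy that the agent uses to select actions is the same policy that is evaluated and improved. SARSA is a typical example. In off-policy learning, by contrast, the agent follows a behavior policy to generate experience while a separate target policy is evaluated and improved; Q-learning is a typical example.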
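
To make the distinction concrete, here is a minimal sketch, assuming a tabular setting where action values are stored in a dictionary keyed by (state, action). It contrasts the update rule of SARSA, a standard on-policy method, with that of Q-learning, a standard off-policy method. The function names, the dictionary layout, and the default values of alpha and gamma are illustrative assumptions, not part of the original question.

```python
def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from a_next, the action the agent's own
    policy actually selected in s_next (hypothetical helper)."""
    target = r + gamma * q.get((s_next, a_next), 0.0)
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (target - old)
    return q[(s, a)]


def q_learning_update(q, s, a, r, s_next, n_actions, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the greedy action in s_next,
    regardless of which action the behavior policy takes next
    (hypothetical helper)."""
    best_next = max(q.get((s_next, b), 0.0) for b in range(n_actions))
    target = r + gamma * best_next
    old = q.get((s, a), 0.0)
    q[(s, a)] = old + alpha * (target - old)
    return q[(s, a)]
```

The only difference is the bootstrap term: SARSA uses the value of the action that the same policy chose next, so the policy being followed is the one being evaluated, while Q-learning uses the maximum over actions and therefore evaluates a greedy target policy that may differ from the one being followed.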

Copyright © 2024 www.includehelp.com. All rights reserved.