Correct Answer : On-policy
Explanation : On-policy type of policy is a learning algorithm in which the same policy is improved and evaluated.