Between on-policy and off-policy learning, in which one is the target policy not equal to the behavior policy?

A) On-policy

B) Off-policy

C) Both (A) and (B)

D) None of the above

Correct Answer : B) Off-policy

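Explanation : In an off-policy learning algorithm, the target policy (the policy being learned and evaluated) is not equal to the behavior policy (the policy used to select actions and generate experience). In an on-policy algorithm, the two are the same policy.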
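
To make the distinction concrete, here is a minimal sketch that contrasts the update targets of SARSA (a common on-policy method) and Q-learning (a common off-policy method). The function names, the Q-values, and the choice of an epsilon-greedy behavior policy are illustrative assumptions and are not part of the original question.

```python
import random


def epsilon_greedy(q_row, epsilon, rng):
    """Behavior policy (assumed here): explore with probability epsilon, else act greedily."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=lambda a: q_row[a])


def sarsa_target(reward, gamma, q_next_row, next_action):
    """On-policy target: bootstraps from the action the behavior policy actually takes next."""
    return reward + gamma * q_next_row[next_action]


def q_learning_target(reward, gamma, q_next_row):
    """Off-policy target: bootstraps from the greedy action, whatever is actually taken next."""
    return reward + gamma * max(q_next_row)


# Hypothetical Q-values for the next state (three actions).
q_next = [1.0, 5.0, 2.0]
reward, gamma = 1.0, 0.5

# Suppose the behavior policy explored and picked action 0 in the next state.
exploratory_action = 0

on_policy_value = sarsa_target(reward, gamma, q_next, exploratory_action)
off_policy_value = q_learning_target(reward, gamma, q_next)
```

When the behavior policy explores, the SARSA target follows the exploratory action, while the Q-learning target still evaluates the greedy policy. That gap is what "target policy is not equal to behavior policy" means in practice.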
