Which of the following correctly states the difference between Q-learning and SARSA?

Reinforcement Learning - Quiz(MCQ)

In comparison to QL, SARSA directly learns the optimal policy, whereas QL learns a policy that is "near" the optimal.

In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal

Both (A) and (B)

None of the above

Correct Answer : In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal

Explanation : In comparison to SARSA, QL directly learns the optimal policy, whereas SARSA learns a policy that is "near" the optimal.

Recently Updated in Reinforcement Learning Questions

Gamma (Î³) in the bellman equation is known as?

Value factor

Discount factor

Environment factor

None of the above

Correct Answer : Discount factor

Explanation : Gamma (γ) in the bellman equation is known as the Discount factor.

How do you represent the agent state in reinforcement learning?

Markov state

Discount state

Discount factor

None of the above

Correct Answer : Markov state

Explanation : Represent the agent state in reinforcement learning Markov state.

P[St+1 | St ] = P[St +1 | S1,......, St], in this condition
What is the meaning of St?

State factor

Markov state

Discount factor

None of the above

Correct Answer : Markov state

Explaination : P[St+1 | St ] = P[St +1 | S1,......, St], in the following condition St represents the Markov state.

New Technologies MCQ's

Machine Learning

Artificial Intelligence