Reinforcement Learning - Quiz(MCQ)
How many elements are in the tuple that defines a Markov Decision Process (MDP)?
A) 3
B) 4
C) 5
D) 6

Correct Answer : B) 4


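Explanation : An MDP is defined as a 4-tuple (S, A, P_a, R_a) :

* S : a finite set of states.
* A : a finite set of actions.
* P_a(s, s') : the probability that taking action a in state s leads to state s'.
* R_a(s, s') : the reward received after transitioning from state s to state s' due to action a.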
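
As a rough illustration, the four components listed above can be held in a small Python container. This is only a sketch: the class name `MDP`, its field names, the helper methods, and the two-state toy example are all assumptions made for this illustration and do not come from the quiz.

```python
from dataclasses import dataclass, fields
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class MDP:
    """Hypothetical container for the four components of an MDP."""
    states: List[str]    # S: finite set of states
    actions: List[str]   # A: finite set of actions
    # P_a(s, s'): transitions[(s, a)][s2] is the probability of reaching s2
    transitions: Dict[Tuple[str, str], Dict[str, float]]
    # R_a(s, s'): rewards[(s, a, s2)] is the reward for that transition
    rewards: Dict[Tuple[str, str, str], float]

    def is_valid(self) -> bool:
        """Check that every (state, action) distribution sums to 1."""
        return all(
            abs(sum(dist.values()) - 1.0) < 1e-9
            for dist in self.transitions.values()
        )

    def expected_reward(self, s: str, a: str) -> float:
        """Average reward of taking action a in state s (missing rewards count as 0)."""
        return sum(
            p * self.rewards.get((s, a, s2), 0.0)
            for s2, p in self.transitions[(s, a)].items()
        )


# Toy example (made up for illustration): a machine that is "cool" or "hot".
toy = MDP(
    states=["cool", "hot"],
    actions=["wait", "work"],
    transitions={
        ("cool", "wait"): {"cool": 1.0},
        ("cool", "work"): {"cool": 0.5, "hot": 0.5},
        ("hot", "wait"): {"cool": 1.0},
        ("hot", "work"): {"hot": 1.0},
    },
    rewards={
        ("cool", "work", "cool"): 2.0,
        ("cool", "work", "hot"): 1.0,
        ("hot", "work", "hot"): -1.0,
    },
)

print(len(fields(toy)))                      # number of components in the tuple
print(toy.is_valid())                        # do the probabilities sum to 1?
print(toy.expected_reward("cool", "work"))   # 0.5 * 2.0 + 0.5 * 1.0
```

The container has exactly four fields, one for each element of the tuple, which mirrors the quiz answer.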
