Correct Answer : 4
Explanation : MDP consists of 4 tuples :* A set of finite States S* A set of finite Actions A* Rewards received after transitioning from state S to state S', due to action a.* Probability Pa.