Reinforcement learning

It involves learning through trial and error by receiving feedback from the environment. The model tries to find the optimal policy that maximizes the reward function.