It is a type of reinforcement learning that uses neural networks to approximate the value of taking an action in a given state, instead of using a lookup table as in traditional RL.
It is a type of reinforcement learning that uses neural networks to approximate the value of taking an action in a given state, instead of using a lookup table as in traditional RL.