A type of machine learning where an agent learns to take actions in an environment to maximize a reward signal, often used in applications such as game playing or robotics.
A type of machine learning where an agent learns to take actions in an environment to maximize a reward signal, often used in applications such as game playing or robotics.