A machine learning technique that enables an agent to learn by interacting with the environment through trial and error.