It is a type of reinforcement learning that uses a hierarchy of sub-tasks to make the learning process more efficient. Each sub-task can be learned independently, and the learned policies can be combined to solve the larger task.
It is a type of reinforcement learning that uses a hierarchy of sub-tasks to make the learning process more efficient. Each sub-task can be learned independently, and the learned policies can be combined to solve the larger task.