The ability of a robot to interpret and understand its surroundings using visual information captured by cameras.