A learning algorithm that updates a player's belief about their opponent's strategy based on observed moves.