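These are measures used to evaluate the performance of named entity recognition (NER) models, such as precision (the fraction of predicted entities that are correct), recall (the fraction of true entities that the model finds), and F1 score (the harmonic mean of precision and recall).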
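As a minimal sketch of how these three measures can be computed, the example below scores predictions at the entity level under an exact-match assumption: a predicted entity counts as correct only if its start, end, and type all match a gold entity. The function name `entity_prf`, the `(start, end, type)` tuple layout, and the sample spans are hypothetical choices made for illustration; other evaluation schemes, such as partial-match or token-level scoring, would give different numbers.

```python
def entity_prf(gold, predicted):
    """Return (precision, recall, f1) for exact-match entity spans.

    Each entity is a hypothetical (start, end, type) tuple; this layout
    is an assumption for illustration, not a standard format.
    """
    gold_set = set(gold)
    pred_set = set(predicted)

    # A true positive is a predicted span that matches a gold span exactly.
    true_positives = len(gold_set & pred_set)

    # Guard against division by zero when either side is empty.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0

    # F1 is the harmonic mean of precision and recall.
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up example data: four gold entities, three predicted entities.
gold = [(0, 2, "PER"), (5, 6, "ORG"), (9, 10, "LOC"), (14, 16, "ORG")]
predicted = [(0, 2, "PER"), (5, 6, "ORG"), (9, 10, "PER")]

precision, recall, f1 = entity_prf(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

Here two of the three predictions match gold entities exactly, so precision is 2/3; two of the four gold entities are found, so recall is 1/2; the third prediction has the right boundaries but the wrong type, so under exact matching it counts as an error.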