Measures used to assess the performance of a text classification model, such as accuracy, precision, recall, and F1 score.