The process of converting raw text data into numerical features that can be used as input to machine learning models.