Using a subset of data to test cleaning and analysis procedures before applying them to the entire dataset.
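
As a minimal sketch of this workflow, the Python example below draws a reproducible random subset, runs a cleaning step on it, checks the result, and only then processes the full dataset. The data, the `clean_record` and `draw_subset` helpers, the subset size of 50, and the specific checks are all hypothetical and chosen for illustration; a real procedure would substitute its own.

```python
import random


def clean_record(record):
    """Hypothetical cleaning step: trim whitespace and parse the value as a float."""
    name = record["name"].strip()
    raw = record["value"].strip()
    value = float(raw) if raw else None  # treat empty strings as missing
    return {"name": name, "value": value}


def draw_subset(records, k, seed=0):
    """Return a reproducible random subset of at most k records."""
    rng = random.Random(seed)
    return rng.sample(records, min(k, len(records)))


# Made-up data standing in for the full dataset (assumption for illustration).
full_dataset = [
    {"name": f" item{i} ", "value": "" if i % 10 == 0 else f" {i}.5 "}
    for i in range(1000)
]

# 1. Try the procedure on a small subset first.
subset = draw_subset(full_dataset, k=50)
cleaned_subset = [clean_record(r) for r in subset]

# 2. Check the result before scaling up.
assert all(r["name"] == r["name"].strip() for r in cleaned_subset)
assert all(r["value"] is None or isinstance(r["value"], float) for r in cleaned_subset)

# 3. Only then apply the same procedure to the entire dataset.
cleaned_full = [clean_record(r) for r in full_dataset]
```

Fixing the seed keeps the subset stable between runs, so a failure seen during testing can be reproduced while the procedure is being corrected. A random subset can still miss rare edge cases, so passing on the subset lowers the risk of problems on the full dataset without ruling them out.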