The identification and classification of textual variants based on their similarity to other variants.