It is a research methodology that uses computer software to analyze large collections of language data, known as corpora.