The study of language based on large collections of authentic language data collected from different sources.