"Corpus linguistics is the study of a language as that language is expressed in its text corpus..."
How corpus-linguistic approaches can support language learning and teaching, including the design and use of corpus-based activities and materials.
Corpora: The definition, types, and characteristics of corpora, and how to compile one.
Corpus Annotation: Understanding how to annotate corpora, which includes knowing the symbols, tags, labels, and conventions used for the task.
Frequency Analysis: Describing and interpreting the frequency and distribution of tokens in a corpus. This includes the use of frequency and dispersion plots.
Collocation and Concordance: Analyzing which words habitually co-occur in a corpus, and generating concordance lines to examine each word in its context.
Keyword Analysis: Identifying and analyzing the words that are unusually frequent in a corpus compared with a reference corpus, using different types of software.
Textual Analysis: Examining thematic, syntactical, and discursive patterns within the corpus.
Stylistics: Analyzing linguistic choices in a corpus as an expression of style.
Genre Analysis: Describing and categorizing corpora based on their communicative purpose and generic conventions.
Corpus-based Contrastive Analysis: Comparing and contrasting two corpora to examine linguistic differences and similarities.
Corpus-Based Translation Studies: Studying translation with corpus-linguistic methods, typically using parallel or comparable corpora.
Corpus-based Language Teaching: Using corpora to inform language teaching and so improve language learning.
Natural Language Processing: Developing algorithms and machine-learning models that process large corpora for applications such as machine translation and information retrieval.
Computational Linguistics: A subfield of linguistics that models natural language computationally and supplies much of the theory behind natural language processing.
Corpus Management: Managing a corpus, including procedures for maintenance and updating, metadata, and data formats.
Corpora and Discourse: Using corpora to study discourse in fields such as political, social, and scientific discourse.
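Three of the topics above (frequency analysis, concordance, and collocation) can be illustrated with a short sketch. The toy corpus, the naive tokenizer, and the function names `tokenize`, `frequency_list`, `concordance`, and `collocates` are all assumptions made for illustration; they are not taken from any particular corpus tool, and real studies normally rely on dedicated software and much larger corpora.

```python
import re
from collections import Counter

# Toy corpus invented for illustration; a real corpus would be far larger.
CORPUS = (
    "The cat sat on the mat. The dog sat on the log. "
    "A cat and a dog sat together."
)


def tokenize(text):
    """Lowercase the text and return its word tokens (a deliberately naive rule)."""
    return re.findall(r"[a-z']+", text.lower())


def frequency_list(tokens):
    """Map each word type to its raw count in the corpus."""
    return Counter(tokens)


def concordance(tokens, node, width=3):
    """Return one keyword-in-context line per occurrence of `node`."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def collocates(tokens, node, window=2):
    """Count the words found within `window` tokens on either side of `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            span = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
            counts.update(span)
    return counts


tokens = tokenize(CORPUS)
print(frequency_list(tokens).most_common(2))  # [('the', 4), ('sat', 3)]
for line in concordance(tokens, "cat"):
    print(line)
print(collocates(tokens, "sat").most_common(3))
```

Raw co-occurrence counts like these favor very frequent words such as "the"; corpus tools usually rank collocates with an association measure instead, which this sketch leaves out.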
Corpus-driven approach: An inductive approach in which rules and patterns are derived from large corpora themselves rather than from a prior theory.
Corpus-assisted approach: An approach that uses the corpus as a tool or a resource for language learning.
Corpus-based approach: An approach that uses data from corpora to test or illustrate existing hypotheses, in support of linguistics research and language learning.
Data-driven learning: An approach in which learners explore corpus data themselves, typically through concordance lines, to discover patterns of language use.
Task-based learning: An approach that uses tasks, activities, or projects that focus on language learning and use.
Language modeling: An approach that uses computational models, typically ones that assign probabilities to sequences of words, to represent language in order to support language learning.
Text classification: An approach that analyzes texts and categorizes them based on certain criteria or features, such as genre, topic, or language proficiency level.
Concordancing: An approach that retrieves every occurrence of a word or phrase in a corpus and lists each one with its surrounding context (a concordance) to help language learners understand the use of language in context.
Collocation: An approach that examines the co-occurrence of words in a corpus to identify common collocations (words that frequently appear together) and help language learners produce more natural-sounding language.
Lexical analysis: An approach that analyzes the lexical features of a corpus, including vocabulary and word meaning, to help language learners understand how words are used in context.
Corpus-based language teaching: An approach that uses data from corpora to support language teaching and learning.
"Corpus linguistics proposes that a reliable analysis of a language is more feasible with corpora collected in the field..."
"The text-corpus method uses the body of texts written in any natural language to derive the set of abstract rules..."
"...collected in the field—the natural context ('realia') of that language..."
"Those results can be used to explore the relationships between that subject language and other languages..."
"The first such corpora were manually derived from source texts..."
"...but now that work is automated."
"Corpora have not only been used for linguistics research, they have also been used to compile dictionaries..."
"...starting with The American Heritage Dictionary of the English Language in 1969..."
"John McHardy Sinclair advocates minimal annotation so texts speak for themselves..."
"The Survey of English Usage team (University College, London) advocate annotation..."
"...as allowing greater linguistic understanding through rigorous recording."
"Experts in the field have differing views about the annotation of a corpus..."