The process of marking up a corpus with linguistic information, such as part-of-speech tags or syntactic structure.