
A corpus (plural corpora) is a linguistic resource consisting of a large, structured collection of texts, usually stored and processed electronically. In corpus linguistics, corpora are used to carry out statistical analysis and hypothesis testing, to check occurrences, and to validate linguistic rules within a particular language domain. Corpora are the primary knowledge base for corpus linguistics.
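
As a rough illustration of what checking occurrences in a corpus can look like, the sketch below counts word frequencies over a tiny invented corpus. The three sentences, the `tokenize` and `word_frequencies` helpers, and the simple letters-only tokenization are all assumptions made for this example; real corpora are far larger and usually need more careful tokenization.

```python
import re
from collections import Counter

# Hypothetical toy corpus: three short "documents" invented for illustration.
corpus = [
    "The cat sat on the mat.",
    "The dog sat on the log.",
    "A cat and a dog met.",
]

def tokenize(text):
    """Lowercase the text and extract runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())

def word_frequencies(documents):
    """Count how often each token occurs across all documents."""
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(doc))
    return counts

freqs = word_frequencies(corpus)
total = sum(freqs.values())

# Relative frequency of one word: its occurrences divided by all tokens.
relative_the = freqs["the"] / total

print(freqs.most_common(3))
print(f"'the' makes up {relative_the:.1%} of the tokens")
```

Relative frequencies of this kind are one possible starting point for the statistical comparisons mentioned above, for example comparing how often a word appears in two different corpora.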