NLP Resources – the Beauty of Corpora

A corpus (plural: corpora) is a linguistic resource consisting of a large, organized collection of texts, usually stored and processed electronically. In corpus linguistics, corpora are used to carry out statistical analyses and hypothesis tests, to check how often linguistic items occur, and to validate linguistic rules within a particular language domain. Corpora are the primary knowledge base for corpus linguistics.
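
To make the idea of counting occurrences concrete, here is a minimal sketch in Python. The two-sentence `corpus` list and the helper names `tokenize` and `word_frequencies` are illustrative assumptions, not part of any particular corpus toolkit; a real corpus would be far larger and would normally call for a proper tokenizer.

```python
import re
from collections import Counter

# Hypothetical toy corpus; a real corpus would hold many more documents.
corpus = [
    "The cat sat on the mat.",
    "The dog chased the cat.",
]

def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())

def word_frequencies(documents):
    """Count how often each token occurs across all documents."""
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(doc))
    return counts

freqs = word_frequencies(corpus)

# Print the three most frequent tokens with their counts.
for word, count in freqs.most_common(3):
    print(word, count)
```

Raw frequency counts like these are usually the starting point for the statistical analyses mentioned above, for example comparing how often a word occurs in two different corpora.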

