Book: News across Five Continents
Chapter: 6. Corpus Design
Chapter 6 describes the dataset used in this study. The sampling method, the size and processing of the data, and the annotation and mark-up are each explained in detail. To make the corpus design and sampling decisions transparent, the notion of representativeness is also discussed.