TY  - JOUR
A1  - Krüger, K. R.
A1  - Lukowiak, A.
A1  - Sonntag, J.
A1  - Warzecha, Saskia
A1  - Stede, Manfred
T1  - Classifying news versus opinions in newspapers
BT  - linguistic features for domain independence
JF  - Natural language engineering
N2  - Newspaper text can be broadly divided in the classes ‘opinion’ (editorials, commentary, letters to the editor) and ‘neutral’ (reports). We describe a classification system for performing this separation, which uses a set of linguistically motivated features. Working with various English newspaper corpora, we demonstrate that it significantly outperforms bag-of-lemma and PoS-tag models. We conclude that the linguistic features constitute the best method for achieving robustness against change of newspaper or domain.
Y1  - 2017
U6  - https://doi.org/10.1017/S1351324917000043
SN  - 1351-3249
SN  - 1469-8110
VL  - 23
SP  - 687
EP  - 707
PB  - Cambridge Univ. Press
CY  - Cambridge
ER  -