TY - JOUR A1 - Avner, Ehud Alexander A1 - Ordan, Noam A1 - Wintner, Shuly T1 - Identifying translationese at the word and sub-word level JF - Digital Scholarship in the Humanities N2 - We use text classification to distinguish automatically between original and translated texts in Hebrew, a morphologically complex language. To this end, we design several linguistically informed feature sets that capture word-level and sub-word-level (in particular, morphological) properties of Hebrew. Such features are abstract enough to allow for the development of accurate, robust classifiers, and they also lend themselves to linguistic interpretation. Careful evaluation shows that some of the classifiers we define are, indeed, highly accurate, and scale up nicely to domains that they were not trained on. In addition, analysis of the best features provides insight into the morphological properties of translated texts. Y1 - 2016 U6 - https://doi.org/10.1093/llc/fqu047 SN - 2055-7671 SN - 2055-768X VL - 31 SP - 30 EP - 54 PB - Oxford Univ. Press CY - Oxford ER -