Parsing costs as predictors of reading difficulty: An evaluation using the Potsdam Sentence Corpus
The surprisal of a word on a probabilistic grammar constitutes a promising complexity metric for human sentence comprehension difficulty. Using two different grammar types, surprisal is shown to have an effect on fixation durations and regression probabilities in a sample of German readers’ eye movements, the Potsdam Sentence Corpus. A linear mixed-effects model was used to quantify the effect of surprisal while taking into account unigram and bigram frequency, word length, and empirically derived word predictability; the so-called “early” and “late” measures of processing difficulty both showed an effect of surprisal. Surprisal is also shown to have a small but statistically non-significant effect on empirically derived predictability itself. This work thus demonstrates the importance of including parsing costs as a predictor of comprehension difficulty in models of reading, and suggests that a simple identification of syntactic parsing costs with early measures and late measures with durations of post-syntactic events may be difficult to uphold.
Authors: | Marisa Ferrara Boston, John Hale, Reinhold Kliegl, Umesh Patil, Shravan Vasishth |
---|---|
URN: | urn:nbn:de:kobv:517-opus-57139 |
Series (volume number): | Zweitveröffentlichungen der Universität Potsdam : Humanwissenschaftliche Reihe (paper 253) |
Publication type: | Postprint |
Language: | English |
Year of publication: | 2008 |
Publishing institution: | Universität Potsdam |
Release date: | 13 December 2011 |
Source: | Journal of Eye Movement Research. - ISSN 1995-8692. - 2 (2008), 1, pp. 1-12 |
Organizational units: | External / External |
 | Humanwissenschaftliche Fakultät / Strukturbereich Kognitionswissenschaften / Department Psychologie |
DDC classification: | 4 Language / 40 Language / 400 Language |
Name of the institution at the time of publication: | Humanwissenschaftliche Fakultät / Institut für Psychologie |
License (German): | No public license: under copyright protection |
External note: | first published in: Journal of Eye Movement Research. 2 (2008), 1, pp. 1-12 |