It is a common finding that preschoolers have difficulty identifying who is doing what to whom in non-canonical sentences, such as object-verb-subject (OVS) and passive sentences in German. This dissertation investigates how German monolingual and German-Italian simultaneous bilingual children process German OVS sentences (Study 1) and German passives (Study 2). Offline data (i.e., accuracy data) and online data (i.e., eye-gaze and pupillometry data) were analyzed to explore whether children can assign thematic roles during sentence comprehension and processing. Executive functions and language-internal and language-external factors were investigated as potential predictors of children’s sentence comprehension and processing.
Throughout the literature, there are contradictory findings on the relation between language and executive functions. While some results show a bilingual cognitive advantage over monolingual speakers, others suggest there is no relationship between bilingualism and executive functions. If bilingual children possess more advanced executive function abilities than monolingual children, this might also be reflected in better performance on linguistic tasks. In the current studies, monolingual and bilingual children were tested by means of two executive function tasks: the Flanker task and the task-switching paradigm. However, the findings showed no bilingual cognitive advantage and no better performance by bilingual children in the linguistic tasks. Performance was comparable between bilingual and monolingual children, or even better for the monolingual group. This may be due to cross-linguistic influences and language experience (i.e., language input and output). Italian was used because it does not syntactically overlap with the structure of German OVS sentences, and it overlaps with only one of the two sentence conditions used in the passive study with respect to subject-(finite) verb alignment. The findings showed better performance by bilingual children on the passive sentence structure that syntactically overlaps in the two languages, providing evidence for cross-linguistic influences.
Further factors in children’s sentence comprehension were considered. Parents’ education, the number of older siblings, and language experience variables were derived from a language background questionnaire completed by parents. Receptive vocabulary and grammar, visual and short-term memory, and reasoning ability were measured by means of standardized tests. Higher German language experience in bilinguals correlated with better accuracy on German OVS sentences but not on passive sentences. Memory capacity had a positive effect on the comprehension of OVS and passive sentences in the bilingual group. Additionally, executive function abilities played a role in the comprehension of OVS sentences but not of passive sentences. This suggests that executive function abilities might help children in the sentence comprehension task when the linguistic structures are not yet fully mastered.
Altogether, these findings show that bilinguals’ poorer performance in the comprehension and processing of German OVS is mainly due to reduced language experience in German, and that the different performance of bilingual children with the two types of passives is mainly due to cross-linguistic influences.
Èto-clefts are Russian focus constructions with the demonstrative pronoun èto ‘this’ at the beginning: “Èto Mark vyigral gonku” (“It was Mark who won the race”). They are often compared with English it-clefts, German es-clefts, and the corresponding focus-background structures in other languages.
In terms of semantics, èto-clefts have two important properties that are cross-linguistically typical for clefts: existence presupposition (“Someone won the race”) and exhaustivity (“Nobody except Mark won the race”). However, the exhaustivity effects are not as strong as those in structures with the exclusive only, and they require more research.
At the same time, the question of whether the syntactic structure of èto-clefts matches the biclausal structure of English and German clefts remains open. There are arguments in favor of both biclausality and monoclausality. Moreover, there is no consensus regarding the status of èto itself.
Finally, the information structure of èto-clefts has remained underexplored in the existing literature.
This research investigates the information-structural, syntactic, and semantic properties of Russian clefts, both theoretically (supported by examples from Russian text corpora and judgments from native speakers) and experimentally. It is determined which desired changes in the information structure motivate native speakers to choose an èto-cleft and not the canonical structure or other focus realization tools. Novel syntactic tests are conducted to find evidence for bi-/monoclausality of èto-clefts, as well as for base-generation or movement of the cleft pivot. It is hypothesized that èto has a certain important function in clefts, and its status is investigated. Finally, new experiments on the nature of exhaustivity in èto-clefts are conducted. They allow for direct cross-linguistic comparison, using an incremental-information paradigm with truth-value judgments.
In terms of information structure, this research makes a new proposal that presents èto-clefts as structures with an inherent focus-background bipartitioning. Even though èto-clefts are used in typical focus contexts, evidence was found that èto-clefts (as well as Russian thetic clefts) allow for both new information focus and contrastive focus. Èto-clefts are pragmatically acceptable when a singleton answer to the implied question is expected (e.g. “It was Mark who won the race” but not “It was Mark who came to the party”). Importantly, èto in Russian clefts is neither a dummy nor redundant. It is a topic expression; it conveys familiarity, which triggers the existence presupposition; it refers to an instantiated event or a known/perceivable situation; and it plays an important role in spoken language as a tool for speech coherence and as a focus marker.
In terms of syntax, this research makes a new monoclausal proposal and shows evidence that the cleft pivot undergoes movement to the left peripheral position. Èto is proposed to be TopP.
Finally, in terms of semantics, a novel cross-linguistic evaluation of Russian clefts is made. Experiments show that the exhaustivity inference in èto-clefts is not robust. Participants used different strategies in resolving exhaustivity and fell into two groups: one group considered èto-clefts exhaustive, while the other considered them non-exhaustive. Hence, there is evidence for the pragmatic nature of exhaustivity in èto-clefts. The experimental results for èto-clefts are similar to the experimental results for clefts in German, French and Akan. It is concluded that speakers use different tools available in their languages to produce structures with similar interpretive properties.
The present dissertation investigates changes in lingual coarticulation across childhood in German-speaking children from three to nine years of age and in adults. Coarticulation refers to the mismatch between abstract phonological units and their seemingly commingled realization in continuous speech. Because coarticulation is a process at the intersection of phonology and phonetics, addressing its changes across childhood allows for insights into speech motor as well as phonological development. Because specific predictions for changes in coarticulation across childhood can be derived from existing speech production models, investigating children’s coarticulatory patterns can help us model human speech production.
While coarticulatory changes may shed light on some of the central questions of speech production development, previous studies on the topic were sparse and presented a puzzling picture of conflicting findings. One of the reasons for this lack is the difficulty in articulatory data acquisition in a young population. Within the research program this dissertation is embedded in, we accepted this challenge and successfully set up the hitherto largest corpus of articulatory data from children using ultrasound tongue imaging. In contrast to earlier studies, a high number of participants in tight age cohorts across a wide age range and a thoroughly controlled set of pseudowords allowed for statistically powerful investigations of a process known as variable and complicated to track.
The specific focus of my studies is on lingual vocalic coarticulation as measured in the horizontal position of the highest point of the tongue dorsum. Based on three studies on a) anticipatory coarticulation towards the left, b) carryover coarticulation towards the right side of the utterance, and c) anticipatory coarticulatory extent in repeated versus read-aloud speech, I derive the following main theses:
1. Maturing speech motor control is responsible for some developmental changes in coarticulation.
2. Coarticulation can be modeled as the coproduction of articulatory gestures.
3. The developmental change in coarticulation results from a decrease of vocalic activation width.
This dissertation examines the integration of incongruent visual-scene and morphological-case information (“cues”) in building thematic-role representations of spoken relative clauses in German.
Addressing the mutual influence of visual and linguistic processing, the Coordinated Interplay Account (CIA) describes a two-step mechanism supporting visuo-linguistic integration (Knoeferle & Crocker, 2006, Cog Sci). However, the outcomes and dynamics of integrating incongruent thematic-role representations from distinct sources have rarely been investigated. Further, there is evidence that both second-language (L2) and older speakers may rely on non-syntactic cues relatively more than first-language (L1)/young speakers. Yet, the role of visual information for thematic-role comprehension has not been measured in L2 speakers, and only to a limited extent across the adult lifespan.
Thematically unambiguous canonically ordered (subject-extracted) and noncanonically ordered (object-extracted) spoken relative clauses in German (see 1a-b) were presented in isolation and alongside visual scenes conveying either the same (congruent) or the opposite (incongruent) thematic relations as the sentence did.
1 a Das ist der Koch, der die Braut verfolgt.
This is the.NOM cook who.NOM the.ACC bride follows
This is the cook who is following the bride.
b Das ist der Koch, den die Braut verfolgt.
This is the.NOM cook whom.ACC the.NOM bride follows
This is the cook whom the bride is following.
The relative contribution of each cue to thematic-role representations was assessed with agent identification. Accuracy and latency data were collected post-sentence from a sample of L1 and L2 speakers (Zona & Felser, 2023), and from a sample of L1 speakers from across the adult lifespan (Zona & Reifegerste, under review). In addition, the moment-by-moment dynamics of thematic-role assignment were investigated with mouse tracking in a young L1 sample (Zona, under review).
The following questions were addressed: (1) How do visual scenes influence thematic-role representations of canonical and noncanonical sentences? (2) How does reliance on visual-scene, case, and word-order cues vary in L1 and L2 speakers? (3) How does reliance on visual-scene, case, and word-order cues change across the lifespan?
The results showed reliable effects of incongruence between visually and linguistically conveyed thematic relations on thematic-role representations. Incongruent (vs. congruent) scenes yielded slower and less accurate responses to agent-identification probes presented post-sentence. The recently inspected agent was considered the most likely agent ~300 ms after trial onset, and the convergence of visual scenes and word order enabled comprehenders to assign thematic roles predictively.
L2 (vs. L1) participants relied more on word order overall. In response to noncanonical clauses presented with incongruent visual scenes, sensitivity to case predicted the size of incongruence effects better than L1-L2 grouping. These results suggest that the individual’s ability to exploit specific cues might predict their weighting.
Sensitivity to case was stable throughout the lifespan, while visual effects increased with increasing age and were modulated by individual interference-inhibition levels. Thus, age-related changes in comprehension may stem from stronger reliance on visually (vs. linguistically) conveyed meaning.
These patterns represent evidence for a recent-role preference – i.e., a tendency to re-assign visually conveyed thematic roles to the same referents in temporally coordinated utterances. The findings (i) extend the generalizability of CIA predictions across stimuli, tasks, populations, and measures of interest, (ii) contribute to specifying the outcomes and mechanisms of detecting and indexing incongruent representations within the CIA, and (iii) speak to current efforts to understand the sources of variability in sentence comprehension.
This dissertation focuses on the handling of time in dialogue. Specifically, it investigates how humans bridge time, or “buy time”, when they are expected to convey information that is not yet available to them (e.g. a travel agent searching for a flight in a long list while the customer is on the line, waiting). It also explores the feasibility of modeling such time-bridging behavior in spoken dialogue systems, and it examines how endowing such systems with more human-like time-bridging capabilities may affect humans’ perception of them.
The relevance of time-bridging in human-human dialogue seems to stem largely from a need to avoid lengthy pauses, as these may cause both confusion and discomfort among the participants of a conversation (Levinson, 1983; Lundholm Fors, 2015). However, this avoidance of prolonged silence is at odds with the incremental nature of speech production in dialogue (Schlangen and Skantze, 2011): Speakers often start to verbalize their contribution before it is fully formulated, and sometimes even before they possess the information they need to provide, which may result in them running out of content mid-turn.
In this work, we elicit conversational data from humans, to learn how they avoid being silent while they search for information to convey to their interlocutor. We identify commonalities in the types of resources employed by different speakers, and we propose a classification scheme. We explore ways of modeling human time-buying behavior computationally, and we evaluate the effect on human listeners of embedding this behavior in a spoken dialogue system.
Our results suggest that a system using conversational speech to bridge time while searching for information to convey (as humans do) can provide a better experience in several respects than one which remains silent for a long period of time. However, not all speech serves this purpose equally: our experiments also show that a system whose time-buying behavior is more varied (i.e. which exploits several categories from the classification scheme we developed and samples them based on information from human data) can prevent overestimation of waiting time when compared, for example, with a system that repeatedly asks the interlocutor to wait (even if these requests for waiting are phrased differently each time). Finally, this research shows that it is possible to model human time-buying behavior on a relatively small corpus, and that a system using such a model can be preferred by participants over one employing a simpler strategy, such as randomly choosing utterances to produce during the wait, even when the utterances used by both strategies are the same.
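The idea of sampling time-buying categories "based on information from human data" can be illustrated with a minimal sketch. It assumes that each category of the classification scheme has an observed frequency in a human corpus and that the system draws categories in proportion to those frequencies. The category names and counts below are hypothetical placeholders, not the scheme or the figures from the dissertation, and the function name `sample_categories` is likewise invented for illustration.

```python
import random
from typing import Dict, List, Optional

# Hypothetical category counts; real values would come from annotated human data.
CATEGORY_COUNTS: Dict[str, int] = {
    "filler": 40,
    "status_update": 30,
    "echo_request": 20,
    "wait_request": 10,
}


def sample_categories(
    n: int, counts: Dict[str, int], rng: Optional[random.Random] = None
) -> List[str]:
    """Draw n time-buying categories in proportion to their corpus counts."""
    rng = rng or random.Random()
    categories = list(counts)
    weights = [counts[c] for c in categories]
    return rng.choices(categories, weights=weights, k=n)


# A seeded generator makes the sampled plan reproducible.
plan = sample_categories(5, CATEGORY_COUNTS, random.Random(0))
print(plan)
```

A system of this kind would then realize each sampled category with a concrete utterance. The contrast drawn in the abstract is with a strategy that ignores the human distribution, for instance one that picks utterances uniformly at random.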
Successful sentence comprehension requires the comprehender to correctly figure out who did what to whom. For example, in the sentence John kicked the ball, the comprehender has to figure out who did the action of kicking and what was being kicked. This process of identifying and connecting the syntactically related words in a sentence is called dependency completion. What are the cognitive constraints that determine dependency completion? A widely accepted theory is cue-based retrieval. The theory maintains that dependency completion is driven by a content-addressable search for the co-dependents in memory. Cue-based retrieval explains a wide range of empirical data from several constructions, including subject-verb agreement, subject-verb non-agreement, plausibility mismatch configurations, and negative polarity items.
However, there are two major empirical challenges to the theory: (i) data from grammatical sentences with subject-verb number agreement dependencies, where the theory predicts a slowdown at the verb in sentences like the key to the cabinet was rusty compared to the key to the cabinets was rusty, but the data are inconsistent with this prediction; and (ii) data from antecedent-reflexive dependencies, where a facilitation in reading times is predicted at the reflexive in the bodybuilder who worked with the trainers injured themselves vs. the bodybuilder who worked with the trainer injured themselves, but the data do not show a facilitatory effect.
The work presented in this dissertation is dedicated to building a more general theory of dependency completion that can account for the above two datasets without losing the original empirical coverage of the cue-based retrieval assumption. In two journal articles, I present computational modeling work that addresses the above two empirical challenges.
To explain the grammatical sentences’ data from subject-verb number agreement dependencies, I propose a new model that assumes that the cue-based retrieval operates on a probabilistically distorted representation of nouns in memory (Article I). This hybrid distortion-plus-retrieval model was compared against the existing candidate models using data from 17 studies on subject-verb number agreement in 4 languages. I find that the hybrid model outperforms the existing models of number agreement processing suggesting that the cue-based retrieval theory must incorporate a feature distortion assumption.
To account for the absence of facilitatory effect in antecedent-reflexive dependencies, I propose an individual difference model, which was built within the cue-based retrieval framework (Article II). The model assumes that individuals may differ in how strongly they weigh a syntactic cue over a number cue. The model was fitted to data from two studies on antecedent-reflexive dependencies, and the participant-level cue-weighting was estimated. We find that one-fourth of the participants, in both studies, weigh the syntactic cue higher than the number cue in processing reflexive dependencies and the remaining participants weigh the two cues equally. The result indicates that the absence of predicted facilitatory effect at the level of grouped data is driven by some, not all, participants who weigh syntactic cues higher than the number cue. More generally, the result demonstrates that the assumption of differential cue weighting is important for a theory of dependency completion processes. This differential cue weighting idea was independently supported by a modeling study on subject-verb non-agreement dependencies (Article III).
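The effect of differential cue weighting can be illustrated with a deliberately simplified sketch. It is not the model fitted in Article II. It assumes only that each candidate noun receives a score equal to the summed weights of the retrieval cues it matches, and that retrieval probability is proportional to that score. The feature names (`c_command`, `number`), the weights, and the function names are hypothetical.

```python
from typing import Dict

Features = Dict[str, str]


def match_score(item: Features, cues: Features, weights: Dict[str, float]) -> float:
    """Sum the weights of all retrieval cues whose value the item matches."""
    return sum(weights[cue] for cue, value in cues.items() if item.get(cue) == value)


def retrieval_probabilities(
    items: Dict[str, Features], cues: Features, weights: Dict[str, float]
) -> Dict[str, float]:
    """Normalize match scores into retrieval probabilities (toy assumption)."""
    scores = {name: match_score(feats, cues, weights) for name, feats in items.items()}
    total = sum(scores.values())
    if total == 0:
        # No cue matches any item: fall back to a uniform choice.
        return {name: 1.0 / len(items) for name in items}
    return {name: score / total for name, score in scores.items()}


# Hypothetical encoding of
# "the bodybuilder who worked with the trainers injured themselves".
ITEMS = {
    "bodybuilder": {"c_command": "yes", "number": "singular"},
    "trainers": {"c_command": "no", "number": "plural"},
}
# Cues set by the reflexive "themselves": a c-commanding, plural antecedent.
CUES = {"c_command": "yes", "number": "plural"}

equal = retrieval_probabilities(ITEMS, CUES, {"c_command": 1.0, "number": 1.0})
syntax_heavy = retrieval_probabilities(ITEMS, CUES, {"c_command": 2.0, "number": 1.0})
print(equal)
print(syntax_heavy)
```

With equal weights, each noun matches exactly one cue, so the two compete on equal terms. Raising the weight of the syntactic cue shifts retrieval toward the structurally licensed antecedent, which reduces the influence of the number-matching distractor.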
Overall, cue-based retrieval, as a general theory of dependency completion, needs to incorporate two new assumptions: (i) the nouns stored in memory can undergo probabilistic feature distortion, and (ii) the linguistic cues used for retrieval can be weighted differentially. This is the cumulative result of the modeling work presented in this dissertation.
The dissertation makes an important theoretical contribution: Sentence comprehension in humans is driven by a mechanism that assumes cue-based retrieval, probabilistic feature distortion, and differential cue weighting. This insight is theoretically important because there is some independent support for these three assumptions in sentence processing and the broader memory literature. The modeling work presented here is also methodologically important because for the first time, it demonstrates (i) how the complex models of sentence processing can be evaluated using data from multiple studies simultaneously, without oversimplifying the models, and (ii) how the inferences drawn from the individual-level behavior can be used in theory development.
This thesis is concerned with the phenomenon of quantifier scope ambiguities. This phenomenon has been researched extensively, both from a theoretical and from an empirical point of view. Nevertheless, there are still a number of under-researched topics in the field of quantifier scope, which will be the main focus of this thesis. I will take a closer look at three languages: English, German, and the Asante Twi dialect of Akan (Kwa, Niger-Congo). The goal is a better understanding of the phenomenon of quantifier scope both within each language and from a cross-linguistic perspective. First, this thesis will provide a series of experiments that allow a direct cross-linguistic comparison between English and German, two languages about which specific claims have been made in the literature. I will also provide exploratory research in the case of Asante Twi, where so far no work has been dedicated specifically to the study of quantifier scope. The work on Asante Twi will go beyond quantifier scope and also target the quantifier and determiner system in general. The question is not only whether particular scope readings are possible, but also which factors contribute to an increase or decrease of scope availability, and whether there are factors that block certain scope readings altogether. While some of the results confirm and thereby strengthen previous claims, other results contradict general assumptions in the literature. This is particularly the case for inverse readings in German and inverse readings across clause boundaries.
Early sensitivity to prosodic phrase boundary cues: Behavioral evidence from German-learning infants
(2023)
This dissertation seeks to shed light on the relation of phrasal prosody and developmental speech perception in German-learning infants. Three independent empirical studies explore the role of acoustic correlates of major prosodic boundaries, specifically pitch change, final lengthening, and pause, in infant boundary perception. Moreover, it was examined whether the sensitivity to prosodic phrase boundary markings changes during the first year of life as a result of perceptual attunement to the ambient language (Aslin & Pisoni, 1980).
Using the headturn preference procedure, six- and eight-month-old monolingual German-learning infants were tested on their discrimination of two different prosodic groupings of the same list of coordinated names, either with or without an internal intonation phrase boundary (IPB) after the second name, that is, [Moni und Lilli] [und Manu] or [Moni und Lilli und Manu]. The boundary marking was systematically varied with respect to single prosodic cues or specific cue combinations.
Results revealed that six- and eight-month-old German-learning infants successfully detect the internal prosodic boundary when it is signaled by all three main boundary cues: pitch change, final lengthening, and pause. For eight-, but not for six-month-olds, the combination of pitch change and final lengthening, without the occurrence of a pause, is sufficient. This mirrors adult-like perception by eight months (Holzgrefe-Lang et al., 2016). Six-month-olds detect a prosodic phrase boundary signaled by final lengthening and pause. The findings suggest a developmental change in German prosodic boundary cue perception from a strong reliance on the pause cue at six months to a differentiated sensitivity to the more subtle cues pitch change and final lengthening at eight months. For neither six- nor eight-month-olds is the occurrence of pitch change or final lengthening as a single cue sufficient, similar to what has been observed for adult speakers of German (Holzgrefe-Lang et al., 2016).
The present dissertation provides new scientific knowledge on infants’ sensitivity to individual prosodic phrase boundary cues in the first year of life. Methodologically, the studies are pathbreaking since they used exactly the same stimulus materials (phonologically thoroughly controlled lists of names) that have also been used with adults (Holzgrefe-Lang et al., 2016) and with infants in a neurophysiological paradigm (Holzgrefe-Lang, Wellmann, Höhle, & Wartenburger, 2018), allowing for comparisons across age (six/eight months and adults) and method (behavioral vs. neurophysiological methods). Moreover, the materials are suited to be transferred to other languages, allowing for a crosslinguistic comparison. Taken together with a study with similar French materials (van Ommen et al., 2020), the observed change in sensitivity in German-learning infants can be interpreted as a language-specific one, from an initial language-general processing mechanism that primarily focuses on the presence of pauses to a language-specific processing that takes into account prosodic properties available in the ambient language. The developmental pattern is discussed as an interplay of acoustic salience, prosodic typology (prosodic regularity) and cue reliability.
The main goal of this dissertation is to experimentally investigate how focus is realised, perceived, and processed by native Turkish speakers, independent of preconceived notions of positional restrictions. Crucially, there are various issues and scientific debates surrounding focus in the Turkish language in the existing literature (chapter 1). It is argued in this dissertation that two factors led to the stagnant literature on focus in Turkish: the lack of clearly defined, modern understandings of information structure and its fundamental notion of focus, and the ongoing and ill-defined debate surrounding the question of whether there is an immediately preverbal focus position in Turkish. These issues gave rise to specific research questions addressed across this dissertation. Specifically, we were interested in how the focus dimensions such as focus size (comparing narrow constituent and broad sentence focus), focus target (comparing narrow subject and narrow object focus), and focus type (comparing new-information and contrastive focus) affect Turkish focus realisation and, in turn, focus comprehension when speakers are provided syntactic freedom to position focus as they see fit.
To provide data on these core goals, we present three behavioural experiments based on a systematic framework of information structure and its notions (chapter 2): (i) a production task with trigger wh-questions and contextual animations manipulated to elicit the focus dimensions of interest (chapter 3), (ii) a timed acceptability judgment task in which participants listened to the recorded answers from our production task (chapter 4), and (iii) a self-paced reading task to gather on-line processing data (chapter 5).
Based on the results of the conducted experiments, multiple conclusions are drawn in this dissertation (chapter 6). Firstly, this dissertation demonstrated empirically that there is no focus position in Turkish, neither in the sense of a strict focus position language nor as a focally loaded position facilitating focus perception and/or processing. While focus is, in fact, syntactically variable in the Turkish preverbal area, this is a consequence of movement triggered by other IS aspects like topicalisation and backgrounding, and the observational markedness of narrow subject focus compared to narrow object focus. As for focus type in Turkish, this dimension is not associated with word order in production, perception, or processing. Significant acoustic correlates of focus size (broad sentence focus vs narrow constituent focus) and focus target (narrow subject focus vs narrow object focus) were observed in fundamental frequency and intensity, representing focal boost, (postfocal) deaccentuation, and the presence or absence of a phrase-final rise in the prenucleus, while the perceivability of these effects remains to be investigated. In contrast, no acoustic correlates of focus type in simple, three-word transitive structures were observed, with focus types being interchangeable in mismatched question-answer pairs. Overall, the findings of this dissertation highlight the need for experimental investigations regarding focus in Turkish, as theoretical predictions do not necessarily align with experimental data. As such, the fallacy of implying causation from correlation should be strictly kept in mind, especially when constructions coincide with canonical structures, such as the immediately preverbal position in narrow object foci. Finally, numerous open questions remain to be explored, especially as focus and word order in Turkish are multifaceted. As shown, givenness is a confounding factor when investigating focus types, while thematic role assignment potentially confounds word order preferences. Further research based on established, modern information structure frameworks is needed, with chapter 5 concluding with specific recommendations for such future research.
Neural conversation models aim to predict appropriate contributions to a (given) conversation by using neural networks trained on dialogue data. A specific strand focuses on non-goal-driven dialogues, first proposed by Ritter et al. (2011), who investigated the task of transforming an utterance into an appropriate reply. This strand then evolved into dialogue system approaches using long dialogue histories and additional background context. Contributing meaningfully and appropriately to a conversation is a complex task, and therefore research in this area has been very diverse: Serban et al. (2016), for example, looked into utilizing variable-length dialogue histories, Zhang et al. (2018) added additional context to the dialogue history, Wolf et al. (2019) proposed a model based on pre-trained Self-Attention neural networks (Vaswani et al., 2017), and Dinan et al. (2021) investigated safety issues of these approaches. This trend can be seen as a transformation from trying to somehow carry on a conversation to generating appropriate replies in a controlled and reliable way.
In this thesis, we first elaborate the meaning of appropriateness in the context of neural conversation models by drawing inspiration from the Cooperative Principle (Grice, 1975). We define what an appropriate contribution has to be by operationalizing its maxims as demands on conversation models: being fluent, informative, consistent with the given context, coherent, and following a social norm. Then, we identify different targets (or intervention points) for achieving conversational appropriateness by investigating recent research in that field.
In this thesis, we investigate the aspect of consistency towards context in greater detail, being one aspect of our interpretation of appropriateness.
During the research, we developed a new context-based dialogue dataset (KOMODIS) that combines factual and opinionated context with dialogues. The KOMODIS dataset is publicly available, and we use the data in this thesis to gather new insights into context-augmented dialogue generation.
We further introduced a new way of encoding context within Self-Attention based neural networks. For that, we elaborate on the issue of space complexity arising from knowledge graphs, and propose a concise encoding strategy for structured context, inspired by graph neural networks (Gilmer et al., 2017), to reduce the space complexity of the additional context. We discuss limitations of context-augmentation for neural conversation models, explore the characteristics of knowledge graphs, and explain how we create and augment knowledge graphs for our experiments.
Lastly, we analyzed the potential of reinforcement and transfer learning to improve context-consistency for neural conversation models. We find that current reward functions need to be more precise to enable the potential of reinforcement learning, and that sequential transfer learning can improve the subjective quality of generated dialogues.
The aim of this dissertation was to conduct a larger-scale cross-linguistic empirical investigation of similarity-based interference effects in sentence comprehension.
Interference studies can offer valuable insights into the mechanisms that are involved in long-distance dependency completion.
Many studies have investigated similarity-based interference effects, showing that syntactic and semantic information are employed during long-distance dependency formation (e.g., Arnett & Wagers, 2017; Cunnings & Sturt, 2018; Van Dyke, 2007; Van Dyke & Lewis, 2003; Van Dyke & McElree, 2011). Nevertheless, there are some important open questions in the interference literature that are critical to our understanding of the constraints involved in dependency resolution.
The first research question concerns the relative timing of syntactic and semantic interference in online sentence comprehension. Only a few interference studies have investigated this question, and, to date, there is not enough data to draw conclusions with regard to their time course (Van Dyke, 2007; Van Dyke & McElree, 2011).
Our first cross-linguistic study explores the relative timing of syntactic and semantic interference in two eye-tracking reading experiments that implement the study design used in Van Dyke (2007). The first experiment tests English sentences. The second, larger-sample experiment investigates the two interference types in German.
Overall, the data suggest that syntactic and semantic interference can arise simultaneously during retrieval.
The second research question concerns a special case of semantic interference: We investigate whether cue-based retrieval interference can be caused by semantically similar items which are not embedded in a syntactic structure.
This second interference study builds on a landmark study by Van Dyke & McElree (2006). The study design used in their study is unique in that it is able to pin down the source of interference as a consequence of cue overload during retrieval, when semantic retrieval cues do not uniquely match the retrieval target. Unlike most other interference studies, this design is able to rule out encoding interference as an alternative explanation. Encoding accounts postulate that it is not cue overload at the retrieval site but the erroneous encoding of similar linguistic items in memory that leads to interference (Lewandowsky et al., 2008; Oberauer & Kliegl, 2006). While Van Dyke & McElree (2006) reported cue-based retrieval interference from sentence-external distractors, the evidence for this effect was weak. A subsequent study did not show interference of this type (Van Dyke et al., 2014). Given these inconclusive findings, further research is necessary to investigate semantic cue-based retrieval interference.
The second study in this dissertation provides a larger-scale cross-linguistic investigation of cue-based retrieval interference from sentence-external items. Three larger-sample eye-tracking studies in English, German, and Russian tested cue-based interference in the online processing of filler-gap dependencies. This study further extends the previous research by investigating interference in each language under varying task demands (Logačev & Vasishth, 2016; Swets et al., 2008).
Overall, we see some very modest support for proactive cue-based retrieval interference in English. Unexpectedly, this was observed only under a low task demand. In German and Russian, there is some evidence against the interference effect. It is possible that interference is attenuated in languages with richer case marking.
In sum, the cross-linguistic experiments on the time course of syntactic and semantic interference from sentence-internal distractors support existing evidence of syntactic and semantic interference during sentence comprehension. Our data further show that both types of interference effects can arise simultaneously. Our cross-linguistic experiments investigating semantic cue-based retrieval interference from sentence-external distractors suggest that this type of interference may arise only in specific linguistic contexts.
This project describes the nominal, verbal and ‘truncation’ systems of Awing and explains the syntactic and semantic functions of the multifunctional LE morpheme in copular and wh-focused constructions. Awing is a Bantu Grassfields language spoken in the North West region of Cameroon. The work begins with morphological processes viz. deverbals, compounding, reduplication, borrowing and a thorough presentation of the pronominal system and takes on verbal categories viz. tense, aspect, mood, verbal extensions, negation, adverbs and triggers of a homorganic N(asal)-prefix that attaches to the verb and other verbal categories. Awing grammar also has a very unusual phenomenon whereby nouns and verbs take long and short forms. A chapter entitled truncation is dedicated to the phenomenon. It is observed that the truncation process does not apply to bare singular NPs, proper names and nouns derived via morphological processes. On the other hand, with the exception of the 1st person non-emphatic possessive determiner and the class 7 noun prefix, nouns generally take the truncated form with modifiers (i.e., articles, demonstratives and other possessives). It is concluded that nominal truncation depicts movement within the DP system (Abney 1987). Truncation of the verb occurs in three contexts: a mass/plurality conspiracy (or lattice structuring in terms of Link 1983) between the verb and its internal argument (i.e., direct object); a means to align (exhaustive) focus (in the sense of Féry 2013); and a means to form polar questions.
The second part of the work focuses on the role of the LE morpheme in copular and wh-focused clauses. Firstly, the syntax of the Awing copular clause is presented and it is shown that copular clauses in Awing have ‘subject-focus’ vs ‘topic-focus’ partitions and that the LE morpheme indirectly relates such functions. Semantically, it is shown that LE does not express contrast or exhaustivity in copular clauses. Turning to wh-constructions, the work adheres to Hamblin’s (1973) idea that the meaning of a question is the set of its possible answers and, based on Rooth’s (1985) underspecified semantic notion of alternative focus, concludes that the LE morpheme is not a Focus Marker (FM) in Awing: LE does not generate or indicate the presence of alternatives (Krifka 2007). The LE morpheme can associate with wh-elements as a focus-sensitive operator with semantic import that operates on the focus alternatives by presupposing an exhaustive answer, among other notions. With focalized categories, the project further substantiates, via a number of diagnostics, the claim in Fominyam & Šimík (2017), namely that exhaustivity is part of the semantics of the LE morpheme and not derived via contextual implicature. Hence, unlike in copular clauses, the LE morpheme with wh-focused categories is analysed as a morphological exponent of a functional head Exh corresponding to Horvath's (2010) EI (Exhaustive Identification). The work ends with the syntax of verb focus and negation and modifies the idea in Fominyam & Šimík (2017), namely that the focalized verb that associates with the exhaustive (LE) particle is a lower copy of the finite verb that has been moved to Agr. It is argued that the LE-focused verb ‘cluster’ is an instantiation of adjunction. The conclusion is that verb doubling with verb focus in Awing is neither a realization of two copies of one and the same verb (Fominyam and Šimík 2017), nor a result of a copy triggered by a focus marker (Aboh and Dyakonova 2009). Rather, the focalized copy is said to be merged directly as the complement of LE forming a type of adjoining cluster.
Previous behavioral studies showed that perceptual changes in infancy can be observed in multiple patterns, namely decline (e.g., Mattock et al., 2008; Yeung et al., 2013), maintenance (e.g., Chen & Kager, 2016) and U-shaped development (Liu & Kager, 2014).
This dissertation contributes further to the understanding of the developmental trajectory of phonological acquisition in infancy. The dissertation addresses the questions of how perceptual sensitivity to lexical tones and vowels changes in infancy and how different experimental procedures contribute to our understanding. We used three experimental procedures to investigate German-learning infants’ discrimination abilities. In Studies 1 and 3 (Chapters 5 and 7) we used behavioral methods (habituation and familiarization procedures) and in Study 2 (Chapter 6) we measured neural correlates.
Study 1 showed a U-shaped developmental pattern: 6- and 18-month-olds discriminated a lexical tone contrast, but 9-month-olds did not. In addition, we found an effect of experimental procedure: infants discriminated the tone contrast at 6 months in a habituation but not in a familiarization procedure. In Study 2, we observed mismatch responses (MMR) to a non-native tone contrast and a native-like vowel contrast in 6- and 9-month-olds. In 6-month-olds, both contrasts elicited positive MMRs. At 9 months, the vowel contrast elicited an adult-like negative MMR, while the tone contrast elicited a positive MMR. Study 3 demonstrated a change in perceptual sensitivity to a vowel contrast between 6 and 9 months. In contrast to the 6-month-old infants, the 9-month-old infants discriminated the tested vowel contrast asymmetrically.
We suggest that the shifts in perceptual sensitivity between 6 and 9 months are functional rather than perceptual. In the case of lexical tone discrimination, infants may have already learned by 9 months of age that pitch is not relevant at the lexical level in German, since the infants in Study 1 showed no perceptual sensitivity to the contrast tested. Nevertheless, the brain responded to the contrast, especially since pitch differences are also part of the German intonation system (Gussenhoven, 2004). The role of the intonation system in pitch discrimination could be supported by the recovery of behavioral discrimination at 18 months of age, as well as behavioral and neural discrimination in German-speaking adults.
Cleft exhaustivity
(2020)
In this dissertation, a series of experimental studies is presented which demonstrate that the exhaustive inference of focus-background it-clefts in English and their cross-linguistic counterparts in Akan, French, and German is neither robust nor systematic. The inter-speaker and cross-linguistic variability is accounted for with a discourse-pragmatic approach to cleft exhaustivity, in which, following Pollard & Yasavul 2016, the exhaustive inference is derived from an interaction with another layer of meaning, namely, the existence presupposition encoded in clefts.
In recent years, a substantial number of psycholinguistic studies and of studies on acquired language impairments have investigated the case of morphologically complex words. These have provided evidence for what is known as ‘morphological decomposition’, i.e. a mechanism that decomposes complex words into their constituent morphemes during online processing. This is believed to be a fundamental, possibly universal mechanism of morphological processing, operating irrespective of a word’s specific properties.
However, current accounts of morphological decomposition are mostly based on evidence from suffixed words and compound words, while prefixed words have been comparatively neglected. At the same time, it has been consistently observed that, across languages, prefixed words are less widespread than suffixed words. This cross-linguistic preference for suffixing morphology has been claimed to be grounded in language processing and language learning mechanisms. This would predict differences in how prefixed words are processed and therefore also affected in language impairments, challenging the predictions of the major accounts of morphological decomposition.
Against this background, the present thesis aims at reducing the gap between the accounts of morphological decomposition and the accounts of the suffixing preference, by providing a thorough empirical investigation of prefixed words. Prefixed words are examined in three different domains: (i) visual word processing in native speakers; (ii) visual word processing in non-native speakers; (iii) acquired morphological impairments. The processing studies employ the masked priming paradigm, tapping into early stages of visual word recognition. The studies on morphological impairments, by contrast, investigate the errors produced in reading aloud tasks.
As for native processing, the present work first focuses on derivation (Publication I), specifically investigating whether German prefixed derived words, both lexically restricted (e.g. inaktiv ‘inactive’) and unrestricted (e.g. unsauber ‘unclean’) can be efficiently decomposed. I then present a second study (Publication II) on a Bantu language, Setswana, which offers the unique opportunity of testing inflectional prefixes, and directly comparing priming with prefixed inflected primes (e.g. dikgeleke ‘experts’) to priming with prefixed derived primes (e.g. bokgeleke ‘talent’). With regard to non-native processing (Publication I), the priming effects obtained from the lexically restricted and unrestricted prefixed derivations in native speakers are additionally compared to the priming effects obtained in a group of non-native speakers of German. Finally, in the two studies on acquired morphological impairments, the thesis investigates whether prefixed derived words yield different error patterns than suffixed derived words (Publication III and IV).
For native speakers, the results show evidence for morphological decomposition of both types of prefixed words, i.e. lexically unrestricted and restricted derivations, as well as of prefixed inflected words. Furthermore, non-native speakers are also found to efficiently decompose prefixed derived words, with parallel results to the group of native speakers. I therefore conclude that, for the early stages of visual word recognition, the relative position of stem and affix in prefixed versus suffixed words does not affect how efficiently complex words are decomposed, either in native or in non-native processing. In the studies on acquired language impairments, instead, prefixes are consistently found to be more impaired than suffixes. This is explained in terms of a learnability disadvantage for prefixed words, which may cause weaker representations of the information encoded in affixes when these precede the stem (prefixes) as compared to when they follow it (suffixes). Based on the impairment profiles of the individual participants and on the nature of the task, this dissociation is assumed to emerge from later processing stages than those that are tapped into by masked priming. I therefore conclude that the different characteristics of prefixed and suffixed words do come into play at later processing stages, during which the lexical-semantic information contained in the different constituent morphemes is processed.
The findings presented in the four manuscripts significantly contribute to our current understanding of the mechanisms involved in processing prefixed words. Crucially, the thesis constrains the processing disadvantage for prefixed words to later processing stages, thereby suggesting that theories trying to establish links between language universals and processing mechanisms should more carefully consider the different stages involved in language processing and what factors are relevant for each specific stage.
The immense popularity of online communication services in the last decade has not only upended our lives (with news spreading like wildfire on the Web, presidents announcing their decisions on Twitter, and the outcome of political elections being determined on Facebook) but also dramatically increased the amount of data exchanged on these platforms. Therefore, if we wish to understand the needs of modern society better and want to protect it from new threats, we urgently need more robust, higher-quality natural language processing (NLP) applications that can recognize such necessities and menaces automatically, by analyzing uncensored texts. Unfortunately, most NLP programs today have been created for standard language, as we know it from newspapers, or, in the best case, adapted to the specifics of English social media.
This thesis reduces the existing deficit by entering the new frontier of German online communication and addressing one of its most prolific forms: users’ conversations on Twitter. In particular, it explores the ways in which people express their opinions on this service, examines current approaches to automatic mining of these feelings, and proposes novel methods, which outperform state-of-the-art techniques. For this purpose, I introduce a new corpus of German tweets that have been manually annotated with sentiments, their targets and holders, as well as lexical polarity items and their contextual modifiers. Using these data, I explore four major areas of sentiment research: (i) generation of sentiment lexicons, (ii) fine-grained opinion mining, (iii) message-level polarity classification, and (iv) discourse-aware sentiment analysis. In the first task, I compare three popular groups of lexicon generation methods: dictionary-, corpus-, and word-embedding–based ones, finding that dictionary-based systems generally yield better polarity lists than the last two groups. Apart from this, I propose a linear projection algorithm, whose results surpass many existing automatically-generated lexicons. Afterwards, in the second task, I examine two common approaches to automatic prediction of sentiment spans, their sources, and targets: conditional random fields (CRFs) and recurrent neural networks, obtaining higher scores with the former model and improving these results even further by redefining the structure of CRF graphs. When dealing with message-level polarity classification, I juxtapose three major sentiment paradigms: lexicon-, machine-learning–, and deep-learning–based systems, and try to unite the first and last of these method groups by introducing a bidirectional neural network with lexicon-based attention. Finally, in order to make the new classifier aware of microblogs' discourse structure, I let it separately analyze the elementary discourse units (EDUs) of each tweet and infer the overall polarity of a message from the scores of its EDUs with the help of two new approaches: latent-marginalized CRFs and Recursive Dirichlet Process.
There is evidence that infants start extracting words from fluent speech around 7.5 months of age (e.g., Jusczyk & Aslin, 1995) and that they use at least two mechanisms to segment word forms from fluent speech: prosodic information (e.g., Jusczyk, Cutler & Redanz, 1993) and statistical information (e.g., Saffran, Aslin & Newport, 1996). However, how these two mechanisms interact and whether they change during development is still not fully understood.
The main aim of the present work is to understand how infants exploit different cues to word segmentation when learning the language in their environment, and to explore whether this ability is related to later language skills. In Chapter 3, we sought to determine the reliability of the method used in most of the experiments in the present thesis (the Headturn Preference Procedure), and to examine correlations and individual differences between infants’ performance and later language outcomes. In Chapter 4, we investigated how German-speaking adults weigh statistical and prosodic information for word segmentation. We familiarized adults with an auditory string in which statistical and prosodic information indicated different word boundaries and obtained both behavioral and pupillometric responses. We then conducted further experiments to understand how different cues to word segmentation are exploited by 9-month-old German-learning infants (Chapter 5) and by 6-month-old German-learning infants (Chapter 6). In addition, we conducted follow-up questionnaires with the infants and obtained language outcomes at later stages of development.
The findings of this thesis reveal that (1) German-speaking adults give strong weight to prosodic cues, at least for the materials used in this study, and that (2) German-learning infants weigh these two kinds of cues differently depending on age and/or language experience. We observed that, unlike English-learning infants, 6-month-old German-learning infants relied more strongly on prosodic cues. Nine-month-olds did not show a preference for either cue in the word segmentation task. From the present results, it remains unclear whether the ability to use prosodic cues for word segmentation relates to later vocabulary. We speculate that prosody provides infants with their first window into the specific acoustic regularities in the signal, which enables them to master the specific stress pattern of German rapidly. Our findings are a step forward in understanding the early impact of native prosody compared to statistical learning in early word segmentation.
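As an illustration of what a statistical cue to word boundaries can look like, the following minimal sketch computes transitional probabilities between adjacent syllables and places a boundary wherever the probability drops below a threshold. It assumes that the statistical information is operationalized as syllable-to-syllable transitional probability, as in the statistical learning literature cited above; the syllable stream, the threshold, and the function names are hypothetical and are not taken from the thesis materials.

```python
from collections import Counter


def transitional_probabilities(syllables):
    """Return P(next | current) for every adjacent syllable pair in the stream."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    # Count each syllable only where it has a successor (all but the last position).
    first_counts = Counter(syllables[:-1])
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}


def segment(syllables, threshold):
    """Insert a word boundary wherever the transitional probability is below threshold."""
    if not syllables:
        return []
    tps = transitional_probabilities(syllables)
    words, current = [], [syllables[0]]
    for prev, nxt in zip(syllables, syllables[1:]):
        if tps[(prev, nxt)] < threshold:
            words.append("".join(current))
            current = []
        current.append(nxt)
    words.append("".join(current))
    return words


# Hypothetical toy stream: three trisyllabic "words" in varying order.
stream = "bi da ku pa do ti go la bu bi da ku go la bu pa do ti bi da ku".split()
print(segment(stream, threshold=0.75))
```

In this toy stream, within-word transitions always have probability 1.0, whereas transitions across word boundaries have probability 0.5, so any threshold between the two values recovers the three words. Prosodic cues such as stress are not modelled here.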
The main goal of this thesis is to explore the feasibility of using cross-lingual annotation projection as a method of alleviating the task of manual coreference annotation.
To reach our goal, we build a first trilingual parallel coreference corpus that encompasses multiple genres. For the annotation of the corpus, we develop common coreference annotation guidelines that are applicable to three languages (English, German, Russian) and include a novel domain-independent typology of bridging relations as well as state-of-the-art near-identity categories.
Thereafter, we design and perform several annotation projection experiments. In the first experiment, we implement a direct projection method with only one source language. Our results indicate that, even in a knowledge-lean scenario, our projection approach is superior to the most closely related work of Postolache et al. (2006). Since the quality of the resulting annotations depends heavily on the word alignment, we demonstrate how using limited syntactic information helps to further improve mention extraction on the target side. As a next step, in our second experiment, we show how exploiting two source languages, by concatenating the annotations projected from each, helps to improve the quality of target annotations for both language pairs. Finally, we assess the projection quality in a fully automatic scenario (using automatically produced source annotations), and propose a pilot experiment on manual projection of bridging pairs.
For each of the experiments, we carry out an in-depth error analysis, and we conclude that noisy word alignments, translation divergences and morphological and syntactic differences between languages are responsible for projection errors. We systematically compare and evaluate our projection methods, and we investigate the errors both qualitatively and quantitatively in order to identify problematic cases. Finally, we discuss the applicability of our method to coreference annotations and propose several avenues of future research.
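The idea of direct projection through word alignments can be sketched as follows. This is a minimal illustration and not the implementation used in the thesis: it assumes that mentions are token spans with a chain identifier, that alignments are given as pairs of source and target token indices, and that a projected mention covers the smallest target span containing all aligned tokens (a common heuristic, adopted here as an assumption). The function name and the example sentence pair are hypothetical.

```python
def project_mentions(source_mentions, alignment):
    """Project source-side mention spans onto the target side via word alignment.

    source_mentions: list of (start, end, chain_id), token indices inclusive.
    alignment: iterable of (source_index, target_index) pairs.
    Returns (start, end, chain_id) tuples in target token indices.
    Mentions without any aligned token are dropped.
    """
    src_to_tgt = {}
    for s, t in alignment:
        src_to_tgt.setdefault(s, []).append(t)

    projected = []
    for start, end, chain_id in source_mentions:
        targets = [t for s in range(start, end + 1) for t in src_to_tgt.get(s, [])]
        if not targets:
            continue  # nothing to project: the mention is lost on the target side
        # Assumption: take the minimal contiguous span covering all aligned tokens.
        projected.append((min(targets), max(targets), chain_id))
    return projected


# Hypothetical example.
# English: The(0) woman(1) said(2) she(3) was(4) tired(5)
# German:  Die(0) Frau(1) sagte(2) ,(3) sie(4) sei(5) müde(6)
alignment = [(0, 0), (1, 1), (2, 2), (3, 4), (4, 5), (5, 6)]
mentions = [(0, 1, "c1"), (3, 3, "c1")]
print(project_mentions(mentions, alignment))  # [(0, 1, 'c1'), (4, 4, 'c1')]
```

The sketch also shows why alignment quality matters: a missing alignment link silently drops a mention, and a wrong link stretches the projected span, which corresponds to the noisy-alignment errors identified in the error analysis.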