Filtern
Erscheinungsjahr
- 2022 (34) (entfernen)
Dokumenttyp
- Wissenschaftlicher Artikel (29)
- Dissertation (3)
- Bachelorarbeit (1)
- Postprint (1)
Sprache
- Englisch (34) (entfernen)
Gehört zur Bibliographie
- ja (34)
Schlagworte
- German (3)
- focus (3)
- Individual differences (2)
- Prosody (2)
- alternatives (2)
- information structure (2)
- interference (2)
- processing (2)
- similarity-based interference (2)
- Bayes factors (1)
Institut
- Department Linguistik (34) (entfernen)
Sonority is a fundamental notion in phonetics and phonology, central to many descriptions of the syllable and various useful predictions in phonotactics. Although widely accepted, sonority lacks a clear basis in speech articulation or perception, given that traditional formal principles in linguistic theory are often exclusively based on discrete units in symbolic representation and are typically not designed to be compatible with auditory perception, sensorimotor control, or general cognitive capacities. In addition, traditional sonority principles also exhibit systematic gaps in empirical coverage. Against this backdrop, we propose the incorporation of symbol-based and signal-based models to adequately account for sonority in a complementary manner. We claim that sonority is primarily a perceptual phenomenon related to pitch, driving the optimization of syllables as pitch-bearing units in all language systems. We suggest a measurable acoustic correlate for sonority in terms of periodic energy, and we provide a novel principle that can account for syllabic well-formedness, the nucleus attraction principle (NAP). We present perception experiments that test our two NAP-based models against four traditional sonority models, and we use a Bayesian data analysis approach to test and compare them. Our symbolic NAP model outperforms all the other models we test, while our continuous bottom-up NAP model is at second place, along with the best performing traditional models. We interpret the results as providing strong support for our proposals: (i) the designation of periodic energy as the acoustic correlate of sonority; (ii) the incorporation of continuous entities in phonological models of perception; and (iii) the dual-model strategy that separately analyzes symbol-based top-down processes and signal-based bottom-up processes in speech perception.
The main goal of this dissertation is to experimentally investigate how focus is realised, perceived, and processed by native Turkish speakers, independent of preconceived notions of positional restrictions. Crucially, there are various issues and scientific debates surrounding focus in the Turkish language in the existing literature (chapter 1). It is argued in this dissertation that two factors led to the stagnant literature on focus in Turkish: the lack of clearly defined, modern understandings of information structure and its fundamental notion of focus, and the ongoing and ill-defined debate surrounding the question of whether there is an immediately preverbal focus position in Turkish. These issues gave rise to specific research questions addressed across this dissertation. Specifically, we were interested in how the focus dimensions such as focus size (comparing narrow constituent and broad sentence focus), focus target (comparing narrow subject and narrow object focus), and focus type (comparing new-information and contrastive focus) affect Turkish focus realisation and, in turn, focus comprehension when speakers are provided syntactic freedom to position focus as they see fit.
To provide data on these core goals, we presented three behavioural experiments based on a systematic framework of information structure and its notions (chapter 2): (i) a production task with trigger wh-questions and contextual animations manipulated to elicit the focus dimensions of interest (chapter 3), (ii) a timed acceptability judgment task in listening to the recorded answers in our production task (chapter 4), and (iii) a self-paced reading task to gather on-line processing data (chapter 5).
Based on the results of the conducted experiments, multiple conclusions are made in this dissertation (chapter 6). Firstly, this dissertation demonstrated empirically that there is no focus position in Turkish, neither in the sense of a strict focus position language nor as a focally loaded position facilitating focus perception and/or processing. While focus is, in fact, syntactically variable in the Turkish preverbal area, this is a consequence of movement triggered by other IS aspects like topicalisation and backgrounding, and the observational markedness of narrow subject focus compared to narrow object focus. As for focus type in Turkish, this dimension is not associated with word order in production, perception, or processing. Significant acoustic correlates of focus size (broad sentence focus vs narrow constituent focus) and focus target (narrow subject focus vs narrow object focus) were observed in fundamental frequency and intensity, representing focal boost, (postfocal) deaccentuation, and the presence or absence of a phrase-final rise in the prenucleus, while the perceivability of these effects remains to be investigated. In contrast, no acoustic correlates of focus type in simple, three-word transitive structures were observed, with focus types being interchangeable in mismatched question-answer pairs. Overall, the findings of this dissertation highlight the need for experimental investigations regarding focus in Turkish, as theoretical predictions do not necessarily align with experimental data. As such, the fallacy of implying causation from correlation should be strictly kept in mind, especially when constructions coincide with canonical structures, such as the immediately preverbal position in narrow object foci. Finally, numerous open questions remain to be explored, especially as focus and word order in Turkish are multifaceted. As shown, givenness is a confounding factor when investigating focus types, while thematic role assignment potentially confounds word order preferences. Further research based on established, modern information structure frameworks is needed, with chapter 5 concluding with specific recommendations for such future research.
The picture-word interference paradigm (participants name target pictures while ignoring distractor words) is often used to model the planning processes involved in word production. The participants' naming times are delayed in the presence of a distractor (general interference). The size of this effect depends on the relationship between the target and distractor words. Distractors of the same semantic category create more interference (semantic interference), and distractors overlapping in phonology create less interference (phonological facilitation). The present study examined the relationships between these experimental effects, processing times, and attention in order to better understand the cognitive processes underlying participants' behavior in this paradigm. Participants named pictures with a superimposed line of Xs, semantically related distractors, phonologically related distractors, or unrelated distractors. General interference, semantic interference, and phonological facilitation effects were replicated. Distributional analyses revealed that general and semantic interference effects increase with naming times, while phonological facilitation decreases. The phonological facilitation and semantic interference effects were found to depend on the synchronicity in processing times between the planning of the picture's name and the processing of the distractor word. Finally, electroencephalographic power in the alpha band before stimulus onset varied with the position of the trial in the experiment and with repetition but did not predict the size of interference/facilitation effects. Taken together, these results suggest that experimental effects in the picture-word interference paradigm depend on processing times to both the target word and distractor word and that distributional patterns could partly reflect this dependency.
We investigated the processing of morphologically complex words adopting an approach that goes beyond estimating average effects and allows testing predictions about variability in performance. We tested masked morphological priming effects with English derived ('printer') and inflected ('printed') forms priming their stems ('print') in non-native speakers, a population that is characterized by large variability. We modeled reaction times with a shifted-lognormal distribution using Bayesian distributional models, which allow assessing effects of experimental manipulations on both the mean of the response distribution ('mu') and its standard deviation ('sigma'). Our results show similar effects on mean response times for inflected and derived primes, but a difference between the two on the sigma of the distribution, with inflectional priming increasing response time variability to a significantly larger extent than derivational priming. This is in line with previous research on non-native processing, which shows more variable results across studies for the processing of inflected forms than for derived forms. More generally, our study shows that treating variability in performance as a direct object of investigation can crucially inform models of language processing, by disentangling effects which would otherwise be indistinguishable. We therefore emphasize the importance of looking beyond average performance and testing predictions on other parameters of the distribution rather than just its central tendency.
Background:
Aphasia therapy software applications (apps) can help achieve recommendations regarding aphasia treatment intensity and duration.
However, we currently know very little about speech and language therapists' (SLTs) preferences with regards to these apps.
This may be problematic, as clinician acceptance of novel treatments and technology are a key factor for successful translation from research evidence to practice.
Aim:
This research aimed to increase our understanding of clinicians' experiences with aphasia therapy apps and their perceived barriers and facilitators to the use of aphasia apps. Furthermore, we wanted to explore the influence of some demographic factors (age, country, and SLT availability in the client's hometown) on SLTs' attitudes towards these apps.
Method & Procedures:
35 Dutch and 29 Australian SLTs completed an online survey. The survey contained 9 closed-ended questions and 3 open-ended questions. Responses to the closed-ended questions were summarised through the use of descriptive statistics. The responses to the open questions were analysed and coded into recurring themes that were derived from the data. Logistic regression analyses were performed to explore the relationship between the demographic variables and the responses to the closed-ended questions.
Outcomes & results:
Participants were overwhelmingly positive about aphasia therapy apps and saw the potential for their clients to use apps independently. As facilitators of app use, participants reported accessibility and inclusion of different language modalities, while high costs, absence of a compatible device, and clients' potential computer illiteracy were listed as barriers. None of the analysed demographic factors consistently influenced differences in participants' attitudes towards aphasia therapy apps.
Conclusions:
The positive, extensive and insightful feedback from speech and language therapists is both useful and encouraging for app developers and aphasia researchers, and should facilitate the development of appropriate, high-quality therapy apps.
In this study, we investigated the cognitive-emotional interplay by measuring the effects of executive competition (Pessoa, 2013), i.e., how inhibitory control is influenced when emotional information is encountered. Sixty-three children (8 to 9 years of age) participated in an inhibition task (central task) accompanied by happy, sad, or neutral emoticons (displayed in the periphery). Typical interference effects were found in the main task for speed and accuracy, but in general, these effects were not additionally modulated by the peripheral emoticons indicating that processing of the main task exhausted the limited capacity such that interference from the task-irrelevant, peripheral information did not show (Pessoa, 2013). Further analyses revealed that the magnitude of interference effects depended on the order of congruency conditions: when incongruent conditions preceded congruent ones, there was greater interference. This effect was smaller in sad conditions, and particularly so at the beginning of the experiment. These findings suggest that the bottom-up perception of task-irrelevant emotional information influenced the top-down process of inhibitory control among children in the sad condition when processing demands were particularly high. We discuss if the salience and valence of the emotional stimuli as well as task demands are the decisive characteristics that modulate the strength of this relation.
Dynamical models make specific assumptions about cognitive processes that generate human behavior. In data assimilation, these models are tested against timeordered data. Recent progress on Bayesian data assimilation demonstrates that this approach combines the strengths of statistical modeling of individual differences with the those of dynamical cognitive models.
This paper presents the results of a novel experimental approach to relative quantifier scope in German that elicits data in an indirect manner. Applying the covered-box method (Huang et al. 2013) to scope phenomena, we show that inverse scope is available to some extent in the free constituent order language German, thereby validating earlier findings on other syntactic configurations in German (Rado & Bott 2018) and empirical claims on other free constituent order languages (Japanese, Russian, Hindi), as well as recent corpus findings in Webelhuth (2020). Moreover, the results of the indirect covered-box experiment replicate findings from an earlier direct-query experiment with comparable target items, in which participants were asked directly about the availability of surface scope and inverse scope readings. The configuration of interest consisted of canonical transitive clauses with deaccented existential subject and universal object QPs, in which the restriction of the universal QP was controlled for by the context.
Pronouns can sometimes covary with a non c-commanding quantifier phrase (QP). To obtain such 'telescoping' readings, a semantic representation must be computed in which the QP's semantic scope extends beyond its surface scope. Non-native speakers have been claimed to have more difficulty than native speakers deriving such non-isomorphic syntax-semantics mappings, but evidence from processing studies is scarce. We report the results from an eye-movement monitoring experiment and an offline questionnaire investigating whether native and non-native speakers of German can link personal pronouns to non c-commanding QPs inside relative clauses. Our results show that both participant groups were able to obtain telescoping readings offline, but only the native speakers showed evidence of forming telescoping dependencies during incremental parsing. During processing the non-native speakers focused on a discourse-prominent, non-quantified alternative antecedent instead. The observed group differences indicate that non-native comprehenders have more difficulty than native comprehenders computing scope-shifted representations in real time.
Studies of word production often make use of picture-naming tasks, including the picture-word-interference task. In this task, participants name pictures with superimposed distractor words. They typically need more time to name pictures when the distractor word is semantically related to the picture than when it is unrelated (the semantic interference effect). The present study examines the distributional properties of this effect in a series of Bayesian meta-analyses. Meta-analytic estimates of the semantic interference effect first show that the effect is present throughout the reaction time distribution and that it increases throughout the distribution. Second, we find a correlation between a participant's mean semantic interference effect and the change in the effect in the tail of the reaction time distribution, which has been argued to reflect the involvement of selective inhibition in the naming task. Finally, we show with simulated data that this correlation emerges even when no inhibition is used to generate the data, which suggests that inhibition is not needed to explain this relationship.
Individuals differ in the time needed to name a picture. This contribution asks whether this inter-individual variability emerges in earlier stages of word production (e.g. lexical selection) or later stages (e.g. articulation) and examines the consequences of this variability for EEG group results. We measured participants' (N = 45) naming latencies and continuous EEG in a picture-word interference task and naming latencies in a delayed naming task. Inter-individual variability in naming latencies in immediate naming (in contrast with inter-item variability) was not larger than the variability in the delayed task, suggesting that some variability in immediate naming originates in later stages of word production. EEG data complemented this interpretation: Differences between relatively fast vs. slow speakers emerged in response-aligned analyses in a time window close to the vocal response. We additionally present a method to assess the generalisability of the timing of effects across participants based on random sampling.
Neural conversation models aim to predict appropriate contributions to a (given) conversation by using neural networks trained on dialogue data. A specific strand focuses on non-goal driven dialogues, first proposed by Ritter et al. (2011): They investigated the task of transforming an utterance into an appropriate reply. Then, this strand evolved into dialogue system approaches using long dialogue histories and additional background context. Contributing meaningful and appropriate to a conversation is a complex task, and therefore research in this area has been very diverse: Serban et al. (2016), for example, looked into utilizing variable length dialogue histories, Zhang et al. (2018) added additional context to the dialogue history, Wolf et al. (2019) proposed a model based on pre-trained Self-Attention neural networks (Vasvani et al., 2017), and Dinan et al. (2021) investigated safety issues of these approaches. This trend can be seen as a transformation from trying to somehow carry on a conversation to generating appropriate replies in a controlled and reliable way.
In this thesis, we first elaborate the meaning of appropriateness in the context of neural conversation models by drawing inspiration from the Cooperative Principle (Grice, 1975). We first define what an appropriate contribution has to be by operationalizing these maxims as demands on conversation models: being fluent, informative, consistent towards given context, coherent and following a social norm. Then, we identify different targets (or intervention points) to achieve the conversational appropriateness by investigating recent research in that field.
In this thesis, we investigate the aspect of consistency towards context in greater detail, being one aspect of our interpretation of appropriateness.
During the research, we developed a new context-based dialogue dataset (KOMODIS) that combines factual and opinionated context to dialogues. The KOMODIS
dataset is publicly available and we use the data in this thesis to gather new insights in context-augmented dialogue generation.
We further introduced a new way of encoding context within Self-Attention based neural networks. For that, we elaborate the issue of space complexity from knowledge graphs,
and propose a concise encoding strategy for structured context inspired from graph neural networks (Gilmer et al., 2017) to reduce the space complexity of the additional context. We discuss limitations of context-augmentation for neural conversation models, explore the characteristics of knowledge graphs, and explain how we create and augment knowledge graphs for our experiments.
Lastly, we analyzed the potential of reinforcement and transfer learning to improve context-consistency for neural conversation models. We find that current reward functions need to be more precise to enable the potential of reinforcement learning, and that sequential transfer learning can improve the subjective quality of generated dialogues.
Meaning and alternatives
(2022)
Alternatives and competition in language are pervasive at all levels of linguistic analysis. More specifically, alternatives have been argued to play a prominent role in an ever-growing class of phenomena in the investigation of natural language meaning. In this article, we focus on scalar implicatures, as they are arguably the most paradigmatic case of an alternative-based phenomenon. We first review the main challenge for theories of alternatives, the so-called symmetry problem, and we briefly discuss how it has shaped the different approaches to alternatives. We then turn to two more recent challenges concerning scalar diversity and the inferences of sentences with multiple scalars. Finally, we describe several related alternative-based phenomena and recent conceptual approaches to alternatives. As we discuss, while important progress has been made, much more work is needed both on the theoretical side and on understanding the empirical landscape better.
In the current study, we explore how different information-structural devices affect which referents conversational partners expect in the upcoming discourse. Our main research question is how pitch accents (H*, L+H*) and focus particles (German nur `only' and auch 'also') affect speakers' choices to mention focused referents, previously mentioned alternatives or new, inferable alternatives. Participants in our experiment were presented with short discourses involving two referents and were asked to orally produce two sentences that continue the story. An analysis of speakers' continuations showed that participants were most likely to mention a contextual alternative in the condition with only and the L+H* conditions, followed by H* conditions. In the condition with also, in turn, participants mentioned both the focused/accented referent and the contextual alternative. Our findings highlight the importance of information structure for discourse management and suggest that speakers take activated alternatives to be relevant for an unfolding discourse.
The study of perceptual flexibility in speech depends on a variety of tasks that feature a large degree of variability between participants. Of critical interest is whether measures are consistent within an individual or across stimulus contexts. This is particularly key for individual difference designs that are deployed to examine the neural basis or clinical consequences of perceptual flexibility. In the present set of experiments, we assess the split-half reliability and construct validity of five measures of perceptual flexibility: three of learning in a native language context (e.g., understanding someone with a foreign accent) and two of learning in a non-native context (e.g., learning to categorize non-native speech sounds). We find that most of these tasks show an appreciable level of split-half reliability, although construct validity was sometimes weak. This provides good evidence for reliability for these tasks, while highlighting possible upper limits on expected effect sizes involving each measure.
The Final-over-Final Condition has emerged as a robust and explanatory generalization for a wide range of phenomena (Biberauer, Holmberg, and Roberts 2014, Sheehan et al. 2017). In this article, we argue that it also holds in another domain, nominalization. In languages that show overt nominalization of VPs, one word order is routinely unattested, namely, a head-initial VP with a suffixal nominalizer. This typological gap can be accounted for by the Final-over-Final Condition, if we allow it to hold within mixed extended projections. This view also makes correct predictions about agentive nominalizations and nominalized serial verb constructions.
A comprehensive theory of child language acquisition requires an evidential base that is representative of the typological diversity present in the world's 7000 or so languages. However, languages are dying at an alarming rate, and the next 50 years represents the last chance we have to document acquisition in many of them. Here, we take stock of the last 45 years of research published in the four main child language acquisition journals: Journal of Child Language, First Language, Language Acquisition and Language Learning and Development. We coded each article for several variables, including (1) participant group (mono vs multilingual), (2) language(s), (3) topic(s) and (4) country of author affiliation, from each journal's inception until the end of 2020. We found that we have at least one article published on around 103 languages, representing approximately 1.5% of the world's languages. The distribution of articles was highly skewed towards English and other well-studied Indo-European languages, with the majority of non-Indo-European languages having just one paper. A majority of the papers focused on studies of monolingual children, although papers did not always explicitly report participant group status. The distribution of topics across language categories was more even. The number of articles published on non-Indo-European languages from countries outside of North America and Europe is increasing; however, this increase is driven by research conducted in relatively wealthy countries. Overall, the vast majority of the research was produced in the Global North. We conclude that, despite a proud history of crosslinguistic research, the goals of the discipline need to be recalibrated before we can lay claim to truly a representative account of child language acquisition.
Agreement attraction is a cross-linguistic phenomenon where a verb occasionally agrees not with its subject, as required by grammar, but instead with an unrelated noun ("The key to the cabinets were horizontal ellipsis ").
Despite the clear violation of grammatical rules, comprehenders often rate these sentences as acceptable. Contenders for explaining agreement attraction fall into two broad classes: Morphosyntactic accounts specifically designed to explain agreement attraction, and more general sentence processing models, such as the Lewis and Vasishth model, which explain attraction as a consequence of how linguistic structure is stored and accessed in content-addressable memory.
In the present research, we disambiguate between these two classes by testing a surprising prediction made by the Lewis and Vasishth model but not by the morphosyntactic accounts, namely, that attraction should not be limited to morphosyntax, but that semantic features of unrelated nouns equally induce attraction.
A recent study by Cunnings and Sturt provided initial evidence that this may be the case. Here, we report three single-trial experiments in English that compared semantic and agreement attraction and tested whether and how the two interact.
All three experiments showed strong semantically induced attraction effects closely mirroring agreement attraction effects. We complement these results with computational simulations which confirmed that the Lewis and Vasishth model can faithfully reproduce the observed results.
In sum, our findings suggest that attraction is a more general phenomenon than is commonly believed, and therefore favor more general sentence processing models, such as the Lewis and Vasishth model.
In 2019 the Journal of Memory and Language instituted an open data and code policy; this policy requires that, as a rule, code and data be released at the latest upon publication. How effective is this policy? We compared 59 papers published before, and 59 papers published after, the policy took effect. After the policy was in place, the rate of data sharing increased by more than 50%. We further looked at whether papers published under the open data policy were reproducible, in the sense that the published results should be possible to regenerate given the data, and given the code, when code was provided. For 8 out of the 59 papers, data sets were inaccessible. The reproducibility rate ranged from 34% to 56%, depending on the reproducibility criteria. The strongest predictor of whether an attempt to reproduce would be successful is the presence of the analysis code: it increases the probability of reproducing reported results by almost 40%. We propose two simple steps that can increase the reproducibility of published papers: share the analysis code, and attempt to reproduce one's own analysis using only the shared materials.
Young infants can segment continuous speech with statistical as well as prosodic cues. Understanding how these cues interact can be informative about how infants solve the segmentation problem. Here we investigate how German-speaking adults and 9-month-old German-learning infants weigh statistical and prosodic cues when segmenting continuous speech. We measured participants' pupil size while they were familiarized with a continuous speech stream where prosodic cues were pitted off against transitional probabilities. Adult participants' changes in pupil size synchronized with the occurrence of prosodic words during the familiarization and the temporal alignment of these pupillary changes was predictive of adult participants' performance at test. Further, 9-month-olds as a group failed to consistently segment the familiarization stream with prosodic or statistical cues. However, the variability in temporal alignment of the pupillary changes at word frequency showed that prosodic and statistical cues compete for dominance when segmenting continuous speech. A followup language development questionnaire at 40 months of age suggested that infants who entrained to prosodic words performed better on a vocabulary task and those infants who relied more on statistical cues performed better on grammatical tasks. Together these results suggest that statistics and prosody may serve different roles in speech segmentation in infancy.