publish.UP Search

Variables and similarity in linguistic generalization: Evidence from inflectional classes in Portuguese (2014)

Verissimo, Joao Marques ; Clahsen, Harald

Two opposing viewpoints have been advanced to account for morphological productivity, one according to which some knowledge is couched in the form of operations over variables, and another in which morphological generalization is primarily determined by similarity. We investigated this controversy by examining the generalization of Portuguese verb stems, which fall into one of three conjugation classes. In Study 1, an elicited production task revealed that the generalization of 2nd and 3rd conjugation stems is influenced by the degree of phonological similarity between novel roots and existing verbs, whereas the 1st conjugation generalizes beyond similarity. In Study 2, we directly contrasted two distinct computational implementations of conjugation class assignment in how well they matched the human data: a similarity-driven model that captures phonological similarities, and a dual-mechanism model that implements an explicit distinction between context-free and similarity-based generalizations. The similarity-driven model consistently underestimated 1st conjugation responses and overestimated proportions of 2nd and 3rd conjugation responses, especially for novel verbs that are highly similar to existing verbs of those classes. In contrast, the expected proportions produced by the dual-mechanism model were statistically indistinguishable from human responses. We conclude that both context-free and context-sensitive processes determine the generalization of conjugations in Portuguese, and that similarity-based algorithms of morphological acquisition are insufficient to exhibit default-like generalization. (C) 2014 Elsevier Inc. All rights reserved.

Extending a Gradient Symbolic approach to the native versus non-native contrast: The case of plurals in compounds (2016)

Verissimo, Joao Marques

The Gradient Symbolic Computation (GSC) model presented in the keynote article (Goldrick, Putnam & Schwarz) constitutes a significant theoretical development, not only as a model of bilingual code-mixing, but also as a general framework that brings together symbolic grammars and graded representations. The authors are to be commended for successfully integrating a theory of grammatical knowledge with the voluminous research on lexical co-activation in bilinguals. It is, however, unfortunate that a certain conception of bilingualism was inherited from this latter research tradition, one in which the contrast between native and non-native language takes a back seat.

What eye movements can tell us about sentence comprehension (2013)

Vasishth, Shravan ; von der Malsburg, Titus Raban ; Engelmann, Felix

Eye movement data have proven to be very useful for investigating human sentence processing. Eyetracking research has addressed a wide range of questions, such as recovery mechanisms following garden-pathing, the timing of processes driving comprehension, the role of anticipation and expectation in parsing, the role of semantic, pragmatic, and prosodic information, and so on. However, there are some limitations regarding the inferences that can be made on the basis of eye movements. One relates to the nontrivial interaction between parsing and the eye movement control system which complicates the interpretation of eye movement data. Detailed computational models that integrate parsing with eye movement control theories have the potential to unpack the complexity of eye movement data and can therefore aid in the interpretation of eye movements. Another limitation is the difficulty of capturing spatiotemporal patterns in eye movements using the traditional word-based eyetracking measures. Recent research has demonstrated the relevance of these patterns and has shown how they can be analyzed. In this review, we focus on reading, and present examples demonstrating how eye movement data reveal what events unfold when the parser runs into difficulty, and how the parsing system interacts with eye movement control. WIREs Cogn Sci 2013, 4:125134. doi: 10.1002/wcs.1209 For further resources related to this article, please visit the WIREs website.

Short-term forgetting in sentence comprehension : crosslinguistic evidence from verb-final structures (2010)

Vasishth, Shravan ; Suckow, Katja ; Lewis, Richard L. ; Kern, Sabine

Seven experiments using self-paced reading and eyetracking suggest that omitting the middle verb in a double centre embedding leads to easier processing in English but leads to greater difficulty in German. One commonly accepted explanation for the English pattern-based on data from offline acceptability ratings and due to Gibson and Thomas (1999)- is that working-memory overload leads the comprehender to forget the prediction of the upcoming verb phrase (VP), which reduces working-memory load. We show that this VP-forgetting hypothesis does an excellent job of explaining the English data, but cannot account for the German results. We argue that the English and German results can be explained by the parser's adaptation to the grammatical properties of the languages; in contrast to English, German subordinate clauses always have the verb in clause-final position, and this property of German may lead the German parser to maintain predictions of upcoming VPs more robustly compared to English. The evidence thus argues against language- independent forgetting effects in online sentence processing; working-memory constraints can be conditioned by countervailing influences deriving from grammatical properties of the language under study.

Computational Models of Retrieval Processes in Sentence Processing (2019)

Vasishth, Shravan ; Nicenboim, Bruno ; Engelmann, Felix ; Burchert, Frank

Sentence comprehension requires that the comprehender work out who did what to whom. This process has been characterized as retrieval from memory. This review summarizes the quantitative predictions and empirical coverage of the two existing computational models of retrieval and shows how the predictive performance of these two competing models can be tested against a benchmark data-set. We also show how computational modeling can help us better understand sources of variability in both unimpaired and impaired sentence comprehension.

Bayesian data analysis in the phonetic sciences (2018)

Vasishth, Shravan ; Nicenboim, Bruno ; Beckman, Mary E. ; Li, Fangfang ; Kong, Eun Jong

This tutorial analyzes voice onset time (VOT) data from Dongbei (Northeastern) Mandarin Chinese and North American English to demonstrate how Bayesian linear mixed models can be fit using the programming language Stan via the R package brms. Through this case study, we demonstrate some of the advantages of the Bayesian framework: researchers can (i) flexibly define the underlying process that they believe to have generated the data; (ii) obtain direct information regarding the uncertainty about the parameter that relates the data to the theoretical question being studied; and (iii) incorporate prior knowledge into the analysis. Getting started with Bayesian modeling can be challenging, especially when one is trying to model one’s own (often unique) data. It is difficult to see how one can apply general principles described in textbooks to one’s own specific research problem. We address this barrier to using Bayesian methods by providing three detailed examples, with source code to allow easy reproducibility. The examples presented are intended to give the reader a flavor of the process of model-fitting; suggestions for further study are also provided. All data and code are available from: https://osf.io/g4zpv.

Corrigendum to: Shravan Vasishth, Daniela Mertzen, Lena A. Jäger, Andrew Gelman; The statistical significance filter leads to overoptimistic expectations of replicability. - Journal of Memory and Language. - 103 (2018), pg. 151 - 175 (2018)

Vasishth, Shravan ; Mertzen, Daniela ; Jäger, Lena Ann ; Gelman, Andrew

The statistical significance filter leads to overoptimistic expectations of replicability (2018)

Vasishth, Shravan ; Mertzen, Daniela ; Jaeger, Lena A. ; Gelman, Andrew

It is well-known in statistics (e.g., Gelman & Carlin, 2014) that treating a result as publishable just because the p-value is less than 0.05 leads to overoptimistic expectations of replicability. These effects get published, leading to an overconfident belief in replicability. We demonstrate the adverse consequences of this statistical significance filter by conducting seven direct replication attempts (268 participants in total) of a recent paper (Levy & Keller, 2013). We show that the published claims are so noisy that even non-significant results are fully compatible with them. We also demonstrate the contrast between such small-sample studies and a larger-sample study; the latter generally yields a less noisy estimate but also a smaller effect magnitude, which looks less compelling but is more realistic. We reiterate several suggestions from the methodology literature for improving current practices.

Argument-head distance and processing complexity: Explaining both locality and antilocality effects (2006)

Vasishth, Shravan ; Lewis, Richard L.

Although proximity between arguments and verbs (locality) is a relatively robust determinant of sentence-processing difficulty (Hawkins 1998, 2001, Gibson 2000), increasing argument-verb distance can also facilitate processing (Konieczny 2000). We present two self-paced reading (SPR) experiments involving Hindi that provide further evidence of antilocality, and a third SPR experiment which suggests that similarity-based interference can attenuate this distance-based facilitation. A unified explanation of interference, locality, and antilocality effects is proposed via an independently motivated theory of activation decay and retrieval interference (Anderson et al. 2004).*

How to embrace variation and accept uncertainty in linguistic and psycholinguistic data analysis (2021)

Vasishth, Shravan ; Gelman, Andrew

The use of statistical inference in linguistics and related areas like psychology typically involves a binary decision: either reject or accept some null hypothesis using statistical significance testing. When statistical power is low, this frequentist data-analytic approach breaks down: null results are uninformative, and effect size estimates associated with significant results are overestimated. Using an example from psycholinguistics, several alternative approaches are demonstrated for reporting inconsistencies between the data and a theoretical prediction. The key here is to focus on committing to a falsifiable prediction, on quantifying uncertainty statistically, and learning to accept the fact that - in almost all practical data analysis situations - we can only draw uncertain conclusions from data, regardless of whether we manage to obtain statistical significance or not. A focus on uncertainty quantification is likely to lead to fewer excessively bold claims that, on closer investigation, may turn out to be not supported by the data.

Processing chinese relative clauses - evidence for the subject-relative advantage (2013)

Vasishth, Shravan ; Chen, Zhong ; Li, Qiang ; Guo, Gueilan

A general fact about language is that subject relative clauses are easier to process than object relative clauses. Recently, several self-paced reading studies have presented surprising evidence that object relatives in Chinese are easier to process than subject relatives. We carried out three self-paced reading experiments that attempted to replicate these results. Two of our three studies found a subject-relative preference, and the third study found an object-relative advantage. Using a random effects bayesian meta-analysis of fifteen studies (including our own), we show that the overall current evidence for the subject-relative advantage is quite strong (approximate posterior probability of a subject-relative advantage given the data: 78-80%). We argue that retrieval/integration based accounts would have difficulty explaining all three experimental results. These findings are important because they narrow the theoretical space by limiting the role of an important class of explanation-retrieval/integration cost-at least for relative clause processing in Chinese.

The foundations of statistics: a simulation-based approach (2011)

Vasishth, Shravan ; Broe, Michael

Using approximate Bayesian computation for estimating parameters in the cue-based retrieval model of sentence processing (2020)

Vasishth, Shravan

A commonly used approach to parameter estimation in computational models is the so-called grid search procedure: the entire parameter space is searched in small steps to determine the parameter value that provides the best fit to the observed data. This approach has several disadvantages: first, it can be computationally very expensive; second, one optimal point value of the parameter is reported as the best fit value; we cannot quantify our uncertainty about the parameter estimate. In the main journal article that this methods article accompanies (Jager et al., 2020, Interference patterns in subject-verb agreement and reflexives revisited: A large-sample study, Journal of Memory and Language), we carried out parameter estimation using Approximate Bayesian Computation (ABC), which is a Bayesian approach that allows us to quantify our uncertainty about the parameter's values given data. This customization has the further advantage that it allows us to generate both prior and posterior predictive distributions of reading times from the cue-based retrieval model of Lewis and Vasishth, 2005. <br /> Instead of the conventional method of using grid search, we use Approximate Bayesian Computation (ABC) for parameter estimation in the [4] model. <br /> The ABC method of parameter estimation has the advantage that the uncertainty of the parameter can be quantified.

The role of mother-infant emotional synchrony in speech processing in 9-month-old infants (2022)

Vanoncini, Monica ; Boll-Avetisyan, Natalie ; Elsner, Birgit ; Hoehl, Stefanie ; Kayhan, Ezgi

Rhythmicity characterizes both interpersonal synchrony and spoken language. Emotions and language are forms of interpersonal communication, which interact with each other throughout development. We investigated whether and how emotional synchrony between mothers and their 9-month-old infants relates to infants' word segmentation as an early marker of language development. Twenty-six 9-month-old infants and their German-speaking mothers took part in the study. To measure emotional synchrony, we coded positive, neutral and negative emotional expressions of the mothers and their infants during a free play session. We then calculated the degree to which the mothers' and their infants' matching emotional expressions followed a predictable pattern. To measure word segmentation, we familiarized infants with auditory text passages and tested how long they looked at the screen while listening to familiar versus novel words. We found that higher levels of predictability (i.e. low entropy) during mother-infant interaction is associated with infants' word segmentation performance. These findings suggest that individual differences in word segmentation relate to the complexity and predictability of emotional expressions during mother-infant interactions.

Language-specific prosodic acquisition (2020)

van Ommen, Sandrien ; Boll-Avetisyan, Natalie ; Larraza, Saioa ; Wellmann, Caroline ; Bijeljac-Babic, Ranka ; Höhle, Barbara ; Nazzi, Thierry

This study compares the development of prosodic processing in French- and German-learning infants. The emergence of language-specific perception of phrase boundaries was directly tested using the same stimuli across these two languages. French-learning (Experiment 1, 2) and German-learning 6- and 8-month-olds (Experiment 3) listened to the same French noun sequences with or without major prosodic boundaries ([Loulou et Manou] [et Nina]; [Loulou et Manou et Nina], respectively). The boundaries were either naturally cued (Experiment 1), or cued exclusively by pitch and duration (Experiment 2, 3). French-learning 6- and 8-month-olds both perceived the natural boundary, but neither perceived the boundary when only two cues were present. In contrast, German-learning infants develop from not perceiving the two-cue boundary at 6 months to perceiving it at 8 months, just like German-learning 8-month-olds listening to German (Wellmann, Holzgrefe, Truckenbrodt, Wartenburger, & Hohle, 2012). In a control experiment (Experiment 4), we found little difference between German and French adult listeners, suggesting that later, French listeners catch up with German listeners. Taken together, these cross-linguistic differences in the perception of identical stimuli provide direct evidence for language-specific development of prosodic boundary perception.

Metrical and statistical cues for word segmentation : the use of vowel harmony and word stress as a cue to word boundaries by 6- and 9-month-old Turkish learners (2008)

van Kampen, Anja ; Parmaksiz, Güliz ; van de Vijver, Ruben ; Höhle, Barbara

Resource allocation and fluid intelligence ; insights from pupillometry (2010)

van der Meer, Elke ; Beyer, Reinhard ; Horn, Judith ; Foth, Manja ; Bornemann, Boris ; Ries, Jan ; Kramer, Jürg ; Warmuth, Elke ; Heekeren, Hauke R. ; Wartenburger, Isabell

Negative affect is related to reduced differential neural responses to social and non-social stimuli in 5-to-8-month-old infants (2018)

van der Kant, Anne ; Biro, Szilvia ; Levelt, Claartje ; Huijbregts, Stephan

Both social perception and temperament in young infants have been related to social functioning later in life. Previous functional Near-Infrared Spectroscopy (fNIRS) data (Lloyd-Fox et al., 2009) showed larger blood-oxygenation changes for social compared to non-social stimuli in the posterior temporal cortex of five-month-old infants. We sought to replicate and extend these findings by using fNIRS to study the neural basis of social perception in relation to infant temperament (Negative Affect) in 37 five-to-eight-month-old infants. Infants watched short videos displaying either hand and facial movements of female actors (social dynamic condition) or moving toys and machinery (non-social dynamic condition), while fNIRS data were collected over temporal brain regions. Negative Affect was measured using the Infant Behavior Questionnaire. Results showed significantly larger blood-oxygenation changes in the right posterior-temporal region in the social compared to the non-social condition. Furthermore, this differential activation was smaller in infants showing higher Negative Affect. Our results replicate those of Lloyd-Fox et al. and confirmed that five-to-eight-month-old infants show cortical specialization for social perception. Furthermore, the decreased cortical sensitivity to social stimuli in infants showing high Negative Affect may be an early biomarker for later difficulties in social interaction.

An analysis of pitch and duration in material used to test L2 processing of words (2006)

van de Vijver, Ruben ; Sennema, Anke ; Zimmer–Stahl, Anne

The material reported on in this paper is part of a set of experiments in which the role of Information Structure on L2 processing of words is tested. Pitch and duration of 4 sets of experimental material in German and English are measured and analyzed in this paper. The well-known finding that accent boosts duration and pitch is confirmed. Syntactic and lexical means of marking focus, however, do not give the duration and the pitch of a word an extra boost.

On the distribution of dorsals in complex and simple onsets in child German, Dutch and English (2009)

van de Vijver, Ruben ; Höhle, Barbara ; Ott, Susan

Phonology and intonation (2007)

van de Vijver, Ruben ; Hellmuth, Sam ; Kügler, Frank ; Mayer, Jörg ; Stoel, Ruben

On the role of phonetic motivation and frequency in the acquisition of alternations (2013)

van de Vijver, Ruben ; Baer-Henney, Dinah

On the Role of Phonetic Motivation and Frequency in the Acquisition of Alternations German nouns may alternate in two ways: a final word-final voiceless obstruent in the singular may correspond to a voiced one in the plural and a back vowel in the singular may correspond to a front one in the plural. We investigate the role of phonetic motivation and frequency in the acquisition of these alternations. The voicing alternation has a phonetic motivation, but the vowel alternation does not. On the basis of two corpus studies, we conclude that both alternations occur with equal frequency in the ambient language. In two production experiments, one with 5-year-olds and one with adults, we asked both populations to form plurals for given singular words and nonces. The children produce more voicing alternations in nonces than adults and fewer vowel alternations than adults. We conclude that children rely more on phonetic motivation than adults.

Pisoni, D., Remez, R. (eds.), The handbook of speech perception; Oxford, Blackwell, 2005 (2009)

van de Vijver, Ruben

Does Dutch a-scrambling involve movement? Evidence from antecedent priming (2015)

van de Koot, Hans ; Silva, Renita ; Felser, Claudia ; Sato, Mikako

The present study focuses on A-scrambling in Dutch, a local word-order alternation that typically signals the discourse-anaphoric status of the scrambled constituent. We use cross-modal priming to investigate whether an A-scrambled direct object gives rise to antecedent reactivation effects in the position where a movement theory would postulate a trace. Our results indicate that this is not the case, thereby providing support for a base-generation analysis of A-scrambling in Dutch.

Morphological processing in heritage speakers (2021)

Uygun, Serkan ; Clahsen, Harald

Previous research has shown that heritage speakers struggle with inflectional morphology. 'Limitations of online resources' for processing a non-dominant language has been claimed as one possible reason for these difficulties. To date, however, there is very little experimental evidence on real-time language processing in heritage speakers. Here we report results from a masked priming experiment with 97 bilingual (Turkish/German) heritage speakers and a control group of 40 non-heritage speakers of Turkish examining regular and irregular forms of the Turkish aorist. We found that, for the regular aorist, heritage speakers use the same morphological decomposition mechanism ('affix stripping') as control speakers, whereas for processing irregularly inflected forms they exhibited more variability (i.e., less homogeneous performance) than the control group. Heritage speakers also demonstrated semantic priming effects. At a more general level, these results indicate that heritage speakers draw on multiple sources of information for recognizing morphologically complex words.

The effect of literacy on oral language processing implications for aphasia tests (2011)

Tsegaye, Mulugeta Tarekegne ; De Bleser, Ria ; Iribarren, Carolina

Most studies investigating the impact of literacy on oral language processing have shown that literacy provides phonological awareness skills in the processing of oral language. The implications of these results on aphasia tests could be significant and pose questions on the adequacy of such tools for testing non-literate individuals. Aiming at examining the impact of literacy on oral language processing and its implication on aphasia tests, this study tested 12 non-literate and 12 literate individuals with a modified Amharic version of the Bilingual Aphasia Test (Paradis and Amberber, 1991, Bilingual Aphasia Test. Amharic version. Hillsdale, NJ: Lawrence Erlbaum.). The problems of phonological awareness skills in oral language processing in non-literates are substantiated. In addition, compared with literate participants, non-literate individuals demonstrated difficulties in the word/sentence-picture matching tasks. This study has also revealed that the Amharic version of the Bilingual Aphasia Test may be viable for testing Amharic-speaking non-literate individuals with aphasia when modifications are incorporated.

Topic Drop and Null Subjects in German (2016)

Trutkowski, Ewa

This study presents new insights into null subjects, topic drop and the interpretation of topic-dropped elements. Besides providing an empirical data survey, it offers explanations to well-known problems, e.g. syncretisms in the context of null-subject licensing or the marginality of dropping an element which carries oblique case. The book constitutes a valuable source for both empirically and theoretically interested (generative) linguists.

Production of regular and non-regular verbs : evidence for a lexical entry complexity account (2010)

Trompelt, Helena

The incredible productivity and creativity of language depends on two fundamental resources: a mental lexicon and a mental grammar. Rules of grammar enable us to produce and understand complex phrases we have not encountered before and at the same time constrain the computation of complex expressions. The concepts of the mental lexicon and mental grammar have been thoroughly tested by comparing the use of regular versus non-regular word forms. Regular verbs (e.g. walk-walked) are computed using a suffixation rule in a neural system for grammatical processing; non-regular verbs (run-ran) are retrieved from associative memory. The role of regularity has only been explored for the past tense, where regularity is overtly visible. To explore the representation and encoding of regularity as well as the inflectional processes involved in the production of regular and non-regular verbs, this dissertation investigated three groups of German verbs: regular, irregular and hybrid verbs. Hybrid verbs in German have completely regular conjugation in the present tense and irregular conjugation in the past tense. Articulation latencies were measured while participants named pictures of actions, producing the 3rd person singular of regular, hybrid, and irregular verbs in present and past tense. Studying the production of German verbs in past and present tense, this dissertation explored the complexity of lexical entries as a decisive factor in the production of verbs.

Distributed optimality (2001)

Trommer, Jochen

In this thesis I propose a synthesis (Distributed Optimality, DO) between Optimality Theory (OT, Prince & Smolensky, 1993) and a morphological framework in a genuine derivational tradition, namely Distributed Morphology (DM) as developed by Halle & Marantz (1993). By carrying over the apparatus of OT to DM, phenomena which are captured in DM by language-specific rules or features of lexical entries, are given a more principled account in the terms of ranked universal constraints. On the other hand, also the DM part makes two contributions, namely strong locality and impoverishment. The first gives rise to a simple formal interpretation of DO, while the latter is shown to be indispensable in any theoretically satisfying account of agreement morphology. The empirical basis of the work is given by the complex agreement morphology of genetically different languages. Theoretical focus is mainly on two areas: First, so-called direction marking which is shown to be preferably treated in terms of constraints on feature realization. Second, the effects of precedence constraints which are claimed to regulate the status of agreement affixes as prefixes or suffixes and their respective order. A universal typology for the order of agreement categories by means of OT-constraints is proposed.

The Mental lexicon in acquisition (2019)

Trautwein, Jutta

The individual’s mental lexicon comprises all known words as well related infor-mation on semantics, orthography and phonology. Moreover, entries connect due to simi-larities in these language domains building a large network structure. The access to lexical information is crucial for processing of words and sentences. Thus, a lack of information in-hibits the retrieval and can cause language processing difficulties. Hence, the composition of the mental lexicon is essential for language skills and its assessment is a central topic of lin-guistic and educational research. In early childhood, measurement of the mental lexicon is uncomplicated, for example through parental questionnaires or the analysis of speech samples. However, with growing content the measurement becomes more challenging: With more and more words in the mental lexicon, the inclusion of all possible known words into a test or questionnaire be-comes impossible. That is why there is a lack of methods to assess the mental lexicon for school children and adults. For the same reason, there are only few findings on the courses of lexical development during school years as well as its specific effect on other language skills. This dissertation is supposed to close this gap by pursuing two major goals: First, I wanted to develop a method to assess lexical features, namely lexicon size and lexical struc-ture, for children of different age groups. Second, I aimed to describe the results of this method in terms of lexical development of size and structure. Findings were intended to help understanding mechanisms of lexical acquisition and inform theories on vocabulary growth. The approach is based on the dictionary method where a sample of words out of a dictionary is tested and results are projected on the whole dictionary to determine an indi-vidual’s lexicon size. In the present study, the childLex corpus, a written language corpus for children in German, served as the basis for lexicon size estimation. The corpus is assumed to comprise all words children attending primary school could know. Testing a sample of words out of the corpus enables projection of the results on the whole corpus. For this purpose, a vocabulary test based on the corpus was developed. Afterwards, test performance of virtual participants was simulated by drawing different lexicon sizes from the corpus and comparing whether the test items were included in the lexicon or not. This allowed determination of the relation between test performance and total lexicon size and thus could be transferred to a sample of real participants. Besides lexicon size, lexical content could be approximated with this approach and analyzed in terms of lexical structure. To pursue the presented aims and establish the sampling method, I conducted three consecutive studies. Study 1 includes the development of a vocabulary test based on the childLex corpus. The testing was based on the yes/no format and included three versions for different age groups. The validation grounded on the Rasch Model shows that it is a valid instrument to measure vocabulary for primary school children in German. In Study 2, I estab-lished the method to estimate lexicon sizes and present results on lexical development dur-ing primary school. Plausible results demonstrate that lexical growth follows a quadratic function starting with about 6,000 words at the beginning of school and about 73,000 words on average for young adults. Moreover, the study revealed large interindividual differences. Study 3 focused on the analysis of network structures and their development in the mental lexicon due to orthographic similarities. It demonstrates that networks possess small-word characteristics and decrease in interconnectivity with age. Taken together, this dissertation provides an innovative approach for the assessment and description of the development of the mental lexicon from primary school onwards. The studies determine recent results on lexical acquisition in different age groups that were miss-ing before. They impressively show the importance of this period and display the existence of extensive interindividual differences in lexical development. One central aim of future research needs to address the causes and prevention of these differences. In addition, the application of the method for further research (e.g. the adaptation for other target groups) and teaching purposes (e.g. adaptation of texts for different target groups) appears to be promising.

Non-canonical word order and temporal reference in Vietnamese (2021)

Tran, Thuan

The paper revisits Duffield's (2007) (Duffield, Nigel. 2007. Aspects of Vietnamese clausal structure: Separating tense from assertion. Linguistics 45(4). 765-814) analysis of the correlation between the position of a 'when'-phrase and the temporal reference of a bare sentence in Vietnamese. Bare sentences in Vietnamese, based on (Smith, Carlota S. & Mary S. Erbaugh. 2005. Temporal interpretation in Mandarin Chinese. Linguistics 43(4). 713-756), are argued to obtain their temporal interpretation from their aspectual composition, and the default temporal reference: bounded events are located in the past, unbounded events at present. It is shown that the correlation so observed in when-questions is superficial, and is tied to the syntax and semantics of temporal modification and the requirement that temporal adverbials denoting future time is base generated in sentence-initial position, and past time adverbials in sentence-final position. A 'when'-phrase, being temporally underspecified, obtains its temporal value from its base position. However, the correlation between word order and temporal reference in argument wh-questions and declaratives is factual, depending on whether the predicate-argument configuration allows for a telic interpretation or not. To be specific, it is dependent on whether the application of Generic Modification (Snyder, William. 2012. Parameter theory and motion predicates. In Violeta Demonte & Louise McNally (eds.), Telicity, change, and state. Acrosscategorial view of event structure, 279-299. Oxford: Oxford University Press) or accomplishment composition is realized. Canonical declaratives, and argument wh-questions, with telicity inducing material, license GM or accomplishment composition, yielding bounded events, hence past; by contrast, their noncanonical counterparts block GM or accomplishment composition, giving rise to unbounded event descriptions, hence non-past.

Contrastive topic and focus information in discourse : prosodic realisation and electrophysiological brain correlates (2005)

Toepel, Ulrike

Immediate phonetic convergence in a cue-distractor paradigm (2018)

Tobin, Stephen ; Hullebus, Marc Antony ; Gafos, Adamantios I.

During a cue-distractor task, participants repeatedly produce syllables prompted by visual cues. Distractor syllables are presented to participants via headphones 150 ms after the visual cue (before any response). The task has been used to demonstrate perceptuomotor integration effects (perception effects on production): response times (RTs) speed up as the distractor shares more phonetic properties with the response. Here it is demonstrated that perceptuomotor integration is not limited to RTs. Voice Onset Times (VOTs) of the distractor syllables were systematically varied and their impact on responses was measured. Results demonstrate trial-specific convergence of response syllables to VOT values of distractor syllables.

Lone contrastive topic constructions (2016)

Thuan, Tran

It has been long agreed by formal and functional researchers (primarily based on English data) that contrastive topic marking, namely marking a constituent as a contrastive topic via the B-accent/the rising intonation contour) requires the co-occurrence of focus marking via the A-accent/the falling intonation contour (see Sturgeon 2006, and references therein). However, this consensus has recently been disputed by new findings indicating the occurrence of utterances with only B-accent, dubbed as lone contrastive topic (Büring 2003, Constant 2014). In this paper, I argue, based on the data in Vietnamese, that the presence of lone contrastive topic is just apparent, and that the focus that co-occurs with the seemingly lone contrastive topic is a verum focus.

The clause as a locus of grammar and interaction (2005)

Thompson, Sandra A. ; Couper-Kuhlen, Elizabeth

This article draws on work at the interface of grammar and interaction to argue that the clause is a locus of interaction, in the sense that it is one of the most frequent grammatical formats which speakers orient to in projecting what actions are being done by others' utterances and in acting on these projections. Yet the way in which the clause affords grammatical projectability varies significantly from language to language. In fact, it depends on the nature of the clausal grammatical formats which are available as resources in a language: in some languages these allow early projection in the turn unit (as in English), in others they do not (as in Japanese). We focus here on these two languages and show that their variable grammatical projectability has repercussions on the way in which three interactional phenomena - next-turn onset, co-construction, and turn-unit extension - are realized in the respective speech communities. In each case the practices used are precisely the ones which the clausal grammatical formats in the given language promote. The evidence thus suggests that clauses are interactionally warranted, if variably built, formats for social action

Virality and Emotionality of the Lügenpresse Phenomenon: A Critical Discourse Analysis of German and French Right-Wing Websites (2018)

Thom, Linda

A note on apparent sluicing in Malagasy (2019)

Thiersch, Craig

Acoustic processing of temporally modulated sounds in infants evidence from a combined near-infrared spectroscopy and EEG study (2011)

Telkemeyer, Silke ; Rossi, Sonja ; Nierhaus, Till ; Steinbrink, Jens ; Obrig, Hellmuth ; Wartenburger, Isabell

Speech perception requires rapid extraction of the linguistic content from the acoustic signal. The ability to efficiently process rapid changes in auditory information is important for decoding speech and thereby crucial during language acquisition. Investigating functional networks of speech perception in infancy might elucidate neuronal ensembles supporting perceptual abilities that gate language acquisition. Interhemispheric specializations for language have been demonstrated in infants. How these asymmetries are shaped by basic temporal acoustic properties is under debate. We recently provided evidence that newborns process non-linguistic sounds sharing temporal features with language in a differential and lateralized fashion. The present study used the same material while measuring brain responses of 6 and 3 month old infants using simultaneous recordings of electroencephalography (EEG) and near-infrared spectroscopy (NIRS). NIRS reveals that the lateralization observed in newborns remains constant over the first months of life. While fast acoustic modulations elicit bilateral neuronal activations, slow modulations lead to right-lateralized responses. Additionally, auditory-evoked potentials and oscillatory EEG responses show differential responses for fast and slow modulations indicating a sensitivity for temporal acoustic variations. Oscillatory responses reveal an effect of development, that is, 6 but not 3 month old infants show stronger theta-band desynchronization for slowly modulated sounds. Whether this developmental effect is due to increasing fine-grained perception for spectrotemporal sounds in general remains speculative. Our findings support the notion that a more general specialization for acoustic properties can be considered the basis for lateralization of speech perception. The results show that concurrent assessment of vascular based imaging and electrophysiological responses have great potential in the research on language acquisition.

Sensitivity of newborn auditory cortex to the temporal structure of sounds (2009)

Telkemeyer, Silke ; Rossi, Sonja ; Koch, Stefan P. ; Nierhaus, Till ; Steinbrink, Jens ; Poeppel, David ; Obrig, Hellmuth ; Wartenburger, Isabell

Understanding the rapidly developing building blocks of speech perception in infancy requires a close look at the auditory prerequisites for speech sound processing. Pioneering studies have demonstrated that hemispheric specializations for language processing are already present in early infancy. However, whether these computational asymmetries can be considered a function of linguistic attributes or a consequence of basic temporal signal properties is under debate. Several studies in adults link hemispheric specialization for certain aspects of speech perception to an asymmetry in cortical tuning and reveal that the auditory cortices are differentially sensitive to spectrotemporal features of speech. Applying concurrent electrophysiological (EEG) and hemodynamic (near-infrared spectroscopy) recording to newborn infants listening to temporally structured nonspeech signals, we provide evidence that newborns process nonlinguistic acoustic stimuli that share critical temporal features with language in a differential manner. The newborn brain preferentially processes temporal modulations especially relevant for phoneme perception. In line with multi-time-resolution conceptions, modulations on the time scale of phonemes elicit strong bilateral cortical responses. Our data furthermore suggest that responses to slow acoustic modulations are lateralized to the right hemisphere. That is, the newborn auditory cortex is sensitive to the temporal structure of the auditory input and shows an emerging tendency for functional asymmetry. Hence, our findings support the hypothesis that development of speech perception is linked to basic capacities in auditory processing. From birth, the brain is tuned to critical temporal properties of linguistic signals to facilitate one of the major needs of humans: to communicate.

Children's gradient sensitivity to phonological mismatch (2018)

Tamasi, Katalin ; Mckean, Cristina ; Gafos, Adamantios I. ; Höhle, Barbara

In a preferential looking paradigm, we studied how children's looking behavior and pupillary response were modulated by the degree of phonological mismatch between the correct label of a target referent and its manipulated form. We manipulated degree of mismatch by introducing one or more featural changes to the target label. Both looking behavior and pupillary response were sensitive to degree of mismatch, corroborating previous studies that found differential responses in one or the other measure. Using time-course analyses, we present for the first time results demonstrating full separability among conditions (detecting difference not only between one vs. more, but also between two and three featural changes). Furthermore, the correct labels and small featural changes were associated with stable target preference, while large featural changes were associated with oscillating looking behavior, suggesting significant shifts in looking preference over time. These findings further support and extend the notion that early words are represented in great detail, containing subphonemic information.

Measuring children’s sensitivity to phonological detail using eye tracking and pupillometry (2016)

Tamasi, Katalin

Infants' lexical processing is modulated by featural manipulations made to words, suggesting that early lexical representations are sufficiently specified to establish a match with the corresponding label. However, the precise degree of detail in early words requires further investigation due to equivocal findings. We studied this question by assessing children’s sensitivity to the degree of featural manipulation (Chapters 2 and 3), and sensitivity to the featural makeup of homorganic and heterorganic consonant clusters (Chapter 4). Gradient sensitivity on the one hand and sensitivity to homorganicity on the other hand would suggest that lexical processing makes use of sub-phonemic information, which in turn would indicate that early words contain sub-phonemic detail. The studies presented in this thesis assess children’s sensitivity to sub-phonemic detail using minimally demanding online paradigms suitable for infants: single-picture pupillometry and intermodal preferential looking. Such paradigms have the potential to uncover lexical knowledge that may be masked otherwise due to cognitive limitations. The study reported in Chapter 2 obtained a differential response in pupil dilation to the degree of featural manipulation, a result consistent with gradient sensitivity. The study reported in Chapter 3 obtained a differential response in proportion of looking time and pupil dilation to the degree of featural manipulation, a result again consistent with gradient sensitivity. The study reported in Chapter 4 obtained a differential response to the manipulation of homorganic and heterorganic consonant clusters, a result consistent with sensitivity to homorganicity. These results suggest that infants' lexical representations are not only specific, but also detailed to the extent that they contain sub-phonemic information.

Lexicon-Based methods for sentiment analysis (2011)

Taboada, Maite ; Brooke, Julian ; Tofiloski, Milan ; Voll, Kimberly ; Stede, Manfred

We present a lexicon-based approach to extracting sentiment from text. The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation. SO-CAL is applied to the polarity classification task, the process of assigning a positive or negative label to a text that captures the text's opinion towards its main subject matter. We show that SO-CAL's performance is consistent across domains and on completely unseen data. Additionally, we describe the process of dictionary creation, and our use of Mechanical Turk to check dictionaries for consistency and reliability.

Acquisition of quantifier raising of a universal across an existential (2017)

Szendrői, Kriszta ; Schumacher, Rebecca ; Fritzsche, Tom ; Höhle, Barbara

Our paper reports an act out task with German 5- and 6-year olds and adults involving doubly-quantified sentences with a universal object and an existential subject. We found that 5- and 6-year olds allow inverse scope in such sentences, while adults do not. Our findings contribute to a growing body of research (e.g. Gualmini et al. 2008; Musolino 2009, etc.) showing that children are more flexible in their scopal considerations than initially proposed by the Isomorphism proposal (Lidz & Musolino 2002; Musolino & Lidz 2006). This result provides support for a theory of German, a “no quantifier raising”-language, in terms of soft violable constraints, or global economy terms (Bobaljik & Wurmbrand 2012), rather than in terms of hard inviolable constraints or rules (Frey 1993). Finally, the results are compatible with Reinhart’s (2004) hypothesis that children do not perform global interface economy considerations due to the increased processing associated with it.

Acquisition of prosodic focus marking by English, French, and German three-, four-, five- and six-year-olds (2017)

Szendroi, Kriszta ; Bernard, Carline ; Berger, Frauke ; Gervain, Judit ; Höhle, Barbara

Previous research on young children's knowledge of prosodic focus marking has revealed an apparent paradox, with comprehension appearing to lag behind production. Comprehension of prosodic focus is difficult to study experimentally due to its subtle and ambiguous contribution to pragmatic meaning. We designed a novel comprehension task, which revealed that three- to six-year-old children show adult-like comprehension of the prosodic marking of subject and object focus. Our findings thus support the view that production does not precede comprehension in the acquisition of focus. We tested participants speaking English, German, and French. All three languages allow prosodic subject and object focus marking, but use additional syntactic marking to varying degrees (English: dispreferred; German: possible; French preferred). French participants produced fewer subject marked responses than English participants. We found no other cross-linguistic differences. Participants interpreted prosodic focus marking similarly and in an adult-like fashion in all three languages.

Intonational differences between L1 and L2 english in South Africa (2010)

Swerts, Marc ; Zerbian, Sabine

Previous studies have shown that characteristics of a person's first language (L1) may transfer to a second language (L2). The current study looks at the extent to which this holds for aspects of intonation as well. More specifically, we investigate to what extent traces of the L1 can be discerned in the way intonation is used in the L2 for two functions: (1) to highlight certain words by making them sound more prominent and (2) to signal continuation or finality in a list by manipulating the speech melody. To this end, the article presents an explorative study into the way focus and boundaries are marked prosodically in Zulu, and it also compares such prosodic functions in two variants of English in South Africa, i.e., English spoken as an L1, and English spoken as an L2/additional language by speakers who have Zulu as their L1. The latter language is commonly referred to as Black South African English. This comparison is interesting from a typological perspective, as Zulu is intonationally different from English, especially in the way prosody is exploited for signalling informationally important stretches of speech. Using a specific elicitation procedure, we found in a first study that speakers of South African English (as L1) mark focused words and position within a list by intonational means, just as in other L1 varieties of English, whereas Zulu only uses intonation for marking continuity or finality. A second study focused on speakers of Black South African English, and compared the prosody of proficient versus less proficient speakers. We found that the proficient speakers were perceptually equivalent to L1 speakers of English in their use of intonation for marking focus and boundaries. The less proficient speakers marked boundaries in a similar way as L1 speakers of English, but did not use prosody for signalling focus, analogous to what is typical of their native language. Acoustic observations match these perceptual results.

Child, family, and school factors in bilingual preschoolers’ vocabulary development in heritage languages (2020)

Sun, He ; Ng, Siew Chin ; O'Brien, Beth Ann ; Fritzsche, Tom

Child characteristics, family factors, and preschool factors are all found to affect the rate of bilingual children's vocabulary development in heritage language (HL). However, what remains unknown is the relative importance of these three sets of factors in HL vocabulary growth. The current study explored the complex issue with 457 Singaporean preschool children who are speaking either Mandarin, Malay, or Tamil as their HL. A series of internal factors (e.g., non-verbal intelligence) and external factors (e.g., maternal educational level) were used to predict children's HL vocabulary growth over a year at preschool with linear mixed effects models. The results demonstrated that external factors (i.e., family and preschool factors) are relatively more important than child characteristics in enhancing bilingual children's HL vocabulary growth. Specifically, children's language input quantity (i.e., home language dominance), input quality (e.g., number of books in HL), and HL input quantity at school (i.e., the time between two waves of tests at preschool) predict the participants' HL vocabulary growth, with initial vocabulary controlled. The relative importance of external factors in bilingual children's HL vocabulary development is attributed to the general bilingual setting in Singapore, where HL is taken as a subject to learn at preschool and children have fairly limited exposure to HL in general. The limited amount of input might not suffice to trigger the full expression of internal resources. Our findings suggest the crucial roles that caregivers and preschools play in early HL education, and the necessity of more parental involvement in early HL learning in particular.

The effect of decay and lexical uncertainty on processing long-distance dependencies in reading (2020)

Stone, Kate ; von der Malsburg, Titus Raban ; Vasishth, Shravan

To make sense of a sentence, a reader must keep track of dependent relationships between words, such as between a verb and its particle (e.g. turn the music down). In languages such as German, verb-particle dependencies often span long distances, with the particle only appearing at the end of the clause. This means that it may be necessary to process a large amount of intervening sentence material before the full verb of the sentence is known. To facilitate processing, previous studies have shown that readers can preactivate the lexical information of neighbouring upcoming words, but less is known about whether such preactivation can be sustained over longer distances. We asked the question, do readers preactivate lexical information about long-distance verb particles? In one self-paced reading and one eye tracking experiment, we delayed the appearance of an obligatory verb particle that varied only in the predictability of its lexical identity. We additionally manipulated the length of the delay in order to test two contrasting accounts of dependency processing: that increased distance between dependent elements may sharpen expectation of the distant word and facilitate its processing (an antilocality effect), or that it may slow processing via temporal activation decay (a locality effect). We isolated decay by delaying the particle with a neutral noun modifier containing no information about the identity of the upcoming particle, and no known sources of interference or working memory load. Under the assumption that readers would preactivate the lexical representations of plausible verb particles, we hypothesised that a smaller number of plausible particles would lead to stronger preactivation of each particle, and thus higher predictability of the target. This in turn should have made predictable target particles more resistant to the effects of decay than less predictable target particles. The eye tracking experiment provided evidence that higher predictability did facilitate reading times, but found evidence against any effect of decay or its interaction with predictability. The self-paced reading study provided evidence against any effect of predictability or temporal decay, or their interaction. In sum, we provide evidence from eye movements that readers preactivate long-distance lexical content and that adding neutral sentence information does not induce detectable decay of this activation. The findings are consistent with accounts suggesting that delaying dependency resolution may only affect processing if the intervening information either confirms expectations or adds to working memory load, and that temporal activation decay alone may not be a major predictor of processing time.

The interaction of grammatically distinct agreement dependencies in predictive processing (2021)

Stone, Kate ; Verissimo, Joao ; Schad, Daniel J. ; Oltrogge, Elise ; Vasishth, Shravan ; Lago, Sol

Previous research has found that comprehenders sometimes predict information that is grammatically unlicensed by sentence constraints. An open question is why such grammatically unlicensed predictions occur. We examined the possibility that unlicensed predictions arise in situations of information conflict, for instance when comprehenders try to predict upcoming words while simultaneously building dependencies with previously encountered elements in memory. German possessive pronouns are a good testing ground for this hypothesis because they encode two grammatically distinct agreement dependencies: a retrospective one between the possessive and its previously mentioned referent, and a prospective one between the possessive and its following nominal head. In two visual world eye-tracking experiments, we estimated the onset of predictive effects in participants' fixations. The results showed that the retrospective dependency affected resolution of the prospective dependency by shifting the onset of predictive effects. We attribute this effect to an interaction between predictive and memory retrieval processes.

Does entropy modulate the prediction of German long-distance verb particles? (2022)

Stone, Kate ; Vasishth, Shravan ; von der Malsburg, Titus Raban

In this paper we examine the effect of uncertainty on readers' predictions about meaning. In particular, we were interested in how uncertainty might influence the likelihood of committing to a specific sentence meaning. We conducted two event-related potential (ERP) experiments using particle verbs such as turn down and manipulated uncertainty by constraining the context such that readers could be either highly certain about the identity of a distant verb particle, such as turn the bed [...] down, or less certain due to competing particles, such as turn the music [...] up/down. The study was conducted in German, where verb particles appear clause-finally and may be separated from the verb by a large amount of material. We hypothesised that this separation would encourage readers to predict the particle, and that high certainty would make prediction of a specific particle more likely than lower certainty. If a specific particle was predicted, this would reflect a strong commitment to sentence meaning that should incur a higher processing cost if the prediction is wrong. If a specific particle was less likely to be predicted, commitment should be weaker and the processing cost of a wrong prediction lower. If true, this could suggest that uncertainty discourages predictions via an unacceptable cost-benefit ratio. However, given the clear predictions made by the literature, it was surprisingly unclear whether the uncertainty manipulation affected the two ERP components studied, the N400 and the PNP. Bayes factor analyses showed that evidence for our a priori hypothesised effect sizes was inconclusive, although there was decisive evidence against a priori hypothesised effect sizes larger than 1 mu Vfor the N400 and larger than 3 mu V for the PNP. We attribute the inconclusive finding to the properties of verb-particle dependencies that differ from the verb-noun dependencies in which the N400 and PNP are often studied.

Does entropy modulate the prediction of German long-distance verb particles? (2022)

Stone, Kate ; Vasishth, Shravan ; Malsburg, Titus von der

In this paper we examine the effect of uncertainty on readers’ predictions about meaning. In particular, we were interested in how uncertainty might influence the likelihood of committing to a specific sentence meaning. We conducted two event-related potential (ERP) experiments using particle verbs such as turn down and manipulated uncertainty by constraining the context such that readers could be either highly certain about the identity of a distant verb particle, such as turn the bed […] down, or less certain due to competing particles, such as turn the music […] up/down. The study was conducted in German, where verb particles appear clause-finally and may be separated from the verb by a large amount of material. We hypothesised that this separation would encourage readers to predict the particle, and that high certainty would make prediction of a specific particle more likely than lower certainty. If a specific particle was predicted, this would reflect a strong commitment to sentence meaning that should incur a higher processing cost if the prediction is wrong. If a specific particle was less likely to be predicted, commitment should be weaker and the processing cost of a wrong prediction lower. If true, this could suggest that uncertainty discourages predictions via an unacceptable cost-benefit ratio. However, given the clear predictions made by the literature, it was surprisingly unclear whether the uncertainty manipulation affected the two ERP components studied, the N400 and the PNP. Bayes factor analyses showed that evidence for our a priori hypothesised effect sizes was inconclusive, although there was decisive evidence against a priori hypothesised effect sizes larger than 1μV for the N400 and larger than 3μV for the PNP. We attribute the inconclusive finding to the properties of verb-particle dependencies that differ from the verb-noun dependencies in which the N400 and PNP are often studied.

Understanding the effects of constraint and predictability in ERP (2022)

Stone, Kate ; Nicenboim, Bruno ; Vasishth, Shravan ; Rösler, Frank

Intuitively, strongly constraining contexts should lead to stronger probabilistic representations of sentences in memory. Encountering unexpected words could therefore be expected to trigger costlier shifts in these representations than expected words. However, psycholinguistic measures commonly used to study probabilistic processing, such as the N400 event-related potential (ERP) component, are sensitive to word predictability but not to contextual constraint. Some research suggests that constraint-related processing cost may be measurable via an ERP positivity following the N400, known as the anterior post-N400 positivity (PNP). The PNP is argued to reflect update of a sentence representation and to be distinct from the posterior P600, which reflects conflict detection and reanalysis. However, constraint-related PNP findings are inconsistent. We sought to conceptually replicate Federmeier et al. (2007) and Kuperberg et al. (2020), who observed that the PNP, but not the N400 or the P600, was affected by constraint at unexpected but plausible words. Using a pre-registered design and statistical approach maximising power, we demonstrated a dissociated effect of predictability and constraint: strong evidence for predictability but not constraint in the N400 window, and strong evidence for constraint but not predictability in the later window. However, the constraint effect was consistent with a P600 and not a PNP, suggesting increased conflict between a strong representation and unexpected input rather than greater update of the representation. We conclude that either a simple strong/weak constraint design is not always sufficient to elicit the PNP, or that previous PNP constraint findings could be an artifact of smaller sample size.

Understanding the effects of constraint and predictability in ERP (2023)

Stone, Kate ; Nicenboim, Bruno ; Vasishth, Shravan ; Rösler, Frank

Intuitively, strongly constraining contexts should lead to stronger probabilistic representations of sentences in memory. Encountering unexpected words could therefore be expected to trigger costlier shifts in these representations than expected words. However, psycholinguistic measures commonly used to study probabilistic processing, such as the N400 event-related potential (ERP) component, are sensitive to word predictability but not to contextual constraint. Some research suggests that constraint-related processing cost may be measurable via an ERP positivity following the N400, known as the anterior post-N400 positivity (PNP). The PNP is argued to reflect update of a sentence representation and to be distinct from the posterior P600, which reflects conflict detection and reanalysis. However, constraint-related PNP findings are inconsistent. We sought to conceptually replicate Federmeier et al. (2007) and Kuperberg et al. (2020), who observed that the PNP, but not the N400 or the P600, was affected by constraint at unexpected but plausible words. Using a pre-registered design and statistical approach maximising power, we demonstrated a dissociated effect of predictability and constraint: strong evidence for predictability but not constraint in the N400 window, and strong evidence for constraint but not predictability in the later window. However, the constraint effect was consistent with a P600 and not a PNP, suggesting increased conflict between a strong representation and unexpected input rather than greater update of the representation. We conclude that either a simple strong/weak constraint design is not always sufficient to elicit the PNP, or that previous PNP constraint findings could be an artifact of smaller sample size.

Decoding music perception and imagination using deep-learning techniques (2018)

Stober, Sebastian ; Sternin, Avital

Deep learning is a sub-field of machine learning that has recently gained substantial popularity in various domains such as computer vision, automatic speech recognition, natural language processing, and bioinformatics. Deep-learning techniques are able to learn complex feature representations from raw signals and thus also have potential to improve signal processing in the context of brain-computer interfaces (BCIs). However, they typically require large amounts of data for training - much more than what can often be provided with reasonable effort when working with brain activity recordings of any kind. In order to still leverage the power of deep-learning techniques with limited available data, special care needs to be taken when designing the BCI task, defining the structure of the deep model, and choosing the training method. This chapter presents example approaches for the specific scenario of music-based brain-computer interaction through electroencephalography - in the hope that these will prove to be valuable in different settings as well. We explain important decisions for the design of the BCI task and their impact on the models and training techniques that can be used. Furthermore, we present and compare various pre-training techniques that aim to improve the signal-to-noise ratio. Finally, we discuss approaches to interpret the trained models.

Successive cyclicity as residual wh-scope marking (2006)

Stepanov, Arthur ; Stateva, Penka

Recursion in natural language syntax: a minimalist perspective (2005)

Stepanov, Arthur

Robert Frank: Phrase structure composition and syntactic dependencies / rezensiert von Arthur Stepanov (2006)

Stepanov, Arthur

Rezensiertes Werk: Frank, Robert: Phrase Structure Composition and Syntactic Dependencies. - Cambridge, MA : MIT Press, 2002. - xiv + 326 S.

The Development of Syntactic and Pragmatic Aspects of Language in Children with Developmental Disorders (2019)

Stegenwallner-Schütz, Maja Henny Katherine

Connective-Lex (2019)

Stede, Manfred ; Scheffler, Tatjana ; Mendes, Amalia

In this paper, we present a tangible outcome of the TextLink network: a joint online database project displaying and linking existing and newly-created lexicons of discourse connectives in multiple languages. We discuss the definition and demarcation of the class of connectives that should be included in such a resource, and present the syntactic, semantic/pragmatic, and lexicographic information we collected. Further, the technical implementation of the database and the search functionality are presented. We discuss how the multilingual integration of several connective lexicons provides added value for linguistic researchers and other users interested in connectives, by allowing crosslinguistic comparison and a direct linking between discourse relational devices in different languages. Finally, we provide pointers for possible future extensions both in breadth (i.e., by adding lexicons for additional languages) and depth (by extending the information provided for each connective item and by strengthening the crosslinguistic links).

The role of illocutionary status in the usage conditions of causal connectives and in coherence relations (2012)

Stede, Manfred ; Peldszus, Andreas

The meaning of linguistic connectives has often been characterized in terms of their position in a bipartite (semantic, pragmatic) or a tripartite (content, epistemic, speech act) structure of domains, depending on what kinds of entities are being connected (largely: propositions or speech acts). This paper argues that a more fine-grained analysis can be achieved by directing some more attention to the characterization of the entities being related. We propose an inventory of categories of illocutionary status for labelling the spans that are being connected. On this basis, the distinction between the content and the epistemic domain, in particular, can be made more explicit. Focusing on the group of causal connectives in German, we conducted a corpus annotation study from which we derived distinct pragmatic 'usage profiles' of the most frequent causal connectives. Finally, we offer some suggestions on the role of illocutions in relation-based accounts of discourse structure.

Identifying the content zones of German court decisions (2009)

Stede, Manfred ; Kuhn, Florian

Inter-operability and reusability the science of annotation (2012)

Stede, Manfred ; Huang, Chu-Ren

Annotating linguistic data has become a major field of interest, both for supplying the necessary data for machine learning approaches to NLP applications, and as a research issue in its own right. This comprises issues of technical formats, tools, and methodologies of annotation. We provide a brief overview of these notions and then introduce the papers assembled in this special issue.

Salience in discurse : multidisciplinary approaches to discourse 2005 (2005)

Stede, Manfred ; Chiarcos, Christian ; Grabski, Michael ; Lagerwerf, Luuk

Automatic argumentation mining and the role of stance and sentiment (2020)

Stede, Manfred

Argumentation mining is a subfield of Computational Linguistics that aims (primarily) at automatically finding arguments and their structural components in natural language text. We provide a short introduction to this field, intended for an audience with a limited computational background. After explaining the subtasks involved in this problem of deriving the structure of arguments, we describe two other applications that are popular in computational linguistics: sentiment analysis and stance detection. From the linguistic viewpoint, they concern the semantics of evaluation in language. In the final part of the paper, we briefly examine the roles that these two tasks play in argumentation mining, both in current practice, and in possible future systems.

Does discourse processing need discourse topics? (2004)

Stede, Manfred

DiMLex: a lexical approach to discourse markers (2002)

Stede, Manfred

Polibox: Generating desciptions, comparisons, and recommendations from a database (2002)

Stede, Manfred

From connectives to coherence relations (2020)

Stede, Manfred

The notion of coherence relations is quite widely accepted in general, but concrete proposals differ considerably on the questions of how they should be motivated, which relations are to be assumed, and how they should be defined. This paper takes a "bottom-up" perspective by assessing the contribution made by linguistic signals (connectives), using insights from the relevant literature as well as verification by practical text annotation. We work primarily with the German language here and focus on the realm of contrast. Thus, we suggest a new inventory of contrastive connective functions and discuss their relationship to contrastive coherence relations that have been proposed in earlier work.

Disambiguating rhetorical structure (2008)

Stede, Manfred

Empirical studies of text coherence often use tree-like structures in the spirit of Rhetorical Structure Theory (RST) as representational device. This paper identifies several sources of ambiguity in RST-inspired trees and argues that such structures are therefore not as explanatory as a text representation should be. As an alternative, an approach toward multi-level annotation (MLA) of texts is proposed, which separates the information into distinct levels of representation, in particular: referential structure, thematic structure, conjunctive relations, and intentional structure. Levels are conceptually built upon each other, and human annotators can produce them using a dedicated software environment. We argue that the resulting multi-level corpora are descriptively more adequate, and as a resource are more useful than RST-style treebanks.

RST revisited : disentangling nuclearity (2008)

Stede, Manfred

Connective-based local coherence analysis : a lexicon for recognizing causal relationships (2008)

Stede, Manfred

Plato on nature (φύσις) and convention (συνθήκη) (2019)

Staudacher, Peter

New frontiers beyond context-freeness (1993)

Staudacher, Peter

LeMo - an expert-system for the assessment of lexical and morphological impairments in aphasia (1995)

Stadie, Nicole ; Tabatabaie, Sia ; De Bleser, Ria ; Cholewa, Jürgen

Evaluation of lexically and nonlexically based reading treatment in a deep dyslexic (2006)

Stadie, Nicole ; Rilling, Eva

The aim of the single case study was to evaluate two different treatment procedures to improve reading skills with a German-speaking deep dyslexic. Generally, in treatment studies for deep dyslexia, retraining of grapheme-phoneme correspondences is described, but hardly any treatment focuses on reactivating residual functions of the semantic- lexical route. This strategy was explored here with an experimentally presented priming paradigm, to implicitly strengthen residual skills of lexical access with semantically/phonologically related primes (lexically based treatment). In contrast, grapheme-phoneme associations and blending were explicitly relearned during a nonlexically based treatment. Stimuli were controlled for part of speech, word length, and frequency. A cross-over design to identify item- and treatment-specific effects for both procedures was applied. Results indicate positive outcomes with respect to treatment-specific effects for both procedures, generalization to untrained items, and a transfer task after the nonlexically based procedure. All effects remained stable in the follow-up assessment. Implications for theoretically/ empirically generated expectations about treatment outcomes are discussed

Oral and written naming in a multilingual aphasic patient (1995)

Springer, Lydia ; Burk, F. ; De Bleser, Ria ; Stadie, Nicole

Does it have to be trees? : Data-driven dependency parsing with incomplete and noisy training data (2011)

Spreyer, Kathrin

We present a novel approach to training data-driven dependency parsers on incomplete annotations. Our parsers are simple modifications of two well-known dependency parsers, the transition-based Malt parser and the graph-based MST parser. While previous work on parsing with incomplete data has typically couched the task in frameworks of unsupervised or semi-supervised machine learning, we essentially treat it as a supervised problem. In particular, we propose what we call agnostic parsers which hide all fragmentation in the training data from their supervised components. We present experimental results with training data that was obtained by means of annotation projection. Annotation projection is a resource-lean technique which allows us to transfer annotations from one language to another within a parallel corpus. However, the output tends to be noisy and incomplete due to cross-lingual non-parallelism and error-prone word alignments. This makes the projected annotations a suitable test bed for our fragment parsers. Our results show that (i) dependency parsers trained on large amounts of projected annotations achieve higher accuracy than the direct projections, and that (ii) our agnostic fragment parsers perform roughly on a par with the original parsers which are trained only on strictly filtered, complete trees. Finally, (iii) when our fragment parsers are trained on artificially fragmented but otherwise gold standard dependencies, the performance loss is moderate even with up to 50% of all edges removed.

Not only the apples (2014)

Spalek, Katharina ; Gotzner, Nicole ; Wartenburger, Isabell

Focus sensitive particles highlight the relevance of contextual alternatives for the interpretation of a sentence. Two experiments tested whether this leads to better encoding and therefore, ultimately, better recall of focus alternatives. Participants were presented with auditory stimuli that introduced a set of elements ("context sentence") and continued in three different versions: the critical sentences either contained the exclusive particle nur ("only"), the inclusive particle sogar ("even"), or no particle (control condition). After being exposed to blocks of ten trials, participants were asked to recall the elements in the context sentence. The results show that both particles enhanced memory performance for the alternatives to the focused element, relative to the control condition. The results support the assumption that information-structural alternatives are better encoded in memory in the presence of a focus sensitive particle.

Global organization in Spanish onsets (2020)

Sotiropoulou, Stavroula ; Gibson, Mark ; Gafos, Adamantios I.

This paper addresses the relation between syllable structure and inter-segmental temporal coordination. The data examined are Electromagnetic Articulometry recordings from six speakers of Central Peninsular Spanish (henceforth, Spanish), producing words beginning with the clusters /pl, bl, kl, gl, p(sic), k(sic), t(sic)/ as well as corresponding unclustered sonorant-initial words in three vowel contexts /a, e, o/. In our results, we find evidence for a global organization of the segments involved in these combinations. This is reflected in a number of ways: shortening of the prevocalic sonorant in the cluster-initial case compared to the unclustered case, reorganization of the relative timing of the internal CV subsequence (in a CCV) in the obstruent-lateral context, early vowel initiation, and a strong compensatory relation between the duration of the obstruent-to-lateral transition and the duration of the lateral. In other words, we find that the global organization presiding over the segments partaking in these tautosyllabic CCVs is pleiotropic, that is, simultaneously expressed over a set of different phonetic parameters rather than via a privileged metric such as c-center stability or any other such given single measure (employed in prior works).

Phonetic indices of syllabic organization in German stop-lateral clusters (2022)

Sotiropoulou, Stavroula ; Gafos, Adamantios I.

Using articulatory data from five German speakers, we study how segmental sequences under different syllabic organizations respond to perturbations of phonetic parameters in the segments that compose them. Target words contained stop-lateral sequences /bl, gl, kl, pl/ in word-initial and cross-word contexts and were embedded in carrier phrases with different prosodic boundaries, i.e., no phrase boundary versus an utterance phrase boundary preceded the target word in the case of word-initial clusters, or separated the consonants in the case of cross-word sequences. For word-initial cluster (CCV) onsets, we find that increasing C1 stop duration or the lag between two consonants leads to earlier vowel initiation and reduced local timing stability across CV, CCV. Furthermore, as the inter-consonantal lag increases, C2 duration decreases. In contrast, for cross-word C#CV sequences, increasing inter-consonantal lag does not lead to earlier vowel initiation and robust local timing stability is maintained across CV, C#CV. In other words, in CCV sequences within words, local perturbations to segments have effects that ripple through the rest of the sequence. Instead, in cross-word C#CV sequences, local perturbations stay local. Overall, the findings indicate that the effects of phonetic perturbations on coordination patterns depend on the syllabic organization superimposed on these clusters.

Pleiotropy of phonetic indices in the expression of syllabic organization (2019)

Sotiropoulou, Stavroula

This dissertation is concerned with the relation between qualitative phonological organization in the form of syllabic structure and continuous phonetics, that is, the spatial and temporal dimensions of vocal tract action that express syllabic structure. The main claim of the dissertation is twofold. First, we argue that syllabic organization exerts multiple effects on the spatio-temporal properties of the segments that partake in that organization. That is, there is no unique or privileged exponent of syllabic organization. Rather, syllabic organization is expressed in a pleiotropy of phonetic indices. Second, we claim that a better understanding of the relation between qualitative phonological organization and continuous phonetics is reached when one considers how the string of segments (over which the nature of the phonological organization is assessed) responds to perturbations (scaling of phonetic variables) of localized properties (such as durations) within that string. Specifically, variation in phonetic variables and more specifically prosodic variation is a crucial key to understanding the nature of the link between (phonological) syllabic organization and the phonetic spatio-temporal manifestation of that organization. The effects of prosodic variation on segmental properties and on the overlap between the segments, we argue, offer the right pathway to discover patterns related to syllabic organization. In our approach, to uncover evidence for global organization, the sequence of segments partaking in that organization as well as properties of these segments or their relations with one another must be somehow locally varied. The consequences of such variation on the rest of the sequence can then be used to unveil the span of organization. When local perturbations to segments or relations between adjacent segments have effects that ripple through the rest of the sequence, this is evidence that organization is global. If instead local perturbations stay local with no consequences for the rest of the whole, this indicates that organization is local.

A principled approach to feature selection in models of sentence processing (2020)

Smith, Garrett ; Vasishth, Shravan

Among theories of human language comprehension, cue-based memory retrieval has proven to be a useful framework for understanding when and how processing difficulty arises in the resolution of long-distance dependencies. Most previous work in this area has assumed that very general retrieval cues like [+subject] or [+singular] do the work of identifying (and sometimes misidentifying) a retrieval target in order to establish a dependency between words. However, recent work suggests that general, handpicked retrieval cues like these may not be enough to explain illusions of plausibility (Cunnings & Sturt, 2018), which can arise in sentences like The letter next to the porcelain plate shattered. Capturing such retrieval interference effects requires lexically specific features and retrieval cues, but handpicking the features is hard to do in a principled way and greatly increases modeler degrees of freedom. To remedy this, we use well-established word embedding methods for creating distributed lexical feature representations that encode information relevant for retrieval using distributed retrieval cue vectors. We show that the similarity between the feature and cue vectors (a measure of plausibility) predicts total reading times in Cunnings and Sturt's eye-tracking data. The features can easily be plugged into existing parsing models (including cue-based retrieval and self-organized parsing), putting very different models on more equal footing and facilitating future quantitative comparisons.

Discontinuous noun phrases in Yucatec Maya (2022)

Skopeteas, Stavros ; Verhoeven, Elisabeth ; Fanselow, Gisbert

Languages differ in whether or not they allow discontinuous noun phrases. If they do, they further vary in the ways the nominal projections interact with the available syntactic operations. Yucatec Maya has two left-peripheral configurations that differ syntactically: a preverbal position for foci or wh-elements that is filled in by movement, and the possibility to adjoin topics at the highest clausal layer. These two structural options are reflected in different ways of the formation of discontinuous patterns. Subextraction from nominal projections to the focus position yielding discontinuous NPs is possible, but subject to several restrictions. It observes conditions on extraction domains, and does not apply to the left branch of nominal structures. The topic position also appears to license discontinuity, typically involving a non-referential nominal expression as the topic and quantifiers/adjectives that form an elliptical nominal projection within the clause proper. Such constructions can involve several morphological and syntactic mismatches between their parts that are excluded for continuous noun phrases, and they are not sensitive to syntactic island restrictions. Thus, in a strict sense, discontinuities involving the topic position are only apparent, because the construction involves two independent nominal projections that are semantically linked.

The interaction between topicalization and structural constraints : evidence from Yucatec Maya (2009)

Skopeteas, Stavros ; Verhoeven, Elisabeth

This article deals with the syntactic and pragmatic properties of left dislocated constituents in Yucatec Maya. It has been argued that these constituents are topics, which implies that a particular structural configuration, namely left dislocation displays a 1:1 correspondence to a particular discourse function. We present evidence that the discourse properties of left dislocation are not uniform: only a subset of the left dislocated constituents qualify as topics in the strict sense, while other instances of left dislocation are better explained if we assume a structural constraint that bans the postverbal occurrence of subject constituents in a particular syntactic configuration. Our empirical findings show that though the occurrence of word order possibilities in discourse is not random, it is not necessarily determined by a unique licensing condition.

Word order and intonation in Georgian (2009)

Skopeteas, Stavros ; Féry, Caroline ; Asatiani, Rusudan

Georgian is famous for its word order flexibility: all permutations of constituent order are possible and the choice among them is primarily determined by information structure. In this paper, we show that word order is not the only means to encode information structure in this language, but it is used in combination with sentence prosody. After a preliminary description of the use of prosodic phrasing and intonation for this purpose, we address the question of the interrelation between these two strategies. Based on experimental evidence, we investigate the interaction of focus with word order and prosody, and we conclude that some aspects of word order variation are pragmatically vacuous and can be accommodated in any context if they are realized with an appropriate prosodic structure, while other word order phenomena are quite restrictive and cannot be overridden through prosodic manipulations.

Questionnaire on information structure (OUIS): reference manual (2006)

Skopeteas, Stavros ; Fiedler, Ines ; Hellmuth, Sam ; Schwarz, Anne ; Stoel, Ruben ; Fanselow, Gisbert ; Féry, Caroline ; Krifka, Manfred

Contents: Chapter 1. Introduction 1 Information Structure 2 Grammatical Correlates of Information Structure 3 Structure of the Questionnaire 4 Experimental Tasks 5 Technicalities 6 Archiving 7 Acknowledgments Chapter 2. General Questions 1 General Information 2 Phonology 3 Morphology and Syntax Chapter 3. Experimental tasks 1 Changes (Given/New in Intransitives and Transitives) 2 Giving (Given/New in Ditransitives) 3 Visibility (Given/New, Animacy and Type/Token Reference) 4 Locations (Given/New in Locative Expressions) 5 Sequences (Given/New/Contrast in Transitives) 6 Dynamic Localization (Given/New in Dynamic Loc. Descriptions) 7 Birthday Party (Weight and Discourse Status) 8 Static Localization (Macro-Planning and Given/New in Locatives) 9 Guiding (Presentational Utterances) 10 Event Cards (All New) 11 Anima (Focus types and Animacy) 12 Contrast (Contrast in pairing events) 13 Animal Game (Broad/Narrow Focus in NP) 14 Properties (Focus on Property and Possessor) 15 Eventives (Thetic and Categorical Utterances) 16 Tell a Story (Contrast in Text) 17 Focus Cards (Selective, Restrictive, Additive, Rejective Focus) 18 Who does What (Answers to Multiple Constituent Questions) 19 Fairy Tale (Topic and Focus in Coherent Discourse) 20 Map Task (Contrastive and Selective Focus in Spontaneous Dialogue) 21 Drama (Contrastive Focus in Argumentation) 22 Events in Places (Spatial, Temporal and Complex Topics) 23 Path Descriptions (Topic Change in Narrative) 24 Groups (Partial Topic) 25 Connections (Bridging Topic) 26 Indirect (Implicational Topic) 27 Surprises (Subject-Topic Interrelation) 28 Doing (Action Given, Action Topic) 29 Influences (Question Priming) Chapter 4. Translation tasks 1 Basic Intonational Properties 2 Focus Translation 3 Topic Translation 4 Quantifiers Chapter 5. Information structure summary survey 1 Preliminaries 2 Syntax 3 Morphology 4 Prosody 5 Summary: Information structure Chapter 6. Performance of Experimental Tasks in the Field 1 Field sessions 2 Field Session Metadata 3 Informants’ Agreement

Focus in Georgian and the expression of contrast (2010)

Skopeteas, Stavros ; Fanselow, Gisbert

This paper examines the impact of contrastive focus in Georgian syntax. In a semi-naturalistic production study, we elicited spontaneous answers to questions which have shown that contexts involving contrastive focus induce placement of the focused constituent at the immediately preverbal position more frequently than other contexts. Based on this observation we investigate the properties of Georgian grammar which may account for the different impact of contrastive vs. non-contrastive contexts on word order. We first examine the involved syntactic structures and present evidence that preverbal focus is a result of movement to the specifier position of a functional projection whose head attracts the finite verb. We then address the question whether there is evidence for an association between contrast and movement to this position and we provide evidence that the correlation between context and order in the behavioral data does not result from a biunique form-function association of the kind 'contrast <-> move-movement to the specifier position', but from an asymmetry at a discourse level such that contexts involving contrast induce answers in which focused constituents occupy the stressed position in the clause more often than contexts that do not.

Effects of givenness and constraints on free word order (2010)

Skopeteas, Stavros ; Fanselow, Gisbert

Word oder and information structure: empirical methods for linguistic fieldwork (2009)

Skopeteas, Stavros

The role of givenness, presupposition, and prosody in Czech word order: An experimental study (2015)

Simik, Radek ; Wierzba, Marta

The PRO-wh connection in modal existential wh-constructions an argument in favor of semantic control (2013)

Simik, Radek

Recent discussion of obligatory control in the literature mostly concentrates on the issue of which syntactic module (movement, agreement, etc.) is responsible for the establishment of the control relation. This paper looks at the issue of control from a higher order perspective. Abandoning the presupposition that control constituents denote propositions and that, therefore, control must be syntactic, I deliver an argument in favor of the property-type analysis of control constituents and, by transitivity, for a semantic resolution of the control relation. The argument comes from modal existential wh-constructions and in particular from a strong parallelism between obligatorily controlled PRO and wh-expressions. It is revealed that PRO and wh-words form a natural class, to the exclusion of all other types of nominal expressions. This is then turned into an argument of treating PRO (and wh-words) essentially as a logical lambda-operator, naturally leading to the property theory of control. In addition, the article contributes to our understanding of the syntax, semantics, and typology of modal existential wh-constructions. It is argued that at least one type of these constructions, what I call "control MECs", is embedded (minimally) by a complex predicate BE+FOR which expresses the state of availability (BE) which makes it possible for someone to profit (FOR) from the event characterized by the modal existential wh-construction.

Morphological constraints in children's spoken language comprehension - a visual world study of plurals inside compounds in English (2013)

Silva, Renita ; Gerth, Sabrina ; Clahsen, Harald

Many previous studies have shown that the human language processor is capable of rapidly integrating information from different sources during reading or listening. Yet, little is known about how this ability develops from child to adulthood. To gain insight into how children (in comparison to adults) handle different kinds of linguistic information during on-line language comprehension, the current study investigates a well-known morphological phenomenon that is subject to both structural and semantic constraints, the plurals-in-compounds effect, i.e. the dislike of plural (specifically regular plural) modifiers inside compounds (e.g. rats eater). We examined 96 seven-to-twelve-year-old children and a control group of 32 adults measuring their eye-gaze changes in response to compound-internal plural and singular forms. Our results indicate that children rely more upon structural properties of language (in the present case, morphological cues) early in development and that the ability to efficiently integrate information from multiple sources takes time for children to reach adult-like levels.

The deviant lexicon-approaching lexical processing in Williams syndrome (2003)

Siegmüller, Julia ; Bartels, Henrik

Sentiment analysis of German Twitter (2019)

Sidarenka, Uladzimir

The immense popularity of online communication services in the last decade has not only upended our lives (with news spreading like wildfire on the Web, presidents announcing their decisions on Twitter, and the outcome of political elections being determined on Facebook) but also dramatically increased the amount of data exchanged on these platforms. Therefore, if we wish to understand the needs of modern society better and want to protect it from new threats, we urgently need more robust, higher-quality natural language processing (NLP) applications that can recognize such necessities and menaces automatically, by analyzing uncensored texts. Unfortunately, most NLP programs today have been created for standard language, as we know it from newspapers, or, in the best case, adapted to the specifics of English social media. This thesis reduces the existing deficit by entering the new frontier of German online communication and addressing one of its most prolific forms—users’ conversations on Twitter. In particular, it explores the ways and means by how people express their opinions on this service, examines current approaches to automatic mining of these feelings, and proposes novel methods, which outperform state-of-the-art techniques. For this purpose, I introduce a new corpus of German tweets that have been manually annotated with sentiments, their targets and holders, as well as lexical polarity items and their contextual modifiers. Using these data, I explore four major areas of sentiment research: (i) generation of sentiment lexicons, (ii) fine-grained opinion mining, (iii) message-level polarity classification, and (iv) discourse-aware sentiment analysis. In the first task, I compare three popular groups of lexicon generation methods: dictionary-, corpus-, and word-embedding–based ones, finding that dictionary-based systems generally yield better polarity lists than the last two groups. Apart from this, I propose a linear projection algorithm, whose results surpass many existing automatically-generated lexicons. Afterwords, in the second task, I examine two common approaches to automatic prediction of sentiment spans, their sources, and targets: conditional random fields (CRFs) and recurrent neural networks, obtaining higher scores with the former model and improving these results even further by redefining the structure of CRF graphs. When dealing with message-level polarity classification, I juxtapose three major sentiment paradigms: lexicon-, machine-learning–, and deep-learning–based systems, and try to unite the first and last of these method groups by introducing a bidirectional neural network with lexicon-based attention. Finally, in order to make the new classifier aware of microblogs' discourse structure, I let it separately analyze the elementary discourse units of each tweet and infer the overall polarity of a message from the scores of its EDUs with the help of two new approaches: latent-marginalized CRFs and Recursive Dirichlet Process.

Formal analysis of èto-clefts in Russian: syntax and semantics (2024)

Shipova, Evgeniya

Èto-clefts are Russian focus constructions with the demonstrative pronoun èto ‘this’ at the beginning: “Èto Mark vyigral gonku” (“It was Mark who won the race”). They are often being compared with English it-clefts, German es-clefts, as well as the corresponding focus-background structures in other languages. In terms of semantics, èto-clefts have two important properties which are cross-linguistically typical for clefts: existence presupposition (“Someone won the race”) and exhaustivity (“Nobody except Mark won the race”). However, the exhaustivity effects are not as strong as exhaustivity effects in structures with the exclusive only and require more research. At the same time, the question if the syntactic structure of èto-clefts matches the biclausal structure of English and German clefts, remains open. There are arguments in favor of biclausality, as well as monoclausality. Besides, there is no consistency regarding the status of èto itself. Finally, the information structure of èto-clefts has remained underexplored in the existing literature. This research investigates the information-structural, syntactic, and semantic properties of Russian clefts, both theoretically (supported by examples from Russian text corpora and judgments from native speakers) and experimentally. It is determined which desired changes in the information structure motivate native speakers to choose an èto-cleft and not the canonical structure or other focus realization tools. Novel syntactic tests are conducted to find evidence for bi-/monoclausality of èto-clefts, as well as for base-generation or movement of the cleft pivot. It is hypothesized that èto has a certain important function in clefts, and its status is investigated. Finally, new experiments on the nature of exhaustivity in èto-clefts are conducted. They allow for direct cross-linguistic comparison, using an incremental-information paradigm with truth-value judgments. In terms of information structure, this research makes a new proposal that presents èto-clefts as structures with an inherent focus-background bipartitioning. Even though èto-clefts are used in typical focus contexts, evidence was found that èto-clefts (as well as Russian thetic clefts) allow for both new information focus and contrastive focus. Èto-clefts are pragmatically acceptable when a singleton answer to the implied question is expected (e.g. “It was Mark who won the race” but not “It was Mark who came to the party”). Importantly, èto in Russian clefts is neither dummy, nor redundant, but is a topic expression; conveys familiarity which triggers existence presupposition; refers to an instantiated event, or a known/perceivable situation; finally, èto plays an important role in the spoken language as a tool for speech coherency and a focus marker. In terms of syntax, this research makes a new monoclausal proposal and shows evidence that the cleft pivot undergoes movement to the left peripheral position. Èto is proposed to be TopP. Finally, in terms of semantics, a novel cross-linguistic evaluation of Russian clefts is made. Experiments show that the exhaustivity inference in èto-clefts is not robust. Participants used different strategies in resolving exhaustivity, falling into 2 groups: one group considered èto-clefts exhaustive, while another group considered them non-exhaustive. Hence, there is evidence for the pragmatic nature of exhaustivity in èto-clefts. The experimental results for èto-clefts are similar to the experimental results for clefts in German, French and Akan. It is concluded that speakers use different tools available in their languages to produce structures with similar interpretive properties.

Stochastic Time Models of Syllable Structure (2015)

Shaw, Jason A. ; Gafos, Adamantios I.

Drawing on phonology research within the generative linguistics tradition, stochastic methods, and notions from complex systems, we develop a modelling paradigm linking phonological structure, expressed in terms of syllables, to speech movement data acquired with 3D electromagnetic articulography and X-ray microbeam methods. The essential variable in the models is syllable structure. When mapped to discrete coordination topologies, syllabic organization imposes systematic patterns of variability on the temporal dynamics of speech articulation. We simulated these dynamics under different syllabic parses and evaluated simulations against experimental data from Arabic and English, two languages claimed to parse similar strings of segments into different syllabic structures. Model simulations replicated several key experimental results, including the fallibility of past phonetic heuristics for syllable structure, and exposed the range of conditions under which such heuristics remain valid. More importantly, the modelling approach consistently diagnosed syllable structure proving resilient to multiple sources of variability in experimental data including measurement variability, speaker variability, and contextual variability. Prospects for extensions of our modelling paradigm to acoustic data are also discussed.

The use of focus markers in second language word processing (2009)

Sennema-Skowronek, Anke

There are many factors which make speaking and understanding a second language (L2) a highly complex challenge. Skills and competencies in in both linguistic and metalinguistic areas emerge as parts of a multi-faceted, flexible concept underlying bilingual/multilingual communication. On the linguistic level, a combination of an extended knowledge of idiomatic expressions, a broad lexical familiarity, a large vocabulary size, and the ability to deal with phonetic distinctions and fine phonetic detail has been argued necessary for effective nonnative comprehension of spoken language. The scientific interest in these factors has also led to more interest in the L2’s information structure, the way in which information is organised and packaged into informational units, both within and between clauses. On a practical level, the information structure of a language can offer the means to assign focus to a certain element considered important. Speakers can draw from a rich pool of linguistic means to express this focus, and listeners can in turn interpret these to guide them to the highlighted information which in turn facilitates comprehension, resulting in an appropriate understanding of what has been said. If a speaker doesn’t follow the principles of information structure, and the main accent in a sentence is placed on an unimportant word, then there may be inappropriate information transfer within the discourse, and misunderstandings. The concept of focus as part of the information structure of a language, the linguistic means used to express it, and the differential use of focus in native and nonnative language processing are central to this dissertation. Languages exhibit a wide range of ways of directing focus, including by prosodic means, by syntactic constructions, and by lexical means. The general principles underlying information structure seem to contrast structurally across different languages, and they can also differ in the way they express focus. In the context of L2 acquisition, characteristics of the L1 linguistic system are argued to influence the acquisition of the L2. Similarly, the conceptual patterns of information structure of the L1 may influence the organization of information in the L2. However, strategies and patterns used to exploit information structure for succesful language comprehension in the native L1, may not apply at all, or work in different ways or todifferent degrees in the L2. This means that L2 learners ideally have to understand the way that information structure is expressed in the L2 to fully use the information structural benefit in the L2. The knowledge of information structural requirements in the L2 could also imply that the learner would have to make adjustments regarding the use of information structural devices in the L2. The general question is whether the various means to mark focus in the learners’ native language are also accessible in the nonnative language, and whether a L1-L2 transfer of their usage should be considered desirable. The current work explores how information structure helps the listener to discover and structure the forms and meanings of the L2. The central hypothesis is that the ability to access information structure has an impact on the level of the learners’ appropriateness and linguistic competence in the L2. Ultimately, the ability to make use of information structure in the L2 is believed to underpin the L2 learners’ ability to effectively communicate in the L2. The present study investigated how use of focus markers affects processing speed and word recall recall in a native-nonnative language comparison. The predominant research question was whether the type of focus marking leads to more efficient and accurate word processing in marked structures than in unmarked structures, and whether differences in processing patterns can be observed between the two language conditions. Three perception studies were conducted, each concentrating on one of the following linguistic parameters: 1. Prosodic prominence: Does prosodic focus conveyed by sentence accent and by word position facilitate word recognition? 2. Syntactical means: Do cleft constructions result in faster and more accurate word processing? 3. Lexical means: Does focus conveyed by the particles even/only (German: sogar/nur) facilitate word processing and word recall? Experiments 2 and 3 additionally investigated the contribution of context in the form of preceding questions. Furthermore, they considered accent and its facilitative effect on the processing of words which are in the scope of syntactic or lexical focus marking. All three experiments tested German learners of English in a native German language condition and in English as their L2. Native English speakers were included as a control for the English language condition. Test materials consisted of single sentences, all dealing with bird life. Experiment 1 tested word recognition in three focus conditions (broad focus, narrow focus on the target, and narrow focus on a constituent than the target) in one condition using natural unmanipulated sentences, and in the other two conditions using spliced sentences. Experiment 2 (effect of syntactic focus marking) and Experiment 3 (effect of lexical focus marking) used phoneme monitoring as a measure for the speed of word processing. Additionally, a word recall test (4AFC) was conducted to assess the effective entry of target-bearing words in the listeners’ memory. Experiment 1: Focus marking by prosodic means Prosodic focus marking by pitch accent was found to highlight important information (Bolinger, 1972), making the accented word perceptually more prominent (Klatt, 1976; van Santen & Olive, 1990; Eefting, 1991; Koopmans-van Beinum & van Bergem, 1989). However, accent structure seems to be processed faster in native than in nonnative listening (Akker& Cutler, 2003, Expt. 3). Therefore, it is expected that prosodically marked words are better recognised than unmarked words, and that listeners can exploit accent structure better for accurate word recognition in their L1 than they do in the L2 (L1 > L2). Altogether, a difference in word recognition performance in L1 listening is expected between different focus conditions (narrow focus > broad focus). Results of Experiments 1 show that words were better recognized in native listening than in nonnative listening. Focal accent, however, doesn’t seem to help the German subjects recognize accented words more accurately, in both the L1 and the L2. This could be due to the focus conditions not being acoustically distinctive enough. Results of experiments with spliced materials suggest that the surrounding prosodic sentence contour made listeners remember a target word and not the local, prosodic realization of the word. Prosody seems to indeed direct listeners’ attention to the focus of the sentence (see Cutler, 1976). Regarding the salience of word position, VanPatten (2002; 2004) postulated a sentence location principle for L2 processing, stating a ranking of initial > final > medial word position. Other evidence mentions a processing adantage of items occurring late in the sentence (Akker & Cutler, 2003), and Rast (2003) observed in an English L2 production study a trend of an advantage of items occurring at the outer ends of the sentence. The current Experiment 1 aimed to keep the length of the sentences to an acceptable length, mainly to keep the task in the nonnative lnaguage condition feasable. Word length showed an effect only in combination with word position (Rast, 2003; Rast & Dommergues, 2003). Therefore, word length was included in the current experiment as a secondary factor and without hypotheses. Results of Experiment 1 revealed that the length of a word doesn’t seem to be important for its accurate recognition. Word position, specifically the final position, clearly seems to facilitate accurate word recognition in German. A similar trend emerges in condition English L2, confirming Klein (1984) and Slobin (1985). Results don’t support the sentence location principle of VanPatten (2002; 2004). The salience of the final position is interpreted as recency effect (Murdock, 1962). In addition, the advantage of the final position may benefit from the discourse convention that relevant background information is referred to first, and then what is novel later (Haviland & Clark, 1974). This structure is assumed to cue the listener as to what the speaker considers to be important information, and listeners might have reacted according to this convention. Experiment 2: Focus marking by syntactic means Atypical syntactic structures often draw listeners’ attention to certain information in an utterance, and the cleft structure as a focus marking device appears to be a common surface feature in many languages (Lambrecht, 2001). Surface structure influences sentence processing (Foss & Lynch, 1969; Langford & Holmes, 1979), which leads to competing hypotheses in Experiment 2: on the one hand, the focusing effect of the cleft construction might reduce processing times. On the other, cleft constructions in German were found to be used less to mark fo than in English (Ahlemeyer & Kohlhof, 1999; Doherty, 1999; E. Klein, 1988). The complexity of the constructions, and the experience from the native language might work against an advantage of the focus effect in the L2. Results of Experiment 2 show that the cleft structure is an effective device to mark focus in German L1. The processing advantage is explained by the low degree of structural markedness of cleft structures: listeners use the focus function of sentence types headed by the dummy subject es (English: it) due to reliance on 'safe' subject-prominent SVO-structures. The benefit of cleft is enhanced when the sentences are presented with context, suggesting a substantial benefit when focus effects of syntactic surface structure and coherence relation between sentences are integrated. Clefts facilitate word processing for English native speakers. Contrary to German L1, the marked cleft construction doesn’t reduce processing times in English L2. The L1-L2 difference was interpreted as a learner problem of applying specific linguistic structures according to the principles of information structure in the target language. Focus marking by cleft did not help German learners in native or in nonnative word recall. This could be attributed to the phonological similarity of the multiple choice options (Conrad & Hull, 1964), and to a long time span between listening and recall (Birch & Garnsey, 1995; McKoon et al., 1993). Experiment 3: Focus marking by lexical means Focus particles are elements of structure that can indicate focus (König, 1991), and their function is to emphasize a certain part of the sentence (Paterson et al., 1999). I argue that the focus particles even/only (German: sogar/nur) evoke contrast sets of alternatives resp. complements to the element in focus (Ni et al., 1996), which causes interpretations of context. Therefore, lexical focus marking isn’t expected to lead to faster word processing. However, since different mechanisms of encoding seem to underlie word memory, a benefit of the focusing function of particles is expected to show in the recall task: due to focus particles being a preferred and well-used feature for native speakers of German, a transfer of this habitualness is expected, resulting in a better recall of focused words. Results indicated that focus particles seem to be the weakest option to mark focus: Focus marking by lexical particle don’t seem to reduce word processing times in either German L1, English L2, or in English L1. The presence of focus particles is likely to instantiate a complex discourse model which lets the listener await further modifying information (Liversedge et al., 2002). This semantic complexity might slow down processing. There are no indications that focus particles facilitate native language word recall in German L1 and English L1. This could be because focus particles open sets of conditions and contexts that enlarge the set of representations in listeners rather than narrowing it down to the element in the scope of the focus particle. In word recall, the facilitative effect of focus particles emerges only in the nonnative language condition. It is suggested that L2 learners, when faced with more demanding tasks in an L2, use a broad variety of means that identify focus for a better representation of novel words in the memory. In Experiments 2 and 3, evidence suggests that accent is an important factor for efficient word processing and accurate recall in German L1 and English L1, but less so in English L2. This underlines the function of accent as core speech parameter and consistent cue to the perception of prominence native language use (see Cutler & Fodor, 1979; Pitt & Samuel, 1990a; Eriksson et al., 2002; Akker & Cutler, 2003); the L1-L2 difference is attributed to patterns of expectation that are employed in the L1 but not (yet?) in the L2. There seems to exist a fine-tuned sensitivity to how accents are distributed in the native language, listeners expect an appropriate distribution and interpret it accordingly (Eefting, 1991). This pleads for accent placement as extremely important to L2 proficiency; the current results also suggest that accent and its relationship with other speech parameters has to be newly established in the L2 to fully reveal its benefits for efficient processing of speech. There is evidence that additional context facilitates processing of complex syntactic structures but that a surplus of information has no effect if the sentence construction is less challenging for the listener. The increased amount of information to be processed seems to impede better word recall, particularly in the L2. Altogether, it seems that focus marking devices and context can combine to form an advantageous alliance: a substantial benefit in processing efficiency is found when parameters of focus marking and sentence coherence are integrated. L2 research advocates the beneficial aspects of providing context for efficient L2 word learning (Lawson & Hogben, 1996). The current thesis promotes the view that a context which offers more semantic, prosodic, or lexical connections might compensate for the additional processing load that context constitutes for the listeners. A methodological consideration concerns the order in which language conditions are presented to listeners, i.e., L1-L2 or L2-L1. Findings suggest that presentation order could enforce a learning bias, with the performance in the second experiment being influenced by knowledge acquired in the first (see Akker & Cutler, 2003). To conclude this work: The results of the present study suggest that information structure is more accessible in the native language than it is in the nonnative language. There is, however, some evidence that L2 learners have an understanding of the significance of some information-structural parameters of focus marking. This has a beneficial effect on processing efficiency and recall accuracy; on the cognitive side it illustrates the benefits and also the need of a dynamic exchange of information-structural organization between L1 and L2. The findings of the current thesis encourage the view that an understanding of information structure can help the learner to discover and categorise forms and meanings of the L2. Information structure thus emerges as a valuable resource to advance proficiency in a second language.

Focus accent, word lenght and position as cues to L1 and L2 word recognition (2005)

Sennema, Anke ; van de Vijver, Ruben ; Carroll, Susanne E. ; Zimmer-Stahl, Anne

For Charles Goodwin, Chuck (2018)

Selting, Margret

This appreciation will not be a testimonial to Chuck’s numerous publications and research achievements – I am sure that others will have a lot to say about those. Instead, I will say something about how I personally experienced and think of him, as a researcher personality, based on the limited time and the few occasions that we have had together.

The display and management of affectivity in climaxes of amusing stories (2017)

Selting, Margret

Visual attention and quantifier-spreading in heritage Russian bilinguals (2015)

Sekerina, Irina A. ; Sauermann, Antje

It is well established in language acquisition research that monolingual children and adult second language learners misinterpret sentences with the universal quantifier every and make quantifier-spreading errors that are attributed to a preference for a match in number between two sets of objects. The present Visual World eye-tracking study tested bilingual heritage Russian-English adults and investigated how they interpret of sentences like Every alligator lies in a bathtub in both languages. Participants performed a sentence-picture verification task while their eye movements were recorded. Pictures showed three pairs of alligators in bathtubs and two extra objects: elephants (Control condition), bathtubs (Overexhaustive condition), or alligators (Underexhaustive condition). Monolingual adults performed at ceiling in all conditions. Heritage language (HL) adults made 20% q-spreading errors, but only in the Overexhaustive condition, and when they made an error they spent more time looking at the two extra bathtubs during the Verb region. We attribute q-spreading in HL speakers to cognitive overload caused by the necessity to integrate conflicting sources of information, i.e. the spoken sentences in their weaker, heritage, language and attention-demanding visual context, that differed with respect to referential salience.

Refine

Has Fulltext

Author

Year of publication

Document Type

Language

Is part of the Bibliography

Keywords

Institute

934 search hits