TY  - BOOK
A1  - Olsen, Susan
A1  - Stiebels, Barbara
A1  - Bierwisch, Manfred
A1  - Zimmermann, Ilse
A1  - Cavar, Damir
A1  - Georgi, Doreen
A1  - Bacskai-Atkari, Julia
A1  - Alexiadou, Artemis
A1  - Błaszczak, Joanna
A1  - Müller, Gereon
A1  - Šimík, Radek
A1  - Meinunger, André
A1  - Thiersch, Craig
A1  - Arnhold, Anja
A1  - Féry, Caroline
A1  - Bayer, Josef
A1  - Titov, Elena
A1  - Fominyam, Henry
A1  - Tran, Thuan
A1  - Bornkessel-Schlesewsky, Ina D.
A1  - Schlesewsky, Matthias
A1  - Zimmermann, Malte
A1  - Häussler, Jana
A1  - Mucha, Anne
A1  - Schmidt, Andreas
A1  - Weskott, Thomas
A1  - Wierzba, Marta
A1  - Stede, Manfred
A1  - Skopeteas, Stavros
A1  - Gafos, Adamantios I.
A1  - Haider, Hubert
A1  - Wunderlich, Dieter
A1  - Staudacher, Peter
A1  - Rauh, Gisa
ED  - Brown, Jessica M. M.
ED  - Schmidt, Andreas
ED  - Wierzba, Marta
T1  - Of Trees and Birds
BT  - A Festschrift for Gisbert Fanselow
N2  - Gisbert Fanselow’s work has been invaluable and inspiring to many researchers working on syntax, morphology, and information structure, both from a theoretical and from an experimental perspective. This volume comprises a collection of articles dedicated to Gisbert on the occasion of his 60th birthday, covering a range of topics from these areas and beyond. The contributions have in common that in a broad sense they have to do with language structures (and thus trees), and that in a more specific sense they have to do with birds. They thus cover two of Gisbert’s major interests in- and outside of the linguistic world (and perhaps even at the interface).
KW  - Festschrift
KW  - Linguistik
KW  - Syntax
KW  - Morphologie
KW  - Informationsstruktur
KW  - festschrift
KW  - linguistics
KW  - syntax
KW  - morphology
KW  - information structure
Y1  - 2019
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-426542
SN  - 978-3-86956-457-9
PB  - Universitätsverlag Potsdam
CY  - Potsdam
ER  - 
TY  - JOUR
A1  - Schäfer, Robin
A1  - Stede, Manfred
T1  - Argument mining on Twitter
BT  - a survey
JF  - Information technology : it ; Methoden und innovative Anwendungen der Informatik und Informationstechnik ; Organ der Fachbereiche 3 und 4 der GI e.V. und des Fachbereichs 6 der ITG
N2  - In the last decade, the field of argument mining has grown notably. However, only relatively few studies have investigated argumentation in social media and specifically on Twitter. Here, we provide the, to our knowledge, first critical in-depth survey of the state of the art in tweet-based argument mining. We discuss approaches to modelling the structure of arguments in the context of tweet corpus annotation, and we review current progress in the task of detecting argument components and their relations in tweets. We also survey the intersection of argument mining and stance detection, before we conclude with an outlook.
KW  - Argument Mining
KW  - Twitter
KW  - Stance Detection
Y1  - 2021
U6  - https://doi.org/10.1515/itit-2020-0053
SN  - 1611-2776
SN  - 2196-7032
VL  - 63
IS  - 1
SP  - 45
EP  - 58
PB  - De Gruyter
CY  - Berlin
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Automatic argumentation mining and the role of stance and sentiment
JF  - Journal of argumentation in context
N2  - Argumentation mining is a subfield of Computational Linguistics that aims (primarily) at automatically finding arguments and their structural components in natural language text. We provide a short introduction to this field, intended for an audience with a limited computational background. After explaining the subtasks involved in this problem of deriving the structure of arguments, we describe two other applications that are popular in computational linguistics: sentiment analysis and stance detection. From the linguistic viewpoint, they concern the semantics of evaluation in language. In the final part of the paper, we briefly examine the roles that these two tasks play in argumentation mining, both in current practice, and in possible future systems.
KW  - argumentation structure
KW  - argumentation mining
KW  - sentiment analysis
KW  - stance detection
Y1  - 2020
U6  - https://doi.org/10.1075/jaic.00006.ste
SN  - 2211-4742
SN  - 2211-4750
VL  - 9
IS  - 1
SP  - 19
EP  - 41
PB  - John Benjamins Publishing Co.
CY  - Amsterdam
ER  - 
TY  - BOOK
A1  - Stede, Manfred
T1  - Korpusgestützte Textanalyse : Grundzüge der Ebenen-orientierten Textlinguistik
Y1  - 2007
SN  - 978-3-8233-6301-9
SN  - 0941-8105
PB  - Narr
CY  - Tübingen
ER  - 
TY  - JOUR
A1  - Grabski, Michael
A1  - Stede, Manfred
T1  - Bei : intraclausal coherence relations illustrated with a German preposition
N2  - Coherence relations are typically taken to link two clauses or larger units and to be signaled at the text surface by conjunctions and certain adverbials. Relations, however, also can hold within clauses, indicated by prepositions like despite, due to, or in case of, when these have an internal argument denoting an eventuality. Although these prepositions act as reliable cues to indicate a specific relation, others are lexically more neutral. We investigated this situation for the German preposition bei, which turns out to be highly ambiguous. We demonstrate the range of readings in a corpus study, proposing 6 more specific prepositions as a comprehensive substitution set. All these uses of bei share a common kernel meaning, which is missed by the standard accounts that assume lexical polysemy. We examine the range of coherence relations that can be signaled by bei and provide some factors here supporting the disambiguation task in a framework of discourse interpretation.
Y1  - 2006
UR  - http://www.informaworld.com/0163-853X
U6  - https://doi.org/10.1207/s15326950dp4102_5
SN  - 0163-853X
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Does discourse processing need discourse topics?
Y1  - 2004
SN  - 0301-4428
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - DiMLex: a lexical approach to discourse markers
Y1  - 2002
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Polibox: Generating descriptions, comparisons, and recommendations from a database
Y1  - 2002
SN  - 1-55860-899-0
ER  - 
TY  - JOUR
A1  - Dipper, Stefanie
A1  - Götze, Michael
A1  - Stede, Manfred
A1  - Wegst, Tillmann
T1  - ANNIS
BT  - a linguistic database for exploring information structure
JF  - Interdisciplinary studies on information structure : ISIS ; working papers of the SFB 632
N2  - In this paper, we discuss the design and implementation of our first version of the database "ANNIS" ("ANNotation of Information Structure"). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing.
Y1  - 2004
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-8432
SN  - 1866-4725
SN  - 1614-4708
IS  - 1
SP  - 245
EP  - 279
ER  - 
TY  - GEN
A1  - Afantenos, Stergos
A1  - Peldszus, Andreas
A1  - Stede, Manfred
T1  - Comparing decoding mechanisms for parsing argumentative structures
T2  - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2  - Parsing of argumentative structures has become a very active line of research in recent years. Like discourse parsing or any other natural language task that requires prediction of linguistic structures, most approaches choose to learn a local model and then perform global decoding over the local probability distributions, often imposing constraints that are specific to the task at hand. Specifically for argumentation parsing, two decoding approaches have been recently proposed: Minimum Spanning Trees (MST) and Integer Linear Programming (ILP), following similar trends in discourse parsing. In contrast to discourse parsing though, where trees are not always used as underlying annotation schemes, argumentation structures so far have always been represented with trees. Using the 'argumentative microtext corpus' [in: Argumentation and Reasoned Action: Proceedings of the 1st European Conference on Argumentation, Lisbon 2015 / Vol. 2, College Publications, London, 2016, pp. 801-815] as underlying data and replicating three different decoding mechanisms, in this paper we propose a novel ILP decoder and an extension to our earlier MST work, and then thoroughly compare the approaches. The result is that our new decoder outperforms related work in important respects, and that in general, ILP and MST yield very similar performance.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1062 
KW  - argumentation structure
KW  - argument mining
KW  - parsing
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-470527
SN  - 1866-8372
IS  - 1062
ER  - 
TY  - JOUR
A1  - Afantenos, Stergos
A1  - Peldszus, Andreas
A1  - Stede, Manfred
T1  - Comparing decoding mechanisms for parsing argumentative structures
JF  - Argument & Computation
N2  - Parsing of argumentative structures has become a very active line of research in recent years. Like discourse parsing or any other natural language task that requires prediction of linguistic structures, most approaches choose to learn a local model and then perform global decoding over the local probability distributions, often imposing constraints that are specific to the task at hand. Specifically for argumentation parsing, two decoding approaches have been recently proposed: Minimum Spanning Trees (MST) and Integer Linear Programming (ILP), following similar trends in discourse parsing. In contrast to discourse parsing though, where trees are not always used as underlying annotation schemes, argumentation structures so far have always been represented with trees. Using the ‘argumentative microtext corpus’ [in: Argumentation and Reasoned Action: Proceedings of the 1st European Conference on Argumentation, Lisbon 2015 / Vol. 2, College Publications, London, 2016, pp. 801–815] as underlying data and replicating three different decoding mechanisms, in this paper we propose a novel ILP decoder and an extension to our earlier MST work, and then thoroughly compare the approaches. The result is that our new decoder outperforms related work in important respects, and that in general, ILP and MST yield very similar performance.
KW  - Argumentation structure
KW  - argument mining
KW  - parsing
Y1  - 2018
U6  - https://doi.org/10.3233/AAC-180033
SN  - 1946-2166
SN  - 1946-2174
VL  - 9
IS  - 3
SP  - 177
EP  - 192
PB  - IOS Press
CY  - Amsterdam
ER  - 
TY  - JOUR
A1  - Krüger, K. R.
A1  - Lukowiak, A.
A1  - Sonntag, J.
A1  - Warzecha, Saskia
A1  - Stede, Manfred
T1  - Classifying news versus opinions in newspapers
BT  - linguistic features for domain independence
JF  - Natural language engineering
N2  - Newspaper text can be broadly divided in the classes ‘opinion’ (editorials, commentary, letters to the editor) and ‘neutral’ (reports). We describe a classification system for performing this separation, which uses a set of linguistically motivated features. Working with various English newspaper corpora, we demonstrate that it significantly outperforms bag-of-lemma and PoS-tag models. We conclude that the linguistic features constitute the best method for achieving robustness against change of newspaper or domain.
Y1  - 2017
U6  - https://doi.org/10.1017/S1351324917000043
SN  - 1351-3249
SN  - 1469-8110
VL  - 23
SP  - 687
EP  - 707
PB  - Cambridge Univ. Press
CY  - Cambridge
ER  - 
TY  - GEN
A1  - Saint-Dizier, Patrick
A1  - Stede, Manfred
T1  - Foundations of the language of argumentation
T2  - Argument & computation
Y1  - 2017
U6  - https://doi.org/10.3233/AAC-170018
SN  - 1946-2166
SN  - 1946-2174
VL  - 8
IS  - 2 Special issue
SP  - 91
EP  - 93
PB  - IOS Press
CY  - Amsterdam
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - From connectives to coherence relations
BT  - a case study of German contrastive connectives
JF  - Revue roumaine de linguistique : RRL = Romanian review of linguistics
N2  - The notion of coherence relations is quite widely accepted in general, but concrete proposals differ considerably on the questions of how they should be motivated, which relations are to be assumed, and how they should be defined. This paper takes a "bottom-up" perspective by assessing the contribution made by linguistic signals (connectives), using insights from the relevant literature as well as verification by practical text annotation. We work primarily with the German language here and focus on the realm of contrast. Thus, we suggest a new inventory of contrastive connective functions and discuss their relationship to contrastive coherence relations that have been proposed in earlier work.
KW  - coherence relation
KW  - connective
KW  - contrast
KW  - concession
KW  - corpus analysis
Y1  - 2020
SN  - 0035-3957
VL  - 65
IS  - 3
SP  - 213
EP  - 233
PB  - Ed. Academiei Române
CY  - Bucureşti
ER  - 
TY  - BOOK
A1  - Stede, Manfred
A1  - Chiarcos, Christian
A1  - Grabski, Michael
A1  - Lagerwerf, Luuk
T1  - Salience in discourse : multidisciplinary approaches to discourse 2005
T3  - Uitgaven Stichting Neerlandistiek VU
Y1  - 2005
SN  - 3-89323-749-6
VL  - 49
PB  - Nodus-Publ; Stichting Neerlandistiek VU
CY  - Münster; Amsterdam
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Disambiguating rhetorical structure
N2  - Empirical studies of text coherence often use tree-like structures in the spirit of Rhetorical Structure Theory (RST) as representational device. This paper identifies several sources of ambiguity in RST-inspired trees and argues that such structures are therefore not as explanatory as a text representation should be. As an alternative, an approach toward multi-level annotation (MLA) of texts is proposed, which separates the information into distinct levels of representation, in particular: referential structure, thematic structure, conjunctive relations, and intentional structure. Levels are conceptually built upon each other, and human annotators can produce them using a dedicated software environment. We argue that the resulting multi-level corpora are descriptively more adequate, and as a resource are more useful than RST-style treebanks.
Y1  - 2008
UR  - http://www.springerlink.com/content/111138
U6  - https://doi.org/10.1007/s11168-008-9053-7
SN  - 1570-7075
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - RST revisited : disentangling nuclearity
Y1  - 2008
SN  - 978-90-272-3109-3
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Connective-based local coherence analysis : a lexicon for recognizing causal relationships
Y1  - 2008
SN  - 978-1-904-98793-2
ER  - 
TY  - JOUR
A1  - Chiarcos, Christian
A1  - Dipper, Stefanie
A1  - Götze, Michael
A1  - Leser, Ulf
A1  - Lüdeling, Anke
A1  - Ritz, Julia
A1  - Stede, Manfred
T1  - A flexible framework for integrating annotations from different tools and tag sets
N2  - We present a general framework for integrating annotations from different tools and tag sets. When annotating corpora at multiple linguistic levels, annotators may use different expert tools for different phenomena or types of annotation. These tools employ different data models and accompanying approaches to visualization, and they produce different output formats. For the purposes of uniformly processing these outputs, we developed a pivot format called PAULA, along with converters to and from tool formats. Different annotations are not only integrated at the level of data format, but are also joined on the level of conceptual representation. For this purpose, we introduce OLiA, an ontology of linguistic annotations that mediates between alternative tag sets that cover the same class of linguistic phenomena. All components are integrated in the linguistic information system ANNIS : Annotation tool output is converted to the pivot format PAULA and read into a database where the data can be visualized, queried, and evaluated across multiple layers. For cross-tag set querying and statistical evaluation, ANNIS uses the ontology of linguistic annotations. Finally, ANNIS is also tied to a machine learning component for semiautomatic annotation.
Y1  - 2008
UR  - http://www.atala.org/A-Flexible-Framework-for
SN  - 1248-9433
ER  - 
TY  - JOUR
A1  - Stede, Manfred
A1  - Kuhn, Florian
T1  - Identifying the content zones of German court decisions
Y1  - 2009
SN  - 978-3-642-03423-7
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Computerlinguistik und Textanalyse
Y1  - 2008
UR  - http://www.narr.de/bib/17432/9783823374329.pdf
SN  - 978-3-8233-6432-0
ER  - 
TY  - JOUR
A1  - Chiarcos, Christian
A1  - Ritz, Julia
A1  - Stede, Manfred
T1  - By all these lovely tokens... Merging conflicting tokenizations
JF  - Language resources and evaluation
N2  - Given the contemporary trend to modular NLP architectures and multiple annotation frameworks, the existence of concurrent tokenizations of the same text represents a pervasive problem in everyday's NLP practice and poses a non-trivial theoretical problem to the integration of linguistic annotations and their interpretability in general. This paper describes a solution for integrating different tokenizations using a standoff XML format, and discusses the consequences from a corpus-linguistic perspective.
KW  - Linguistic annotation
KW  - Multi-layer annotation
KW  - Conflicting tokenizations
KW  - Tokenization alignment
KW  - Corpus linguistics
Y1  - 2012
U6  - https://doi.org/10.1007/s10579-011-9161-0
SN  - 1574-020X
VL  - 46
IS  - 1
SP  - 53
EP  - 74
PB  - Springer
CY  - Dordrecht
ER  - 
TY  - JOUR
A1  - Stede, Manfred
A1  - Huang, Chu-Ren
T1  - Inter-operability and reusability : the science of annotation
JF  - Language resources and evaluation
N2  - Annotating linguistic data has become a major field of interest, both for supplying the necessary data for machine learning approaches to NLP applications, and as a research issue in its own right. This comprises issues of technical formats, tools, and methodologies of annotation. We provide a brief overview of these notions and then introduce the papers assembled in this special issue.
KW  - Linguistic annotation
KW  - Annotation tools
KW  - Inter-operability
Y1  - 2012
U6  - https://doi.org/10.1007/s10579-011-9164-x
SN  - 1574-020X
VL  - 46
IS  - 1
SP  - 91
EP  - 94
PB  - Springer
CY  - Dordrecht
ER  - 
TY  - JOUR
A1  - Taboada, Maite
A1  - Brooke, Julian
A1  - Tofiloski, Milan
A1  - Voll, Kimberly
A1  - Stede, Manfred
T1  - Lexicon-Based methods for sentiment analysis
JF  - Computational linguistics
N2  - We present a lexicon-based approach to extracting sentiment from text. The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation. SO-CAL is applied to the polarity classification task, the process of assigning a positive or negative label to a text that captures the text's opinion towards its main subject matter. We show that SO-CAL's performance is consistent across domains and on completely unseen data. Additionally, we describe the process of dictionary creation, and our use of Mechanical Turk to check dictionaries for consistency and reliability.
Y1  - 2011
SN  - 0891-2017
VL  - 37
IS  - 2
SP  - 267
EP  - 307
PB  - MIT Press
CY  - Cambridge
ER  - 
TY  - JOUR
A1  - Stede, Manfred
A1  - Peldszus, Andreas
T1  - The role of illocutionary status in the usage conditions of causal connectives and in coherence relations
JF  - Journal of pragmatics : an interdisciplinary journal of language studies
N2  - The meaning of linguistic connectives has often been characterized in terms of their position in a bipartite (semantic, pragmatic) or a tripartite (content, epistemic, speech act) structure of domains, depending on what kinds of entities are being connected (largely: propositions or speech acts). This paper argues that a more fine-grained analysis can be achieved by directing some more attention to the characterization of the entities being related. We propose an inventory of categories of illocutionary status for labelling the spans that are being connected. On this basis, the distinction between the content and the epistemic domain, in particular, can be made more explicit. Focusing on the group of causal connectives in German, we conducted a corpus annotation study from which we derived distinct pragmatic 'usage profiles' of the most frequent causal connectives. Finally, we offer some suggestions on the role of illocutions in relation-based accounts of discourse structure.
KW  - Connective
KW  - Coherence relation
KW  - Speech act
KW  - Illocutionary force
Y1  - 2012
U6  - https://doi.org/10.1016/j.pragma.2012.01.004
SN  - 0378-2166
VL  - 44
IS  - 2
SP  - 214
EP  - 229
PB  - Elsevier
CY  - Amsterdam
ER  - 
TY  - BOOK
A1  - Stede, Manfred
A1  - Mamprin, Sara
A1  - Peldszus, Andreas
A1  - Herzog, André
A1  - Kaupat, David
A1  - Chiarcos, Christian
A1  - Warzecha, Saskia
ED  - Stede, Manfred
T1  - Handbuch Textannotation
T1  - Handbook text annotation
BT  - Potsdamer Kommentarkorpus 2.0
BT  - Potsdam commentary corpus 2.0
N2  - Das Potsdamer Kommentarkorpus ist eine Sammlung von Zeitungstexten, die dem Genre ‘Kommentar’ zuzuordnen sind. Der öffentlich verfügbare Teil besteht aus 175 Texten aus der Märkischen Allgemeinen Zeitung, die hinsichtlich Syntax, Koreferenz, Konnektoren und Rhetorische Struktur manuell annotiert wurden. Weitere Ebenen werden bei zukünftigen Korpusversionen hinzukommen. Dieses Buch enthält die Annotationsrichtlinien, die der Bearbeitung des öffentlichen Teils des Korpus zugrunde lagen, sowie auch anderer Teile, bei denen mit weiteren Annotationsebenen experimentiert wurde. Die meisten der Richtlinien werden auch für ähnliche Text-Genres und für andere Sprachen verwendbar sein.
N2  - The Potsdam Commentary Corpus is a collection of newspaper texts belonging to the ‘commentary’ genre. The public part consists of 175 texts from Märkische Allgemeine Zeitung that have been manually annotated for syntax, coreference, connectives, and rhetorical structure. Further layers will be added to future releases of the corpus. This book assembles the annotation guidelines that have been used for that public part, as well as for other portions, where other layers of annotation have been experimented with. Most of the guidelines will be applicable to similar genres, and also to other languages.
T3  - Potsdam Cognitive Science Series - 8 
KW  - linguistische Annotation
KW  - linguistisches Korpus
KW  - Textstruktur
KW  - Zeitungskommentare
KW  - linguistic annotation
KW  - linguistic corpus
KW  - text structure
KW  - newspaper commentary
Y1  - 2015
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-82761
SN  - 978-3-86956-343-5
ER  - 
TY  - JOUR
A1  - Lüdeling, Anke
A1  - Ritz, Julia
A1  - Stede, Manfred
A1  - Zeldes, Amir
T1  - Corpus Linguistics and Information Structure Research
JF  - The Oxford handbook of information structure
Y1  - 2016
SN  - 978-0-19-964267-0
SP  - 599
EP  - 617
PB  - Oxford University Press
CY  - Oxford
ER  - 
TY  - JOUR
A1  - Stede, Manfred
T1  - Noch kindlich oder schon jugendlich? Oder gar erwachsen?
BT  - Betrachtung von Komplexitätsmerkmalen altersspezifischer Texte
JF  - Of trees and birds. A Festschrift for Gisbert Fanselow
KW  - Festschrift
KW  - Informationsstruktur
KW  - Linguistik
KW  - Morphologie
KW  - Syntax
KW  - festschrift
KW  - information structure
KW  - linguistics
KW  - morphology
KW  - syntax
Y1  - 2019
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-432569
SN  - 978-3-86956-457-9
SP  - 323
EP  - 334
PB  - Universitätsverlag Potsdam
CY  - Potsdam
ER  - 
TY  - JOUR
A1  - Wulff, Peter
A1  - Buschhüter, David
A1  - Westphal, Andrea
A1  - Nowak, Anna
A1  - Becker, Lisa
A1  - Robalino, Hugo
A1  - Stede, Manfred
A1  - Borowski, Andreas
T1  - Computer-based classification of preservice physics teachers’ written reflections
JF  - Journal of science education and technology
N2  - Reflecting in written form on one's teaching enactments has been considered a facilitator for teachers' professional growth in university-based preservice teacher education. Writing a structured reflection can be facilitated through external feedback. However, researchers noted that feedback in preservice teacher education often relies on holistic, rather than more content-based, analytic feedback because educators oftentimes lack resources (e.g., time) to provide more analytic feedback. To overcome this impediment to feedback for written reflection, advances in computer technology can be of use. Hence, this study sought to utilize techniques of natural language processing and machine learning to train a computer-based classifier that classifies preservice physics teachers' written reflections on their teaching enactments in a German university teacher education program. To do so, a reflection model was adapted to physics education. It was then tested to what extent the computer-based classifier could accurately classify the elements of the reflection model in segments of preservice physics teachers' written reflections. Multinomial logistic regression using word count as a predictor was found to yield acceptable average human-computer agreement (F1-score on held-out test dataset of 0.56) so that it might fuel further development towards an automated feedback tool that supplements existing holistic feedback for written reflections with data-based, analytic feedback.
KW  - reflection
KW  - teacher professional development
KW  - natural language processing
KW  - machine learning
Y1  - 2020
U6  - https://doi.org/10.1007/s10956-020-09865-1
SN  - 1059-0145
SN  - 1573-1839
VL  - 30
IS  - 1
SP  - 1
EP  - 15
PB  - Springer
CY  - Dordrecht
ER  - 
TY  - JOUR
A1  - Stede, Manfred
A1  - Scheffler, Tatjana
A1  - Mendes, Amalia
T1  - Connective-Lex
BT  - a Web-Based Multilingual Lexical Resource for Connectives
JF  - Discours : revue de linguistique, psycholinguistique et informatique
N2  - In this paper, we present a tangible outcome of the TextLink network: a joint online database project displaying and linking existing and newly-created lexicons of discourse connectives in multiple languages. We discuss the definition and demarcation of the class of connectives that should be included in such a resource, and present the syntactic, semantic/pragmatic, and lexicographic information we collected. Further, the technical implementation of the database and the search functionality are presented. We discuss how the multilingual integration of several connective lexicons provides added value for linguistic researchers and other users interested in connectives, by allowing crosslinguistic comparison and a direct linking between discourse relational devices in different languages. Finally, we provide pointers for possible future extensions both in breadth (i.e., by adding lexicons for additional languages) and depth (by extending the information provided for each connective item and by strengthening the crosslinguistic links).
N2  - Nous présentons dans cet article un résultat tangible du réseau TextLink : un projet conjoint de base de données en ligne, qui montre et relie des lexiques, aussi bien existants que créés récemment, de connecteurs discursifs dans plusieurs langues. Nous commençons par considérer la définition et la délimitation de la classe des connecteurs qui devraient être inclus dans une telle ressource, et nous présentons l’information syntaxique, sémantico-pragmatique et lexicographique que nous avons recueillie. D’autre part, l’implémentation technique de cette base de données et les fonctionnalités de recherche qu’elle permet sont aussi décrites. Nous discutons de quelle manière l’intégration multilingue de plusieurs lexiques de connecteurs apporte une valeur ajoutée aux chercheurs en linguistique et aux autres utilisateurs qui s’intéressent aux connecteurs, en permettant de comparer plusieurs langues et de relier directement les connecteurs dans différentes langues. Pour finir, nous donnons des indications quant à une possible extension future en termes d’ampleur (par exemple, en ajoutant des lexiques pour de nouvelles langues) et de profondeur (en augmentant l’information qui est donnée pour chaque connecteur et en renforçant les liens entre lexiques).
KW  - discourse connectives
KW  - lexicon
KW  - multilingual resources
KW  - crosslinguistic links
Y1  - 2019
U6  - https://doi.org/10.4000/discours.10098
SN  - 1963-1723
IS  - 24
PB  - Université de Paris-Sorbonne
CY  - Paris
ER  - 
TY  - JOUR
A1  - Clausen, Yulia
A1  - Stede, Manfred
T1  - Discourse connectives and their arguments
BT  - an experiment on anaphoricity in German
JF  - Linguistics Vanguard
N2  - Adverbial connectives like therefore, which link a preceding 'external' to an 'internal' argument, can be regarded as anaphoric: The external argument is selected by an interpretation process akin to that of an event anaphor, and intervening material can appear between both arguments. We report on a crowdsourcing experiment on the German connectives trotzdem and dennoch that studies factors that lead readers to assume such long-distance arguments: semantic plausibility of intervening material, 'subjective' versus 'objective' content, and the presence of an anaphoric morpheme in the connective. We find that the type and content of the intervening material play an important role in argument choice.
KW  - connective
KW  - discourse anaphora
KW  - discourse structure
Y1  - 2022
U6  - https://doi.org/10.1515/lingvan-2021-0102
SN  - 2199-174X
VL  - 8
IS  - 1
SP  - 95
EP  - 111
PB  - De Gruyter
CY  - Berlin
ER  - 
TY  - JOUR
A1  - Aktas, Berfin
A1  - Stede, Manfred
T1  - Anaphoric distance in oral and written language
BT  - Experimental evidence
JF  - Discours : revue de linguistique, psycholinguistique et informatique
N2  - We investigate the variation in oral and written language in terms of anaphoric distance (i.e., the textual distance between anaphors and their antecedents), expanding corpus-based research with experimental evidence. Contrastive corpus studies demonstrate that oral genres include longer average anaphoric distance than written genres, if the distance is measured in terms of clauses (Fox, 1987; Aktas & Stede, 2020). We designed an experiment in order to examine the contrasts in oral and written mediums, using the same genre. We aim to gain more insight about the impact of the medium, in a situation where both mediums convey a similar level of spontaneity, informality and interactivity. We designed a story continuation study, where the participants are recruited via crowdsourcing. To our knowledge, this is the first study of its kind, where anaphoric distance is manipulated systematically in a language production experiment in order to examine medium distinctions. We observed that participants use more pronouns in oral medium than in written medium if the anaphoric distance is long. This result is in line with the implications of the earlier corpus-based research. In addition, our results indicate that anaphoric distance has a larger effect in referential choice for the written medium.
KW  - anaphora
KW  - anaphoric distance
KW  - referential choice
KW  - production medium
KW  - oral
KW  - written
KW  - story continuation
KW  - crowdsourcing
Y1  - 2022
U6  - https://doi.org/10.4000/discours.12383
SN  - 1963-1723
IS  - 31
PB  - Université de Paris-Sorbonne, Maison Recherche
CY  - Paris
ER  -