TY  - THES
A1  - Sidarenka, Uladzimir
T1  - Sentiment analysis of German Twitter
T1  - Sentimentanalyse des deutschen Twitters
N2  - The immense popularity of online communication services in the last decade has not only upended our lives (with news spreading like wildfire on the Web, presidents announcing their decisions on Twitter, and the outcome of political elections being determined on Facebook) but also dramatically increased the amount of data exchanged on these platforms. Therefore, if we wish to understand the needs of modern society better and want to protect it from new threats, we urgently need more robust, higher-quality natural language processing (NLP) applications that can recognize such necessities and menaces automatically, by analyzing uncensored texts.  Unfortunately, most NLP programs today have been created for standard language, as we know it from newspapers, or, in the best case, adapted to the specifics of English social media.

This thesis reduces the existing deficit by entering the new frontier of German online communication and addressing one of its most prolific forms—users’ conversations on Twitter. In particular, it explores the ways and means by how people express their opinions on this service, examines current approaches to automatic mining of these feelings, and proposes novel methods, which outperform state-of-the-art techniques. For this purpose, I introduce a new corpus of German tweets that have been manually annotated with sentiments, their targets and holders, as well as lexical polarity items and their contextual modifiers. Using these data, I explore four major areas of sentiment research: (i) generation of sentiment lexicons, (ii) fine-grained opinion mining, (iii) message-level polarity classification, and (iv) discourse-aware sentiment analysis. In the first task, I compare three popular groups of lexicon generation methods: dictionary-, corpus-, and word-embedding–based ones, finding that dictionary-based systems generally yield better polarity lists than the last two groups. Apart from this, I propose a linear projection algorithm, whose results surpass many existing automatically-generated lexicons. Afterwords, in the second task, I examine two common approaches to automatic prediction of sentiment spans, their sources, and targets: conditional random fields (CRFs) and recurrent neural networks, obtaining higher scores with the former model and improving these results even further by redefining the structure of CRF graphs. When dealing with message-level polarity classification, I juxtapose three major sentiment paradigms: lexicon-, machine-learning–, and deep-learning–based systems, and try to unite the first and last of these method groups by introducing a bidirectional neural network with lexicon-based attention. Finally, in order to make the new classifier aware of microblogs' discourse structure, I let it separately analyze the elementary discourse units of each tweet and infer the overall polarity of a message from the scores of its EDUs with the help of two new approaches: latent-marginalized CRFs and Recursive Dirichlet Process.
N2  - Die enorme Popularität von Online-Kommunikationsdiensten in den letzten Jahrzehnten hat nicht unser Leben massiv geändert (sodass Nachrichten sich wie Fegefeuer übers Internet ausbreiten, Präsidenten ihre Entscheidungen auf Twitter ankündigen, und Ergebnisse politischer Wahlen auf Facebook entschieden werden) sondern auch zu einem dramatischen Anstieg der Datenmenge geführt, die über solche Plattformen ausgetauscht werden.  Deswegen braucht man heutzutage dringend zuverlässige, qualitätvolle NLP-Programme, um neue gesellschaftliche Bedürfnisse und Risiken in unzensierten Nutzernachrichten automatisch erkennen und abschätzen zu können. Leider sind die meisten modernen NLP-Anwendungen entweder auf die Analyse der Standardsprache (wie wir sie aus Zeitungstexten kennen) ausgerichtet oder im besten Fall an die Spezifika englischer Social Media angepasst.

Diese Dissertation reduziert den bestehenden Rückstand, indem sie das "Neuland" der deutschen Online-Kommunikation betritt und sich einer seiner produktivsten Formen zuwendet—den User-Diskussionen auf Twitter.  Diese Arbeit erforscht insbesondere die Art und Weise, wie Leute ihre Meinungen auf diesem Online-Service äußern, analysiert existierende Verfahren zur automatischen Erkennung ihrer Gefühle und schlägt neue Verfahren vor, die viele heutige State-of-the-Art-Systeme übertreffen.

Zu diesem Zweck stelle ich ein neues Korpus deutscher Tweets vor, die manuell von zwei menschlichen Experten mit Sentimenten (polaren Meinungen), ihren Quellen (sources) und Zielen (targets) sowie lexikalischen polaren Termen und deren kontextuellen Modifizierern annotiert wurden.  Mithilfe dieser Daten untersuche ich vier große Teilgebiete der Sentimentanalyse: (i) automatische Generierung von Sentiment-Lexika, (ii) aspekt-basiertes Opinion-Mining, (iii) Klassifizierung der Polarität von ganzen Nachrichten und (iv) diskurs-bewusste Sentimentanalyse.

In der ersten Aufgabe vergleiche ich drei populäre Gruppen von Lexikongenerierungsmethoden: wörterbuch-, corpus- und word-embedding-basierte Verfahren, und komme zu dem Schluss, dass wörterbuch-basierte Ansätze generell bessere Polaritätslexika liefern als die letzten zwei Gruppen.  Abgesehen davon, schlage ich einen neuen Linearprojektionsalgorithmus vor, dessen Resultate deutlich besser als viele automatisch generierte Polaritätslisten sind.

Weiterhin, in der zweiten Aufgabe, untersuche ich zwei gängige Herangehensweisen an die automatische Erkennung der Textspannen von Sentimenten, Sources und Targets: Conditional Random Fields (CRFs) und rekurrente neuronale Netzwerke.  Ich erziele bessere Ergebnisse mit der ersten Methode und verbessere diese Werte noch weiter durch alternative Topologien der CRF-Graphen.

Bei der Analyse der Nachrichtenpolarität stelle ich drei große Sentiment-Paradigmen gegenüber: lexikon-, Machine-Learning–, und Deep-Learning–basierte Systeme, und versuche die erste und die letzte dieser Gruppen in einem Verfahren zu vereinigen, indem ich eine neue neuronale Netzwerkarchitektur vorschlage: bidirektionales rekurrentes Netzwerk mit lexikon-basierter Attention (LBA).

Im letzten Kapitel unternehme ich einen Versuch, die Prädiktion der Gesamtpolarität von Tweets über die Diskursstruktur der Nachrichten zu informieren.  Zu diesem Zweck wende ich den vorgeschlagenen LBA-Klassifikator separat auf jede einzelne elementare Diskurs-Einheit (EDU) eines Microblogs an und induziere die allgemeine semantische Ausrichtung dieser Nachricht mithilfe von zwei neuen Methoden: latenten marginalisierten CRFs und rekursivem Dirichlet-Prozess.
KW  - sentiment analysis
KW  - opinion mining
KW  - social media
KW  - Twitter
KW  - natural language processing
KW  - discourse analysis
KW  - NLP
KW  - computational linguistics
KW  - machine learning
KW  - Sentimentanalyse
KW  - Computerlinguistik
KW  - Meinungsforschung
Y1  - 2019
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-437422
ER  - 
TY  - JOUR
A1  - Schäfer, Robin
A1  - Stede, Manfred
T1  - Argument mining on twitter
BT  - a survey
JF  - Information technology : it ; Methoden und innovative Anwendungen der Informatik und Informationstechnik ; Organ der Fachbereiche 3 und 4 der GI e.V. und des Fachbereichs 6 der ITG
N2  - In the last decade, the field of argument mining has grown notably. However, only relatively few studies have investigated argumentation in social media and specifically on Twitter. Here, we provide the, to our knowledge, first critical in-depth survey of the state of the art in tweet-based argument mining. We discuss approaches to modelling the structure of arguments in the context of tweet corpus annotation, and we review current progress in the task of detecting argument components and their relations in tweets. We also survey the intersection of argument mining and stance detection, before we conclude with an outlook.
KW  - Argument Mining
KW  - Twitter
KW  - Stance Detection
Y1  - 2021
U6  - https://doi.org/10.1515/itit-2020-0053
SN  - 1611-2776
SN  - 2196-7032
VL  - 63
IS  - 1
SP  - 45
EP  - 58
PB  - De Gruyter
CY  - Berlin
ER  - 
TY  - JOUR
A1  - Kapidzic, Sanja
A1  - Frey, Felix
A1  - Neuberger, Christoph
A1  - Stieglitz, Stefan
A1  - Mirbabaie, Milad
T1  - Crisis communication on Twitter
BT  - differences between user types in top tweets about the 2015 “refugee crisis” in Germany
JF  - International journal of communication
N2  - The study explores differences between three user types in the top tweets about the 2015 “refugee crisis” in Germany and presents the results of a quantitative content analysis. All tweets with the keyword “Flüchtlinge” posted for a monthlong period following September 13, 2015, the day Germany decided to implement border controls, were collected (N = 763,752). The top 2,495 tweets according to number of retweets were selected for analysis. Differences between news media, public and private actor tweets in topics, tweet characteristics such as tone and opinion expression, links, and specific sentiments toward refugees were analyzed. We found strong differences between the tweets. Public actor tweets were the main source of positive sentiment toward refugees and the main information source on refugee support. News media tweets mostly reflected traditional journalistic norms of impartiality and objectivity, whereas private actor tweets were more diverse in sentiments toward refugees.
KW  - refugee crisis 2015
KW  - Germany
KW  - social media
KW  - Twitter
KW  - user types
Y1  - 2023
UR  - https://ijoc.org/index.php/ijoc/article/view/18172/4022
SN  - 1932-8036
VL  - 17
SP  - 735
EP  - 754
PB  - The Annenberg Center for Communication
CY  - Los Angeles, Calif.
ER  - 
TY  - JOUR
A1  - Hagemann, Linus
A1  - Abramova, Olga
T1  - Emotions and information diffusion on social media
BT  - a replication in the context of political communication on Twitter
JF  - AIS transactions on replication research
N2  - This paper presents a methodological and conceptual replication of Stieglitz and Dang-Xuan’s (2013) investigation of the role of sentiment in information-sharing behavior on social media. Whereas Stieglitz and Dang-Xuan (2013) focused on Twitter communication prior to the state parliament elections in the German states Baden-Wurttemberg, Rheinland-Pfalz, and Berlin in 2011, we test their theoretical propositions in the context of the state parliament elections in Saxony-Anhalt (Germany) 2021. We confirm the positive link between sentiment in a political Twitter message and its number of retweets in a methodological replication. In a conceptual replication, where sentiment was assessed with the alternative dictionary-based tool LIWC, the sentiment was negatively associated with the retweet volume. In line with the original study, the strength of association between sentiment and retweet time lag insignificantly differs between tweets with negative sentiment and tweets with positive sentiment. We also found that the number of an author’s followers was an essential determinant of sharing behavior. However, two hypotheses supported in the original study did not hold for our sample. Precisely, the total amount of sentiments was insignificantly linked to the time lag to the first retweet. Finally, in our data, we do not observe that the association between the overall sentiment and retweet quantity is stronger for tweets with negative sentiment than for those with positive sentiment.
KW  - Twitter
KW  - information diffusion
KW  - sentiment
KW  - elections
Y1  - 2023
U6  - https://doi.org/10.17705/1atrr.00079
SN  - 2473-3458
VL  - 9
IS  - 1
SP  - 1
EP  - 19
PB  - AIS
CY  - Atlanta
ER  - 
TY  - GEN
A1  - Brendel, Nina
A1  - Matzner, Nils
A1  - Menzel, Max-Peter
T1  - Geographisches Gezwitscher – Analyse von Twitter-Daten als Methode im GW-Unterricht
T2  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2  - Soziale Medien sind ein wesentlicher Bestandteil des Alltags von Schüler*innen und gleichzeitig zunehmend wichtig in Wirtschaft, Politik und Wissenschaft. Am Beispiel von Twitter zeigt dieser Beitrag, dass soziale Medien im Unterricht auch für die Beantwortung geographischer Fragestellungen verwendet werden können. Hierfür eignen sich Twitter-Daten aufgrund ihrer Georeferenzierung und weiterer interessanter Inhalte besonders. Der Beitrag gibt einen Überblick über die Verwendung von Twitter für sozialwissenschaftliche und humangeographische Fragestellungen und reflektiert die Nutzung von Twitter im Unterricht. Für die Unterrichtspraxis werden Beispiele zu den Themen Braunkohle, Flutereignisse und Raumwahrnehmungen sowie Anleitungen zur Auswertung, Anwendung und Reflexion von Twitter-Analysen vorgestellt.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1247 
KW  - Twitter
KW  - Soziale Medien
KW  - Forschungsmethodik
KW  - Unterrichtsmethoden
Y1  - 2022
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-550614
SN  - 1866-8372
SP  - 72
EP  - 85
PB  - Universitätsverlag Potsdam
CY  - Potsdam
ER  - 
TY  - JOUR
A1  - Brendel, Nina
A1  - Matzner, Nils
A1  - Menzel, Max-Peter
T1  - Geographisches Gezwitscher – Analyse von Twitter-Daten als Methode im GW-Unterricht
JF  - GW-Unterricht
N2  - Soziale Medien sind ein wesentlicher Bestandteil des Alltags von Schüler*innen und gleichzeitig zunehmend wichtig in Wirtschaft, Politik und Wissenschaft. Am Beispiel von Twitter zeigt dieser Beitrag, dass soziale Medien im Unterricht auch für die Beantwortung geographischer Fragestellungen verwendet werden können. Hierfür eignen sich Twitter-Daten aufgrund ihrer Georeferenzierung und weiterer interessanter Inhalte besonders. Der Beitrag gibt einen Überblick über die Verwendung von Twitter für sozialwissenschaftliche und humangeographische Fragestellungen und reflektiert die Nutzung von Twitter im Unterricht. Für die Unterrichtspraxis werden Beispiele zu den Themen Braunkohle, Flutereignisse und Raumwahrnehmungen sowie Anleitungen zur Auswertung, Anwendung und Reflexion von Twitter-Analysen vorgestellt.
KW  - Twitter
KW  - Soziale Medien
KW  - Forschungsmethodik
KW  - Unterrichtsmethoden
Y1  - 2021
U6  - https://doi.org/10.1553/gw-unterricht164s72
SN  - 2414-4169
SP  - 72
EP  - 85
PB  - Verlag der Österreichischen Akademie der Wissenschaften
CY  - Wien
ER  - 
TY  - JOUR
A1  - Abramova, Olga
A1  - Batzel, Katharina
A1  - Modesti, Daniela
T1  - Collective response to the health crisis among German Twitter users
BT  - a structural topic modeling approach
JF  - International Journal of Information Management Data Insights
N2  - We used structural topic modeling to analyze over 800,000 German tweets about COVID-19 to answer the questions: What patterns emerge in tweets as a response to a health crisis? And how do topics discussed change over time? The study leans on the goals associated with the health information seeking (GAINS) model, discerning whether a post aims at tackling and eliminating the problem (i.e., problem-focused) or managing the emotions (i.e., emotion-focused); whether it strives to maximize positive outcomes (promotion focus) or to minimize negative outcomes (prevention focus). The findings indicate four clusters salient in public reactions: 1) “Understanding” (problem-promotion); 2) “Action planning” (problem-prevention); 3) “Hope” (emotion-promotion) and 4) “Reassurance” (emotion-prevention). Public communication is volatile over time, and a shift is evidenced from self-centered to community-centered topics within 4.5 weeks. Our study illustrates social media text mining's potential to quickly and efficiently extract public opinions and reactions. Monitoring fears and trending topics enable policymakers to rapidly respond to deviant behavior, like resistive attitudes toward containment measures or deteriorating physical health. Healthcare workers can use the insights to provide mental health services for battling anxiety or extensive loneliness from staying home.
KW  - social media
KW  - Twitter
KW  - modeling
KW  - regulatory focus theory
KW  - crisis management
KW  - text mining
Y1  - 2022
U6  - https://doi.org/10.1016/j.jjimei.2022.100126
SN  - 2667-0968
VL  - 2
IS  - 2
PB  - Elsevier
CY  - Amsterdam
ER  -