Refine
Document Type
- Article (6)
- Doctoral Thesis (1)
- Postprint (1)
Keywords
- Twitter (8) (remove)
Institute
The immense popularity of online communication services in the last decade has not only upended our lives (with news spreading like wildfire on the Web, presidents announcing their decisions on Twitter, and the outcome of political elections being determined on Facebook) but also dramatically increased the amount of data exchanged on these platforms. Therefore, if we wish to understand the needs of modern society better and want to protect it from new threats, we urgently need more robust, higher-quality natural language processing (NLP) applications that can recognize such necessities and menaces automatically, by analyzing uncensored texts. Unfortunately, most NLP programs today have been created for standard language, as we know it from newspapers, or, in the best case, adapted to the specifics of English social media.
This thesis reduces the existing deficit by entering the new frontier of German online communication and addressing one of its most prolific forms—users’ conversations on Twitter. In particular, it explores the ways and means by how people express their opinions on this service, examines current approaches to automatic mining of these feelings, and proposes novel methods, which outperform state-of-the-art techniques. For this purpose, I introduce a new corpus of German tweets that have been manually annotated with sentiments, their targets and holders, as well as lexical polarity items and their contextual modifiers. Using these data, I explore four major areas of sentiment research: (i) generation of sentiment lexicons, (ii) fine-grained opinion mining, (iii) message-level polarity classification, and (iv) discourse-aware sentiment analysis. In the first task, I compare three popular groups of lexicon generation methods: dictionary-, corpus-, and word-embedding–based ones, finding that dictionary-based systems generally yield better polarity lists than the last two groups. Apart from this, I propose a linear projection algorithm, whose results surpass many existing automatically-generated lexicons. Afterwords, in the second task, I examine two common approaches to automatic prediction of sentiment spans, their sources, and targets: conditional random fields (CRFs) and recurrent neural networks, obtaining higher scores with the former model and improving these results even further by redefining the structure of CRF graphs. When dealing with message-level polarity classification, I juxtapose three major sentiment paradigms: lexicon-, machine-learning–, and deep-learning–based systems, and try to unite the first and last of these method groups by introducing a bidirectional neural network with lexicon-based attention. Finally, in order to make the new classifier aware of microblogs' discourse structure, I let it separately analyze the elementary discourse units of each tweet and infer the overall polarity of a message from the scores of its EDUs with the help of two new approaches: latent-marginalized CRFs and Recursive Dirichlet Process.
The field of healthcare is characterized by constant innovation, with gender-specific medicine emerging as a new subfield that addresses sex and gender disparities in clinical manifestations, outcomes, treatment, and prevention of disease. Despite its importance, the adoption of gender-specific medicine remains understudied, posing potential risks to patient outcomes due to a lack of awareness of the topic. Building on the Innovation Decision Process Theory, this study examines the spread of information about gender-specific medicine in online networks. The study applies social network analysis to a Twitter dataset reflecting online discussions about the topic to gain insights into its adoption by health professionals and patients online. Results show that the network has a community structure with limited information exchange between sub-communities and that mainly medical experts dominate the discussion. The findings suggest that the adoption of gender-specific medicine might be in its early stages, focused on knowledge exchange. Understanding the diffusion of gender-specific medicine among medical professionals and patients may facilitate its adoption and ultimately improve health outcomes.
Soziale Medien sind ein wesentlicher Bestandteil des Alltags von Schüler*innen und gleichzeitig zunehmend wichtig in Wirtschaft, Politik und Wissenschaft. Am Beispiel von Twitter zeigt dieser Beitrag, dass soziale Medien im Unterricht auch für die Beantwortung geographischer Fragestellungen verwendet werden können. Hierfür eignen sich Twitter-Daten aufgrund ihrer Georeferenzierung und weiterer interessanter Inhalte besonders. Der Beitrag gibt einen Überblick über die Verwendung von Twitter für sozialwissenschaftliche und humangeographische Fragestellungen und reflektiert die Nutzung von Twitter im Unterricht. Für die Unterrichtspraxis werden Beispiele zu den Themen Braunkohle, Flutereignisse und Raumwahrnehmungen sowie Anleitungen zur Auswertung, Anwendung und Reflexion von Twitter-Analysen vorgestellt.
Soziale Medien sind ein wesentlicher Bestandteil des Alltags von Schüler*innen und gleichzeitig zunehmend wichtig in Wirtschaft, Politik und Wissenschaft. Am Beispiel von Twitter zeigt dieser Beitrag, dass soziale Medien im Unterricht auch für die Beantwortung geographischer Fragestellungen verwendet werden können. Hierfür eignen sich Twitter-Daten aufgrund ihrer Georeferenzierung und weiterer interessanter Inhalte besonders. Der Beitrag gibt einen Überblick über die Verwendung von Twitter für sozialwissenschaftliche und humangeographische Fragestellungen und reflektiert die Nutzung von Twitter im Unterricht. Für die Unterrichtspraxis werden Beispiele zu den Themen Braunkohle, Flutereignisse und Raumwahrnehmungen sowie Anleitungen zur Auswertung, Anwendung und Reflexion von Twitter-Analysen vorgestellt.
This paper presents a methodological and conceptual replication of Stieglitz and Dang-Xuan’s (2013) investigation of the role of sentiment in information-sharing behavior on social media. Whereas Stieglitz and Dang-Xuan (2013) focused on Twitter communication prior to the state parliament elections in the German states Baden-Wurttemberg, Rheinland-Pfalz, and Berlin in 2011, we test their theoretical propositions in the context of the state parliament elections in Saxony-Anhalt (Germany) 2021. We confirm the positive link between sentiment in a political Twitter message and its number of retweets in a methodological replication. In a conceptual replication, where sentiment was assessed with the alternative dictionary-based tool LIWC, the sentiment was negatively associated with the retweet volume. In line with the original study, the strength of association between sentiment and retweet time lag insignificantly differs between tweets with negative sentiment and tweets with positive sentiment. We also found that the number of an author’s followers was an essential determinant of sharing behavior. However, two hypotheses supported in the original study did not hold for our sample. Precisely, the total amount of sentiments was insignificantly linked to the time lag to the first retweet. Finally, in our data, we do not observe that the association between the overall sentiment and retweet quantity is stronger for tweets with negative sentiment than for those with positive sentiment.
The study explores differences between three user types in the top tweets about the 2015 “refugee crisis” in Germany and presents the results of a quantitative content analysis. All tweets with the keyword “Flüchtlinge” posted for a monthlong period following September 13, 2015, the day Germany decided to implement border controls, were collected (N = 763,752). The top 2,495 tweets according to number of retweets were selected for analysis. Differences between news media, public and private actor tweets in topics, tweet characteristics such as tone and opinion expression, links, and specific sentiments toward refugees were analyzed. We found strong differences between the tweets. Public actor tweets were the main source of positive sentiment toward refugees and the main information source on refugee support. News media tweets mostly reflected traditional journalistic norms of impartiality and objectivity, whereas private actor tweets were more diverse in sentiments toward refugees.
We used structural topic modeling to analyze over 800,000 German tweets about COVID-19 to answer the questions: What patterns emerge in tweets as a response to a health crisis? And how do topics discussed change over time? The study leans on the goals associated with the health information seeking (GAINS) model, discerning whether a post aims at tackling and eliminating the problem (i.e., problem-focused) or managing the emotions (i.e., emotion-focused); whether it strives to maximize positive outcomes (promotion focus) or to minimize negative outcomes (prevention focus). The findings indicate four clusters salient in public reactions: 1) “Understanding” (problem-promotion); 2) “Action planning” (problem-prevention); 3) “Hope” (emotion-promotion) and 4) “Reassurance” (emotion-prevention). Public communication is volatile over time, and a shift is evidenced from self-centered to community-centered topics within 4.5 weeks. Our study illustrates social media text mining's potential to quickly and efficiently extract public opinions and reactions. Monitoring fears and trending topics enable policymakers to rapidly respond to deviant behavior, like resistive attitudes toward containment measures or deteriorating physical health. Healthcare workers can use the insights to provide mental health services for battling anxiety or extensive loneliness from staying home.
Argument mining on twitter
(2021)
In the last decade, the field of argument mining has grown notably. However, only relatively few studies have investigated argumentation in social media and specifically on Twitter. Here, we provide the, to our knowledge, first critical in-depth survey of the state of the art in tweet-based argument mining. We discuss approaches to modelling the structure of arguments in the context of tweet corpus annotation, and we review current progress in the task of detecting argument components and their relations in tweets. We also survey the intersection of argument mining and stance detection, before we conclude with an outlook.