The “HPI Future SOC Lab” is a cooperation between the Hasso Plattner Institute (HPI) and industry partners. Its mission is to enable and promote exchange and interaction between the research community and the industry partners.
The HPI Future SOC Lab provides researchers with free-of-charge access to a complete infrastructure of state-of-the-art hardware and software. This infrastructure includes components that might be too expensive for an ordinary research environment, such as servers with up to 64 cores and 2 TB of main memory. The offerings address researchers particularly from, but not limited to, the areas of computer science and business information systems. The main areas of research include cloud computing, parallelization, and in-memory technologies.
This technical report presents the results of research projects carried out in 2019. Selected projects presented their results on April 9 and November 12, 2019, at the Future SOC Lab Day events.
Recently, substantial research effort has focused on how to apply CNNs or RNNs to better capture temporal patterns in videos, so as to improve the accuracy of video classification. In this paper, we investigate the potential of a purely attention-based local feature integration. Accounting for the characteristics of such features in video classification, we first propose Basic Attention Clusters (BAC), which concatenates the output of multiple attention units applied in parallel, and introduce a shifting operation to capture more diverse signals. Experiments show that BAC can achieve excellent results on multiple datasets. However, BAC treats all feature channels as an indivisible whole, which is suboptimal for achieving a finer-grained local feature integration over the channel dimension. Additionally, it treats the entire local feature sequence as an unordered set, thus ignoring the sequential relationships. To improve over BAC, we further propose the channel pyramid attention schema, which splits features into sub-features at multiple scales for coarse-to-fine sub-feature interaction modeling, and the temporal pyramid attention schema, which divides the feature sequences into ordered sub-sequences of multiple lengths to account for the sequential order. Our final model, pyramid×pyramid attention clusters (PPAC), combines both channel pyramid attention and temporal pyramid attention to focus on the most important sub-features, while also preserving the temporal information of the video. We demonstrate the effectiveness of PPAC on seven real-world video classification datasets. Our model achieves competitive results across all of these, showing that our proposed framework can consistently outperform the existing local feature integration methods across a range of different scenarios.
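The core of BAC described above can be sketched as follows: each attention unit computes softmax weights over the set of local features, forms a weighted sum, and applies a shift followed by L2 normalization; the unit outputs are concatenated. This is a minimal NumPy sketch with randomly initialized parameters, not the paper's implementation; the exact form of the shifting operation here is an assumption.

```python
import numpy as np

def attention_unit(features, w, shift):
    """One attention unit: softmax-weighted sum of local features,
    followed by a shifting operation (assumed here to be a learnable
    additive shift plus L2 normalization)."""
    scores = features @ w                      # (N,) unnormalized attention
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                   # softmax over the N local features
    attended = weights @ features              # (D,) weighted sum
    shifted = attended + shift                 # shifting operation
    return shifted / (np.linalg.norm(shifted) + 1e-8)

def basic_attention_clusters(features, units):
    # Concatenate the outputs of k attention units applied in parallel.
    return np.concatenate([attention_unit(features, w, s) for w, s in units])

rng = np.random.default_rng(0)
N, D, k = 16, 8, 4                             # 16 local features of dim 8, 4 units
feats = rng.normal(size=(N, D))
units = [(rng.normal(size=D), rng.normal()) for _ in range(k)]
out = basic_attention_clusters(feats, units)
print(out.shape)   # (32,) = k * D
```

The pyramid variants would apply the same machinery to channel-wise sub-features and to ordered temporal sub-sequences before concatenation.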
Affect-aware word clouds
(2020)
Word clouds are widely used for non-analytic purposes, such as introducing a topic to students, or creating a gift with personally meaningful text. Surveys show that users prefer tools that yield word clouds with a stronger emotional impact. Fonts and color palettes are powerful typographical signals that may determine this impact. Typically, these signals are assigned randomly, or are expected to be chosen by the users. We present an affect-aware font and color palette selection methodology that aims to facilitate more informed choices. We infer associations of fonts with a set of eight affects, and evaluate the resulting data in a series of user studies, both on individual words and in word clouds. Relying on a recent study to procure affective color palettes, we carry out a similar user study to understand the impact of color choices on word clouds. Our findings suggest that both fonts and color palettes are powerful tools contributing to the affects evoked by a word cloud. The experiments further confirm that the novel datasets we propose are successful in enabling this. We also find that, for the majority of the affects, both signals need to be congruent to create a stronger impact. Based on this data, we implement a prototype that allows users to specify a desired affect and recommends congruent fonts and color palettes for the word cloud.
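The recommendation step described above can be illustrated with a toy sketch: given per-font and per-palette affect scores, return pairs in which both signals score highly for the desired affect, i.e. are congruent. All names and scores below are invented for illustration; real values would come from the user-study datasets the paper describes.

```python
# Hypothetical affect scores in [0, 1]; fonts, palettes, and values
# are illustrative placeholders, not the paper's data.
FONT_AFFECTS = {
    "Lobster": {"joy": 0.8, "calmness": 0.3},
    "Courier": {"joy": 0.2, "calmness": 0.6},
}
PALETTE_AFFECTS = {
    "sunny":   {"joy": 0.9, "calmness": 0.2},
    "seafoam": {"joy": 0.3, "calmness": 0.8},
}

def recommend(affect, threshold=0.5):
    """Return (font, palette) pairs whose scores for `affect` both
    exceed `threshold`, i.e. congruent typographic signals."""
    fonts = [f for f, a in FONT_AFFECTS.items() if a.get(affect, 0) >= threshold]
    palettes = [p for p, a in PALETTE_AFFECTS.items() if a.get(affect, 0) >= threshold]
    return [(f, p) for f in fonts for p in palettes]

print(recommend("joy"))       # [('Lobster', 'sunny')]
```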
Local laws on urban policy, i.e., ordinances, directly affect our daily life in various ways (health, business, etc.), yet in practice, for many citizens they remain opaque and complex. This article focuses on an approach to make urban policy more accessible and comprehensible to the general public and to government officials, while also addressing pertinent social media postings. Due to the intricacies of natural language, ranging from complex legalese in ordinances to informal lingo in tweets, it is practical to harness human judgment here. To this end, we mine ordinances and tweets via reasoning based on commonsense knowledge so as to better account for pragmatics and semantics in the text. Ours is pioneering work in ordinance mining, and thus there is no prior labeled training data available for learning. This gap is filled by commonsense knowledge, a prudent choice in situations involving a lack of adequate training data. The ordinance mining can be beneficial to the public in fathoming policies and to officials in assessing policy effectiveness based on public reactions. This work contributes to smart governance, leveraging transparency in governing processes via public involvement. We focus significantly on ordinances contributing to smart cities; hence, an important goal is to assess how well an urban region progresses towards becoming a smart city, based on how its policies map to smart city characteristics and on the corresponding public satisfaction.
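The mapping of ordinances to smart city characteristics mentioned above can be sketched in miniature. The paper uses commonsense-knowledge reasoning; the sketch below substitutes a much simpler keyword-overlap score purely to illustrate the idea of profiling an ordinance against characteristics. The characteristic names and keyword sets are invented.

```python
# Stand-in for commonsense reasoning: score an ordinance text against
# smart-city characteristics via keyword overlap. All keywords invented.
CHARACTERISTICS = {
    "smart mobility":    {"transit", "bicycle", "parking", "traffic"},
    "smart environment": {"recycling", "emissions", "green", "solar"},
}

def smart_city_profile(ordinance_text):
    """Fraction of each characteristic's keywords found in the text."""
    words = set(ordinance_text.lower().split())
    return {name: len(words & kw) / len(kw)
            for name, kw in CHARACTERISTICS.items()}

profile = smart_city_profile(
    "An ordinance requiring solar panels and green roofs to cut emissions")
print(profile)   # 'smart environment' scores 3/4, 'smart mobility' 0
```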
In this increasingly data-rich world, visual recordings of human behavior often cannot be shared due to concerns about privacy.
Consequently, data sharing in fields such as behavioral science, multimodal communication, and human movement research is often limited.
In addition, in legal and other non-scientific contexts, privacy-related concerns may preclude the sharing of video recordings and thus remove the rich multimodal context that humans recruit to communicate.
Minimizing the risk of identity exposure while preserving critical behavioral information would maximize the utility of public resources (e.g., research grants) and the time invested in audio-visual research.
Here we present an open-source computer vision tool that masks the identities of humans while maintaining rich information about communicative body movements. Furthermore, this masking tool can be easily applied to many videos, leveraging computational tools to augment the reproducibility and accessibility of behavioral research.
The tool is designed for researchers and practitioners engaged in kinematic and affective research. Application areas include teaching/education, communication and human movement research, CCTV, and legal contexts.
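The masking idea described above can be sketched as follows: replace each video frame with a blank canvas that shows only body keypoints, so identifying appearance is removed while communicative movement remains. This is an illustrative sketch, not the tool's implementation; in practice the keypoints would come from a pose estimator, which is assumed here.

```python
import numpy as np

def mask_frame(frame, keypoints, radius=2):
    """Return a masked copy of `frame` that is blank except for small
    markers at the given (row, col) pose keypoints. Keypoints are
    assumed to come from an external pose estimator."""
    masked = np.zeros_like(frame)              # drop all appearance information
    h, w = frame.shape[:2]
    for r, c in keypoints:
        r0, r1 = max(0, r - radius), min(h, r + radius + 1)
        c0, c1 = max(0, c - radius), min(w, c + radius + 1)
        masked[r0:r1, c0:c1] = 255             # keep only the movement signal
    return masked

# Toy example: one random 48x64 RGB frame with two detected keypoints.
frame = np.random.randint(0, 256, size=(48, 64, 3), dtype=np.uint8)
masked = mask_frame(frame, [(10, 20), (30, 40)])
print(masked.shape)   # (48, 64, 3)
```

Applying the same function to every frame of a video yields an anonymized recording in which kinematic information is preserved.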