Refine
Year of publication
Document Type
- Article (13)
- Conference Proceeding (5)
- Monograph/Edited Volume (3)
- Other (3)
- Working Paper (1)
Language
- English (25)
Is part of the Bibliography
- yes (25)
Keywords
- Cloud Computing (4)
- E-Mail Tracking (4)
- Forschungsprojekte (4)
- Future SOC Lab (4)
- In-Memory Technologie (4)
- Multicore Architekturen (4)
- Privacy (4)
- cloud computing (4)
- maschinelles Lernen (4)
- techniques: imaging spectroscopy (4)
Institute
- Fachgruppe Betriebswirtschaftslehre (6)
- Hasso-Plattner-Institut für Digital Engineering GmbH (4)
- Institut für Ernährungswissenschaft (4)
- Institut für Physik und Astronomie (4)
- Institut für Informatik und Computational Science (3)
- Wirtschaftswissenschaften (2)
- Fachgruppe Volkswirtschaftslehre (1)
- Hasso-Plattner-Institut für Digital Engineering gGmbH (1)
The “HPI Future SOC Lab” is a cooperation of the Hasso Plattner Institute (HPI) and industry partners. Its mission is to enable and promote exchange and interaction between the research community and the industry partners.
The HPI Future SOC Lab provides researchers with free of charge access to a complete infrastructure of state of the art hard and software. This infrastructure includes components, which might be too expensive for an ordinary research environment, such as servers with up to 64 cores and 2 TB main memory. The offerings address researchers particularly from but not limited to the areas of computer science and business information systems. Main areas of research include cloud computing, parallelization, and In-Memory technologies.
This technical report presents results of research projects executed in 2018. Selected projects have presented their results on April 17th and November 14th 2017 at the Future SOC Lab Day events.
This study pushes our understanding of research reliability by reproducing and replicating claims from 110 papers in leading economic and political science journals. The analysis involves computational reproducibility checks and robustness assessments. It reveals several patterns. First, we uncover a high rate of fully computationally reproducible results (over 85%). Second, excluding minor issues like missing packages or broken pathways, we uncover coding errors for about 25% of studies, with some studies containing multiple errors. Third, we test the robustness of the results to 5,511 re-analyses. We find a robustness reproducibility of about 70%. Robustness reproducibility rates are relatively higher for re-analyses that introduce new data and lower for re-analyses that change the sample or the definition of the dependent variable. Fourth, 52% of re-analysis effect size estimates are smaller than the original published estimates and the average statistical significance of a re-analysis is 77% of the original. Lastly, we rely on six teams of researchers working independently to answer eight additional research questions on the determinants of robustness reproducibility. Most teams find a negative relationship between replicators' experience and reproducibility, while finding no relationship between reproducibility and the provision of intermediate or even raw data combined with the necessary cleaning codes.
The “HPI Future SOC Lab” is a cooperation of the Hasso Plattner Institute (HPI) and industry partners. Its mission is to enable and promote exchange and interaction between the research community and the industry partners.
The HPI Future SOC Lab provides researchers with free of charge access to a complete infrastructure of state of the art hard and software. This infrastructure includes components, which might be too expensive for an ordinary research environment, such as servers with up to 64 cores and 2 TB main memory. The offerings address researchers particularly from but not limited to the areas of computer science and business information systems. Main areas of research include cloud computing, parallelization, and In-Memory technologies.
This technical report presents results of research projects executed in 2019. Selected projects have presented their results on April 9th and November 12th 2019 at the Future SOC Lab Day events.
Email tracking allows email senders to collect fine-grained behavior and location data on email recipients, who are uniquely identifiable via their email address. Such tracking invades user privacy in that email tracking techniques gather data without user consent or awareness. Striving to increase privacy in email communication, this paper develops a detection engine to be the core of a selective tracking blocking mechanism in the form of three contributions. First, a large collection of email newsletters is analyzed to show the wide usage of tracking over different countries, industries and time. Second, we propose a set of features geared towards the identification of tracking images under real-world conditions. Novel features are devised to be computationally feasible and efficient, generalizable and resilient towards changes in tracking infrastructure. Third, we test the predictive power of these features in a benchmarking experiment using a selection of state-of-the-art classifiers to clarify the effectiveness of model-based tracking identification. We evaluate the expected accuracy of the approach on out-of-sample data, over increasing periods of time, and when faced with unknown senders. (C) 2018 Elsevier B.V. All rights reserved.
We utilise multi-epoch MUSE spectroscopy to study binary stars in the core of the Galactic globular cluster NGC 3201. Our sample consists of 3553 stars with 54 883 spectra in total comprising 3200 main-sequence stars up to 4 magnitudes below the turn-off. Each star in our sample has between 3 and 63 (with a median of 14) reliable radial velocity measurements within five years of observations. We introduce a statistical method to determine the probability of a star showing radial velocity variations based on the whole inhomogeneous radial velocity sample. Using HST photometry and an advanced dynamical MOCCA simulation of this specific cluster we overcome observational biases that previous spectroscopic studies had to deal with. This allows us to infer a binary frequency in the MUSE field of view and enables us to deduce the underlying true binary frequency of (6.75 +/- 0.72)% in NGC 3201. The comparison of the MUSE observations with the MOCCA simulation suggests a large portion of primordial binaries. We can also confirm a radial increase in the binary fraction towards the cluster centre due to mass segregation. We discovered that in the core of NGC 3201 at least (57.5 +/- 7.9)% of blue straggler stars are in a binary system. For the first time in a study of globular clusters, we were able to fit Keplerian orbits to a significant sample of 95 binaries. We present the binary system properties of eleven blue straggler stars and the connection to SX Phoenicis-type stars. We show evidence that two blue straggler formation scenarios, the mass transfer in binary (or triple) star systems and the coalescence due to binary-binary interactions, are present in our data. We also describe the binary and spectroscopic properties of four sub-subgiant (or red straggler) stars. Furthermore, we discovered two new black hole candidates with minimum masses (M sin i) of (7.68 +/- 0.50)M-circle dot, (4.4 +/- 2.8)M-circle dot, and refine the minimum mass estimate on the already published black hole to (4.53 +/- 0.21)M-circle dot, These black holes are consistent with an extensive black hole subsystem hosted by NGC 3201.
Context. Galactic globular clusters (GCs) are now known to host multiple populations displaying particular abundance variations. The different populations within a GC can be well distinguished following their position in the pseudo two-colors diagrams, also referred to as "chromosome maps". These maps are constructed using optical and near-UV photometry available from the Hubble Space Telescope (HST) UV survey of GCs. However, the chemical tagging of the various populations in the chromosome maps is hampered by the fact that HST photometry and elemental abundances are both only available for a limited number of stars. Aims. The spectra collected as part of the MUSE survey of globular clusters provide a spectroscopic counterpart to the HST photometric catalogs covering the central regions of GCs. In this paper, we use the MUSE spectra of 1115 red giant branch (RGB) stars in NGC 2808 to characterize the abundance variations seen in the multiple populations of this cluster. Methods. We used the chromosome map of NGC 2808 to divide the RGB stars into their respective populations. We then combined the spectra of all stars belonging to a given population, resulting in one high signal-to-noise ratio spectrum representative of each population. Results. Variations in the spectral lines of O, Na, Mg, and Al are clearly detected among four of the populations. In order to quantify these variations, we measured equivalent width differences and created synthetic populations spectra that were used to determine abundance variations with respect to the primordial population of the cluster. Our results are in good agreement with the values expected from previous studies based on high-resolution spectroscopy. We do not see any significant variations in the spectral lines of Ca, K, and Ba. We also do not detect abundance variations among the stars belonging to the primordial population of NGC 2808. Conclusions. We demonstrate that in spite of their low resolution, the MUSE spectra can be used to investigate abundance variations in the context of multiple populations.
A nova is a cataclysmic event on the surface of a white dwarf in a binary system that increases the overall brightness by several orders of magnitude. Although binary systems with a white dwarf are expected to be overabundant in globular clusters compared with in the Galaxy, only two novae from Galactic globular clusters have been observed. We present the discovery of an emission nebula in the Galactic globular cluster M 22 (NGC 6656) in observations made with the integral-field spectrograph MUSE. We extracted the spectrum of the nebula and used the radial velocity determined from the emission lines to confirm that the nebula is part of NGC 6656. Emission-line ratios were used to determine the electron temperature and density. It is estimated to have a mass of 1-17 x 10(-5) M-circle dot. This mass and the emission-line ratios indicate that the nebula is a nova remnant. Its position coincides with the reported location of a "guest star", an ancient Chinese term for transients, observed in May 48 BCE. With this discovery, this nova may be one of the oldest confirmed extra-solar events recorded in human history.
A stellar census in globular clusters with MUSE: A spectral catalogue of emission-line sources
(2019)
Aims. Globular clusters produce many exotic stars due to a much higher frequency of dynamical interactions in their dense stellar environments. Some of these objects were observed together with several hundred thousand other stars in our MUSE survey of 26 Galactic globular clusters. Assuming that at least a few exotic stars have exotic spectra (i.e. spectra that contain emission lines), we can use this large spectroscopic data set of over a million stellar spectra as a blind survey to detect stellar exotica in globular clusters. Methods. To detect emission lines in each spectrum, we modelled the expected shape of an emission line as a Gaussian curve. This template was used for matched filtering on the di fferences between each observed 1D spectrum and its fitted spectral model. The spectra with the most significant detections of H alpha emission are checked visually and cross-matched with published catalogues. Results. We find 156 stars with H alpha emission, including several known cataclysmic variables (CV) and two new CVs, pulsating variable stars, eclipsing binary stars, the optical counterpart of a known black hole, several probable sub-subgiants and red stragglers, and 21 background emission-line galaxies. We find possible optical counterparts to 39 X-ray sources, as we detected H alpha emission in several spectra of stars that are close to known positions of Chandra X-ray sources. This spectral catalogue can be used to supplement existing or future X-ray or radio observations with spectra of potential optical counterparts to classify the sources.
Track and Treat
(2018)
E-Mail tracking mechanisms gather information on individual recipients’ reading behavior. Previous studies show that e-mail newsletters commonly include tracking elements. However, prior work does not examine the degree to which e-mail senders actually employ gathered user information. The paper closes this research gap by means of an experimental study to clarify the use of tracking-based infor- mation. To that end, twelve mail accounts are created, each of which subscribes to a pre-defined set of newsletters from companies based in Germany, the UK, and the USA. Systematically varying e-mail reading patterns across accounts, each account simulates a different type of user with individual read- ing behavior. Assuming senders to track e-mail reading habits, we expect changes in mailer behavior. The analysis confirms the prominence of tracking in that over 92% of the newsletter e-mails contain tracking images. For 13 out of 44 senders an adjustment of communication policy in response to user reading behavior is observed. Observed effects include sending newsletters at different times, adapting advertised products to match the users’ IT environment, increased or decreased mailing frequency, and mobile-specific adjustments. Regarding legal issues, not all companies that adapt the mail-sending behavior state the usage of such mechanisms in their privacy policy.
E-Mail Tracking
(2016)
E-mail advertisement, as one instrument in the marketing mix, allows companies to collect fine-grained behavioural data about individual users’ e-mail reading habits realised through sophisticated tracking mechanisms. Such tracking can be harmful for user privacy and security. This problem is especially severe since e-mail tracking techniques gather data without user consent. Striving to increase privacy and security in e-mail communication, the paper makes three contributions. First, a large database of newsletter e-mails is developed. This data facilitates investigating the prevalence of e- mail tracking among 300 global enterprises from Germany, the United Kingdom and the United States. Second, countermeasures are developed for automatically identifying and blocking e-mail tracking mechanisms without impeding the user experience. The approach consists of identifying important tracking descriptors and creating a neural network-based detection model. Last, the effectiveness of the proposed approach is established by means of empirical experimentation. The results suggest a classification accuracy of 99.99%.