Refine
Year of publication
Document Type
- Article (41)
- Monograph/Edited Volume (11)
- Other (3)
- Conference Proceeding (1)
- Postprint (1)
- Preprint (1)
Is part of the Bibliography
- yes (58)
Keywords
- radiation mechanisms: non-thermal (8)
- gamma rays: galaxies (6)
- galaxies: active (5)
- gamma rays: general (5)
- ISM: supernova remnants (4)
- data profiling (4)
- Datenintegration (3)
- duplicate detection (3)
- similarity measures (3)
- Data Integration (2)
How inclusive are we?
(2022)
ACM SIGMOD, VLDB and other database organizations have committed to fostering an inclusive and diverse community, as do many other scientific organizations. Recently, different measures have been taken to advance these goals, especially for underrepresented groups. One possible measure is double-blind reviewing, which aims to hide gender, ethnicity, and other properties of the authors. <br /> We report the preliminary results of a gender diversity analysis of publications of the database community across several peer-reviewed venues, and also compare women's authorship percentages in both single-blind and double-blind venues along the years. We also obtained a cross comparison of the obtained results in data management with other relevant areas in Computer Science.
Data analytics are moving beyond the limits of a single data processing platform. A cross-platform query optimizer is necessary to enable applications to run their tasks over multiple platforms efficiently and in a platform-agnostic manner. For the optimizer to be effective, it must consider data movement costs across different data processing platforms. In this paper, we present the graph-based data movement strategy used by RHEEM, our open-source cross-platform system. In particular, we (i) model the data movement problem as a new graph problem, which we prove to be NP-hard, and (ii) propose a novel graph exploration algorithm, which allows RHEEM to discover multiple hidden opportunities for cross-platform data processing.
Spreadsheets are among the most commonly used file formats for data management, distribution, and analysis. Their widespread employment makes it easy to gather large collections of data, but their flexible canvas-based structure makes automated analysis difficult without heavy preparation. One of the common problems that practitioners face is the presence of multiple, independent regions in a single spreadsheet, possibly separated by repeated empty cells. We define such files as "multiregion" files. In collections of various spreadsheets, we can observe that some share the same layout. We present the Mondrian approach to automatically identify layout templates across multiple files and systematically extract the corresponding regions. Our approach is composed of three phases: first, each file is rendered as an image and inspected for elements that could form regions; then, using a clustering algorithm, the identified elements are grouped to form regions; finally, every file layout is represented as a graph and compared with others to find layout templates. We compare our method to state-of-the-art table recognition algorithms on two corpora of real-world enterprise spreadsheets. Our approach shows the best performances in detecting reliable region boundaries within each file and can correctly identify recurring layouts across files.
RHEEMix in the data jungle
(2020)
Data analytics are moving beyond the limits of a single platform. In this paper, we present the cost-based optimizer of Rheem, an open-source cross-platform system that copes with these new requirements. The optimizer allocates the subtasks of data analytic tasks to the most suitable platforms. Our main contributions are: (i) a mechanism based on graph transformations to explore alternative execution strategies; (ii) a novel graph-based approach to determine efficient data movement plans among subtasks and platforms; and (iii) an efficient plan enumeration algorithm, based on a novel enumeration algebra. We extensively evaluate our optimizer under diverse real tasks. We show that our optimizer can perform tasks more than one order of magnitude faster when using multiple platforms than when using a single platform.
RHEEMix in the data jungle
(2020)
Data analytics are moving beyond the limits of a single platform. In this paper, we present the cost-based optimizer of Rheem, an open-source cross-platform system that copes with these new requirements. The optimizer allocates the subtasks of data analytic tasks to the most suitable platforms. Our main contributions are: (i) a mechanism based on graph transformations to explore alternative execution strategies; (ii) a novel graph-based approach to determine efficient data movement plans among subtasks and platforms; and (iii) an efficient plan enumeration algorithm, based on a novel enumeration algebra. We extensively evaluate our optimizer under diverse real tasks. We show that our optimizer can perform tasks more than one order of magnitude faster when using multiple platforms than when using a single platform.
Design and Implementation of service-oriented architectures imposes a huge number of research questions from the fields of software engineering, system analysis and modeling, adaptability, and application integration. Component orientation and web services are two approaches for design and realization of complex web-based system. Both approaches allow for dynamic application adaptation as well as integration of enterprise application. Commonly used technologies, such as J2EE and .NET, form de facto standards for the realization of complex distributed systems. Evolution of component systems has lead to web services and service-based architectures. This has been manifested in a multitude of industry standards and initiatives such as XML, WSDL UDDI, SOAP, etc. All these achievements lead to a new and promising paradigm in IT systems engineering which proposes to design complex software solutions as collaboration of contractually defined software services. Service-Oriented Systems Engineering represents a symbiosis of best practices in object-orientation, component-based development, distributed computing, and business process management. It provides integration of business and IT concerns. The annual Ph.D. Retreat of the Research School provides each member the opportunity to present his/her current state of their research and to give an outline of a prospective Ph.D. thesis. Due to the interdisciplinary structure of the Research Scholl, this technical report covers a wide range of research topics. These include but are not limited to: Self-Adaptive Service-Oriented Systems, Operating System Support for Service-Oriented Systems, Architecture and Modeling of Service-Oriented Systems, Adaptive Process Management, Services Composition and Workflow Planning, Security Engineering of Service-Based IT Systems, Quantitative Analysis and Optimization of Service-Oriented Systems, Service-Oriented Systems in 3D Computer Graphics sowie Service-Oriented Geoinformatics.
The integration of multiple data sources is a common problem in a large variety of applications. Traditionally, handcrafted similarity measures are used to discover, merge, and integrate multiple representations of the same entity-duplicates-into a large homogeneous collection of data. Often, these similarity measures do not cope well with the heterogeneity of the underlying dataset. In addition, domain experts are needed to manually design and configure such measures, which is both time-consuming and requires extensive domain expertise. <br /> We propose a deep Siamese neural network, capable of learning a similarity measure that is tailored to the characteristics of a particular dataset. With the properties of deep learning methods, we are able to eliminate the manual feature engineering process and thus considerably reduce the effort required for model construction. In addition, we show that it is possible to transfer knowledge acquired during the deduplication of one dataset to another, and thus significantly reduce the amount of data required to train a similarity measure. We evaluated our method on multiple datasets and compare our approach to state-of-the-art deduplication methods. Our approach outperforms competitors by up to +26 percent F-measure, depending on task and dataset. In addition, we show that knowledge transfer is not only feasible, but in our experiments led to an improvement in F-measure of up to +4.7 percent.
Unique column combinations of a relational database table are sets of columns that contain only unique values. Discovering such combinations is a fundamental research problem and has many different data management and knowledge discovery applications. Existing discovery algorithms are either brute force or have a high memory load and can thus be applied only to small datasets or samples. In this paper, the wellknown GORDIAN algorithm and "Apriori-based" algorithms are compared and analyzed for further optimization. We greatly improve the Apriori algorithms through efficient candidate generation and statistics-based pruning methods. A hybrid solution HCAGORDIAN combines the advantages of GORDIAN and our new algorithm HCA, and it significantly outperforms all previous work in many situations.
VLDB 2021
(2021)
The 47th International Conference on Very Large Databases (VLDB'21) was held on August 16-20, 2021 as a hybrid conference. It attracted 180 in-person attendees in Copenhagen and 840 remote attendees. In this paper, we describe our key decisions as general chairs and program committee chairs and share the lessons we learned.
Discovery of high and very high-energy emission from the BL Lacertae object SHBL J001355.9-185406
(2013)
The detection of the high-frequency peaked BL Lac object (HBL) SHBL J001355.9-185406 (z = 0.095) at high (HE; 100 MeV < E < 300 GeV) and very high-energy (VHE; E > 100 GeV) with the Fermi Large Area Telescope (LAT) and the High Energy Stereoscopic System (H.E.S.S.) is reported. Dedicated observations were performed with the H. E. S. S. telescopes, leading to a detection at the 5.5 sigma significance level. The measured flux above 310 GeV is (8.3 +/- 1.7(stat) +/- 1.7(sys)) x 10(-13) photons cm(-2) s(-1) (about 0.6% of that of the Crab Nebula), and the power-law spectrum has a photon index of Gamma = 3.4 +/- 0.5(stat) +/- 0.2(sys). Using 3.5 years of publicly available Fermi-LAT data, a faint counterpart has been detected in the LAT data at the 5.5 sigma significance level, with an integrated flux above 300 MeV of (9.3 +/- 3.4(stat) +/- 0.8(sys)) x 10(-10) photons cm(-2) s(-1) and a photon index of Gamma = 1.96 +/- 0.20(stat) +/- 0.08(sys). X-ray observations with Swift-XRT allow the synchrotron peak energy in vF(v) representation to be located at similar to 1.0 keV. The broadband spectral energy distribution is modelled with a one-zone synchrotron self-Compton (SSC) model and the optical data by a black-body emission describing the thermal emission of the host galaxy. The derived parameters are typical of HBLs detected at VHE, with a particle-dominated jet.
Context. About 40% of the observation time of the High Energy Stereoscopic System (H.E.S.S.) is dedicated to studying active galactic nuclei (AGN), with the aim of increasing the sample of known extragalactic very-high-energy (VHE, E > 100 GeV) sources and constraining the physical processes at play in potential emitters.
Aims. H.E.S.S. observations of AGN, spanning a period from April 2004 to December 2011, are investigated to constrain their gamma-ray fluxes. Only the 47 sources without significant excess detected at the position of the targets are presented.
Methods. Upper limits on VHE fluxes of the targets were computed and a search for variability was performed on the nightly time scale.
Results. For 41 objects, the flux upper limits we derived are the most constraining reported to date. These constraints at VHE are compared with the flux level expected from extrapolations of Fermi-LAT measurements in the two-year catalog of AGN. The H.E.S.S. upper limits are at least a factor of two lower than the extrapolated Fermi-LAT fluxes for 11 objects Taking into account the attenuation by the extragalactic background light reduces the tension for all but two of them, suggesting intrinsic curvature in the high-energy spectra of these two AGN.
Conclusions. Compilation efforts led by current VHE instruments are of critical importance for target-selection strategies before the advent of the Cherenkov Telescope Array (CTA).
Context. On March 4, 2013 the Fermi-EAT and AGILE reported a flare from the direction of the Crab nebula in which the high-energy (HE; E > 100 MeV) flux was six times above its quiescent level. Simultaneous observations in other energy bands give us hints about the emission processes during the flare episode and the physics of pulsar wind nebulae in general.
Aims. We search for variability in the emission of the Crab nebula at very-high energies (VHF,; E > 100 GeV), using contemporaneous data taken with the H.E.S.S. array of Cherenkov telescopes.
Methods. Observational data taken with the H.E.S.S. instrument on five consecutive days during the flare were analysed for the flux and spectral shape of the emission from the Crab nebula. Night-wise light curves are presented with energy thresholds of 1 TeV and 5 TeV.
Results. The observations conducted with H.E.S.S. on March 6 to March 10, 2013 show no significant changes in the flux. They limit the variation in the integral flux above 1 TeV to less than 63% and the integral flux above 5 TeV to less than 78% at a 95% confidence level.
The results of follow-up observations of the TeV gamma-ray source HESS J1640-465 from 2004 to 2011 with the High Energy Stereoscopic System (HESS) are reported in this work. The spectrum is well described by an exponential cut-off power law with photon index Gamma = 2.11 +/- 0.09(stat) +/- 0.10(sys), and a cut-off energy of E-2 = 6.0(-1.2)(+2.0) TeV. The TeV emission is significantly extended and overlaps with the northwestern part of the shell of the SNR G338.3-0.0. The new HESS results, a re-analysis of archival XMM-Newton data and multiwavelength observations suggest that a significant part of the gamma-ray emission from HESS J1640-465 originates in the supernova remnant shell. In a hadronic scenario, as suggested by the smooth connection of the GeV and TeV spectra, the product of total proton energy and mean target density could be as high as W(p)n(H) similar to 4 x 10(52)(d/10kpc)(2) erg cm(-3).
Aims. Previous observations with the High Energy Stereoscopic System (H.E.S.S.) have revealed an extended very-high-energy (VHE; E > 100 GeV) gamma-ray source, HESS J1834-087, coincident with the supernova remnant (SNR) W41. The origin of the gamma-ray emission was investigated in more detail with the H.E.S.S. array and the Large Area Telescope (LAT) onboard the Fermi Gamma-ray Space Telescope.
Methods. The gamma-ray data provided by 61 h of observations with H.E.S.S., and four years with the Fermi LAT were analyzed, covering over five decades in energy from 1.8 GeV up to 30 TeV. The morphology and spectrum of the TeV and GeV sources were studied and multiwavelength data were used to investigate the origin of the gamma-ray emission toward W41.
Results. The TeV source can be modeled with a sum of two components: one point-like and one significantly extended (sigma(TeV) = 0.17 degrees +/- 0.01 degrees), both centered on SNR W41 and exhibiting spectra described by a power law with index Gamma(TeV) similar or equal to 2.6. The GeV source detected with Fermi LAT is extended (sigma(GeV) = 0.15 degrees +/- 0.03 degrees) and morphologically matches the VHE emission. Its spectrum can be described by a power-law model with an index Gamma(GeV) = 2.15 +/- 0.12 and smoothly joins the spectrum of the whole TeV source. A break appears in the gamma-ray spectra around 100 GeV. No pulsations were found in the GeV range.
Conclusions. Two main scenarios are proposed to explain the observed emission: a pulsar wind nebula (PWN) or the interaction of SNR W41 with an associated molecular cloud. X-ray observations suggest the presence of a point-like source (a pulsar candidate) near the center of the remnant and nonthermal X-ray diffuse emission that could arise from the possibly associated PWN. The PWN scenario is supported by the compatible positions of the TeV and GeV sources with the putative pulsar. However, the spectral energy distribution from radio to gamma-rays is reproduced by a one-zone leptonic model only if an excess of low-energy electrons is injected following a Maxwellian distribution by a pulsar with a high spin-down power (> 10(37) erg s(-1)). This additional low-energy component is not needed if we consider that the point-like TeV source is unrelated to the extended GeV and TeV sources. The interacting SNR scenario is supported by the spatial coincidence between the gamma-ray sources, the detection of OH (1720 MHz) maser lines, and the hadronic modeling.
Discovery of very high energy gamma-ray emission from the BL Lacertae
object PKS0301-243 with HESS
(2013)
The active galactic nucleus PKS 0301-243 (z = 0.266) is a high-synchrotron-peaked BL Lac object that is detected at high energies (HE, 100 MeV < E < 100 GeV) by Fermi/LAT. This paper reports on the discovery of PKS 0301-243 at very high energies (E > 100 GeV) by the High Energy Stereoscopic System (H.E.S.S.) from observations between September 2009 and December 2011 for a total live time of 34.9 h. Gamma rays above 200 GeV are detected at a significance of 9.4 sigma. A hint of variability at the 2.5 sigma level is found. An integral flux I(E > 200GeV) = (3.3 +/- 1.1(stat) +/- 0.7(syst)) x 10(-12) ph cm(-2) s(-1) and a photon index Gamma = 4.6 +/- 0.7(stat) +/- 0.2(syst) are measured. Multi-wavelength light curves in HE, X-ray and optical bands show strong variability, and a minimal variability timescale of eight days is estimated from the optical light curve. A single-zone leptonic synchrotron self-Compton scenario satisfactorily reproduces the multi-wavelength data. In this model, the emitting region is out of equipartition and the jet is particle dominated. Because of its high redshift compared to other sources observed at TeV energies, the very high energy emission from PKS 0301-243 is attenuated by the extragalactic background light (EBL) and the measured spectrum is used to derive an upper limit on the opacity of the EBL.
Composite supernova remnants (SNRs) constitute a small subclass of the remnants of massive stellar explosions where non-thermal radiation is observed from both the expanding shell-like shock front and from a pulsar wind nebula (PWN) located inside of the SNR. These systems represent a unique evolutionary phase of SNRs where observations in the radio, X-ray, and gamma-ray regimes allow the study of the co-evolution of both these energetic phenomena. In this article, we report results from observations of the shell-type SNR G15.4+0.1 performed with the High Energy Stereoscopic System (H. E. S. S.) and XMM-Newton. A compact TeV gamma-ray source, HESS J1818-154, located in the center and contained within the shell of G15.4+0.1 is detected by H. E. S. S. and featurs a spectrum best represented by a power-law model with a spectral index of -2.3 +/- 0.3(stat) +/- 0.2(sys) and an integral flux of F(>0.42 TeV) = (0.9 +/- 0.3(stat) +/- 0.2(sys)) x 10(-12) cm(-2) s(-1). Furthermore, a recent observation with XMM-Newton reveals extended X-ray emission strongly peaked in the center of G15.4+0.1. The X-ray source shows indications of an energy-dependent morphology featuring a compact core at energies above 4 keV and more extended emission that fills the entire region within the SNR at lower energies. Together, the X-ray and VHE gamma-ray emission provide strong evidence of a PWN located inside the shell of G15.4+0.1 and this SNR can therefore be classified as a composite based on these observations. The radio, X-ray, and gamma-ray emission from the PWN is compatible with a one-zone leptonic model that requires a low average magnetic field inside the emission region. An unambiguous counterpart to the putative pulsar, which is thought to power the PWN, has been detected neither in radio nor in X-ray observations of G15.4+0.1.
Context. Very-high-energy (VHE; E > 100 GeV) gamma-ray emission from blazars inevitably gives rise to electron-positron pair production through the interaction of these gamma-rays with the extragalactic background light (EBL). Depending on the magnetic fields in the proximity of the source, the cascade initiated from pair production can result in either an isotropic halo around an initially- beamed source or a magnetically- broadened cascade :aux.
Aims. Both extended pair-halo (PH) and magnetically broadened cascade (MBC) emission from regions surrounding the blazars 1ES 1101-232, IRS 0229+200, and PKS 2155-304 were searched for using VHE y-ray data taken with the High Energy Stereoscopic System (HESS.) and high-energy (HE; 100 MeV < E < 100 GeV) gamma-ray data with the Fermi Large Area Telescope (LAT).
Methods. By comparing the angular distributions of the reconstructed gamma-ray events to the angular profiles calculated from detailed theoretical models, the presence of PH and MBC was investigated.
Results. Upper limits on the extended emission around lES 1101-232, lES 0229+200, and PKS 2155-304 are found to be at a level of a few per cent of the Crab nebula flux above 1 TeV, depending on the assumed photon index of the cascade emission. Assuming strong extra-Galactic magnetic field (EGME) values, >10(-12) G, this limits the production of pair haloes developing from electromagnetic cascades. For weaker magnetic fields, in which electromagnetic cascades would result in MBCs. EGMF strengths in the range (0.3-3) x 10(-15) G were excluded for PKS 2155-304 at the 99% confidence level, under the assumption of a 1 Mpc coherence length.
A deep observation campaign carried out by the High Energy Stereoscopic System (HESS) on Centaurus A enabled the discovery of gamma-rays from the blazar 1ES 1312-423, 2 degrees away from the radio galaxy. With a differential flux at 1 TeV of phi(1 TeV) = (1.9 +/- 0.6(stat) +/- 0.4(sys)) x 10(-13) cm(-2) s(-1) TeV-1 corresponding to 0.5 per cent of the Crab nebula differential flux and a spectral index Gamma = 2.9 +/- 0.5(stat) +/- 0.2(sys), 1ES 1312-423 is one of the faintest sources ever detected in the very high energy (E > 100 GeV) extragalactic sky. A careful analysis using three and a half years of Fermi Large Area Telescope (Fermi-LAT) data allows the discovery at high energies (E > 100 MeV) of a hard spectrum (Gamma = 1.4 +/- 0.4(stat) +/- 0.2(sys)) source coincident with 1ES 1312-423. Radio, optical, UV and X-ray observations complete the spectral energy distribution of this blazar, now covering 16 decades in energy. The emission is successfully fitted with a synchrotron self-Compton model for the non-thermal component, combined with a blackbody spectrum for the optical emission from the host galaxy.
Search for TeV Gamma-ray emission from GRB 100621A, an extremely bright GRB in X-rays, with HESS
(2014)
The long gamma-ray burst (GRB) 100621A, at the time the brightest X-ray transient ever detected by Swift-XRT in the 0.3-10 keV range, has been observed with the H.E.S.S. imaging air Cherenkov telescope array, sensitive to gamma radiation in the very-high-energy (VHE, >100 GeV) regime. Due to its relatively small redshift of z similar to 0.5, the favourable position in the southern sky and the relatively short follow-up time (<700 s after the satellite trigger) of the H.E.S.S. observations, this GRB could be within the sensitivity reach of the HESS. instrument. The analysis of the HESS. data shows no indication of emission and yields an integral flux upper limit above similar to 380 GeV of 4.2 x 10(-12) cm(-2) s(-1) s (95% confidence level), assuming a simple Band function extension model. A comparison to a spectral-temporal model, normalised to the prompt flux at sub-MeV energies, constraints the existence of a temporally extended and strong additional hard power law, as has been observed in the other bright X-ray GRB 130427A. A comparison between the HESS. upper limit and the contemporaneous energy output in X-rays constrains the ratio between the X-ray and VHE gamma-ray fluxes to be greater than 0.4. This value is an important quantity for modelling the afterglow and can constrain leptonic emission scenarios, where leptons are responsible for the X-ray emission and might produce VHE gamma rays.