The search result changed since you submitted your search request. Documents might be displayed in a different sort order.
  • search hit 16 of 40
Back to Result List

Challenges in applying machine learning models for hydrological inference: a case study for flooding events across Germany

  • Machine learning (ML) algorithms are being increasingly used in Earth and Environmental modeling studies owing to the ever-increasing availability of diverse data sets and computational resources as well as advancement in ML algorithms. Despite advances in their predictive accuracy, the usefulness of ML algorithms for inference remains elusive. In this study, we employ two popular ML algorithms, artificial neural networks and random forest, to analyze a large data set of flood events across Germany with the goals to analyze their predictive accuracy and their usability to provide insights to hydrologic system functioning. The results of the ML algorithms are contrasted against a parametric approach based on multiple linear regression. For analysis, we employ a model-agnostic framework named Permuted Feature Importance to derive the influence of models' predictors. This allows us to compare the results of different algorithms for the first time in the context of hydrology. Our main findings are that (1) the ML models achieve higherMachine learning (ML) algorithms are being increasingly used in Earth and Environmental modeling studies owing to the ever-increasing availability of diverse data sets and computational resources as well as advancement in ML algorithms. Despite advances in their predictive accuracy, the usefulness of ML algorithms for inference remains elusive. In this study, we employ two popular ML algorithms, artificial neural networks and random forest, to analyze a large data set of flood events across Germany with the goals to analyze their predictive accuracy and their usability to provide insights to hydrologic system functioning. The results of the ML algorithms are contrasted against a parametric approach based on multiple linear regression. For analysis, we employ a model-agnostic framework named Permuted Feature Importance to derive the influence of models' predictors. This allows us to compare the results of different algorithms for the first time in the context of hydrology. Our main findings are that (1) the ML models achieve higher prediction accuracy than linear regression, (2) the results reflect basic hydrological principles, but (3) further inference is hindered by the heterogeneity of results across algorithms. Thus, we conclude that the problem of equifinality as known from classical hydrological modeling also exists for ML and severely hampers its potential for inference. To account for the observed problems, we propose that when employing ML for inference, this should be made by using multiple algorithms and multiple methods, of which the latter should be embedded in a cross-validation routine.show moreshow less

Download full text files

  • pmnr1193.pdfeng
    (3773KB)

    SHA-51296f93d4c22976413f534cce5ffad50407d08e75d80af195572a747a87d2fe0c0201eb7c17f1e0430322b745982010ad4d30cb7323c4e047ecdf09ab706eef13f

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Lennart SchmidtORCiD, Falk HeßeORCiDGND, Sabine AttingerORCiDGND, Rohini KumarORCiDGND
URN:urn:nbn:de:kobv:517-opus4-523843
DOI:https://doi.org/10.25932/publishup-52384
ISSN:1866-8372
Title of parent work (German):Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
Publication series (Volume number):Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe (1193)
Publication type:Postprint
Language:English
Date of first publication:2019/07/04
Publication year:2020
Publishing institution:Universität Potsdam
Release date:2021/11/03
Tag:floods; inference; machine learning
Issue:5
Article number:e2019WR025924
Number of pages:12
Source:Water Resources Research, 56, e2019WR025924. https://doi.org/10.1029/2019WR025924
Organizational units:Mathematisch-Naturwissenschaftliche Fakultät / Institut für Umweltwissenschaften und Geographie
DDC classification:5 Naturwissenschaften und Mathematik / 55 Geowissenschaften, Geologie / 550 Geowissenschaften
Peer review:Referiert
Publishing method:Open Access / Green Open-Access
License (German):License LogoCC-BY - Namensnennung 4.0 International
External remark:Bibliographieeintrag der Originalveröffentlichung/Quelle
External remark:Bibliographieeintrag der Originalveröffentlichung/Quelle
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.