• Treffer 1 von 11
Zurück zur Trefferliste

FIBER

  • Objectives: The development of clinical predictive models hinges upon the availability of comprehensive clinical data. Tapping into such resources requires considerable effort from clinicians, data scientists, and engineers. Specifically, these efforts are focused on data extraction and preprocessing steps required prior to modeling, including complex database queries. A handful of software libraries exist that can reduce this complexity by building upon data standards. However, a gap remains concerning electronic health records (EHRs) stored in star schema clinical data warehouses, an approach often adopted in practice. In this article, we introduce the FlexIBle EHR Retrieval (FIBER) tool: a Python library built on top of a star schema (i2b2) clinical data warehouse that enables flexible generation of modeling-ready cohorts as data frames. Materials and Methods: FIBER was developed on top of a large-scale star schema EHR database which contains data from 8 million patients and over 120 million encounters. To illustrateObjectives: The development of clinical predictive models hinges upon the availability of comprehensive clinical data. Tapping into such resources requires considerable effort from clinicians, data scientists, and engineers. Specifically, these efforts are focused on data extraction and preprocessing steps required prior to modeling, including complex database queries. A handful of software libraries exist that can reduce this complexity by building upon data standards. However, a gap remains concerning electronic health records (EHRs) stored in star schema clinical data warehouses, an approach often adopted in practice. In this article, we introduce the FlexIBle EHR Retrieval (FIBER) tool: a Python library built on top of a star schema (i2b2) clinical data warehouse that enables flexible generation of modeling-ready cohorts as data frames. Materials and Methods: FIBER was developed on top of a large-scale star schema EHR database which contains data from 8 million patients and over 120 million encounters. To illustrate FIBER's capabilities, we present its application by building a heart surgery patient cohort with subsequent prediction of acute kidney injury (AKI) with various machine learning models. Results: Using FIBER, we were able to build the heart surgery cohort (n = 12 061), identify the patients that developed AKI (n = 1005), and automatically extract relevant features (n = 774). Finally, we trained machine learning models that achieved area under the curve values of up to 0.77 for this exemplary use case. Conclusion: FIBER is an open-source Python library developed for extracting information from star schema clinical data warehouses and reduces time-to-modeling, helping to streamline the clinical modeling process.zeige mehrzeige weniger

Metadaten exportieren

Weitere Dienste

Suche bei Google Scholar Statistik - Anzahl der Zugriffe auf das Dokument
Metadaten
Verfasserangaben:Suparno DattaORCiD, Jan Philipp Sachs, Harry Freitas da CruzORCiDGND, Tom Martensen, Philipp BodeORCiD, Ariane Morassi SassoORCiD, Benjamin S. GlicksbergORCiD, Erwin BöttingerGND
DOI:https://doi.org/10.1093/jamiaopen/ooab048
ISSN:2574-2531
Pubmed ID:https://pubmed.ncbi.nlm.nih.gov/34350388
Titel des übergeordneten Werks (Englisch):JAMIA open
Untertitel (Englisch):enabling flexible retrieval of electronic health records data for clinical predictive modeling
Verlag:Oxford Univ. Press
Verlagsort:Oxford
Publikationstyp:Wissenschaftlicher Artikel
Sprache:Englisch
Datum der Erstveröffentlichung:02.08.2021
Erscheinungsjahr:2021
Datum der Freischaltung:12.01.2024
Freies Schlagwort / Tag:databases; electronic health records; factual; information storage and; retrieval; software/instrumentation; workflow
Band:4
Ausgabe:3
Aufsatznummer:ooab048
Seitenanzahl:10
Fördernde Institution:Office of Research Infrastructure of the National Institutes of Health [S10OD026880]; European UnionEuropean Commission [826117]
Organisationseinheiten:An-Institute / Hasso-Plattner-Institut für Digital Engineering gGmbH
DDC-Klassifikation:6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit
Peer Review:Referiert
Publikationsweg:Open Access / Gold Open-Access
DOAJ gelistet
Lizenz (Deutsch):License LogoCC-BY-NC - Namensnennung, nicht kommerziell 4.0 International
Verstanden ✔
Diese Webseite verwendet technisch erforderliche Session-Cookies. Durch die weitere Nutzung der Webseite stimmen Sie diesem zu. Unsere Datenschutzerklärung finden Sie hier.