
004 Data processing; Computer science


Using interpretability approaches to update "black-box" clinical prediction models (2021)

Freitas da Cruz, Harry ; Pfahringer, Boris ; Martensen, Tom ; Schneider, Frederic ; Meyer, Alexander ; Böttinger, Erwin ; Schapranow, Matthieu-Patrick

Despite advances in machine learning-based clinical prediction models, only a few such models are actually deployed in clinical contexts. Among other reasons, this is due to a lack of validation studies. In this paper, we present and discuss the validation results of a machine learning model for predicting acute kidney injury in cardiac surgery patients, initially developed on the MIMIC-III dataset, when applied to an external cohort from an American research hospital. To help account for the performance differences observed, we used interpretability methods based on feature importance, which allowed experts to scrutinize model behavior at both the global and local level and to gain further insight into why the model did not behave as expected on the validation cohort. The knowledge gleaned during derivation can potentially assist model updating during validation, leading to simpler and more generalizable models. We argue that practitioners should consider interpretability methods as a further tool to help explain performance differences and inform model updating in validation studies.

Phe2vec (2021)

De Freitas, Jessica K. ; Johnson, Kipp W. ; Golden, Eddye ; Nadkarni, Girish N. ; Dudley, Joel T. ; Böttinger, Erwin ; Glicksberg, Benjamin S. ; Miotto, Riccardo

Robust phenotyping of patients from electronic health records (EHRs) at scale is a challenge in clinical informatics. Here, we introduce Phe2vec, an automated framework for disease phenotyping from EHRs based on unsupervised learning, and assess its effectiveness against standard rule-based algorithms from the Phenotype KnowledgeBase (PheKB). Phe2vec is based on pre-computing embeddings of medical concepts and of patients' clinical histories. Disease phenotypes are then derived from a seed concept and its neighbors in the embedding space. Patients are linked to a disease if their embedded representation is close to the disease phenotype. In a head-to-head comparison of Phe2vec and PheKB cohorts using chart review, Phe2vec performed on par with or better than PheKB in nine out of ten diseases. Unlike other approaches, it can scale to any condition and was validated against widely adopted expert-based standards. Phe2vec aims to optimize clinical informatics research by augmenting current frameworks to characterize patients by condition and derive reliable disease cohorts.
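
The seed-and-neighbors idea described in the abstract above can be illustrated with a minimal sketch. This is not the Phe2vec implementation: the toy two-dimensional vectors, the function names, the use of cosine similarity, the averaging of concept vectors into a phenotype vector, and the 0.8 threshold are all assumptions made for illustration, since the abstract does not specify them.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def mean_vector(vectors):
    """Element-wise mean of a list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def phenotype_from_seed(seed, concept_vecs, k=2):
    """Return the seed concept plus its k nearest neighbors (assumed: cosine)."""
    others = [c for c in concept_vecs if c != seed]
    others.sort(key=lambda c: cosine(concept_vecs[seed], concept_vecs[c]),
                reverse=True)
    return [seed] + others[:k]

def link_patients(patient_vecs, phenotype_vec, threshold=0.8):
    """Return patients whose embedding is close to the phenotype vector.

    The threshold is a hypothetical value chosen for this toy example.
    """
    return sorted(p for p, v in patient_vecs.items()
                  if cosine(v, phenotype_vec) >= threshold)

# Invented toy embeddings; real embeddings would be learned from EHR data.
concept_vecs = {
    "diabetes": [1.0, 0.1],
    "metformin": [0.9, 0.2],
    "hba1c_test": [0.8, 0.3],
    "fracture": [0.0, 1.0],
    "cast": [0.1, 0.9],
}
patient_vecs = {
    "patient_a": [0.9, 0.2],
    "patient_b": [0.1, 1.0],
}

phenotype = phenotype_from_seed("diabetes", concept_vecs, k=2)
# Assumption: the phenotype is summarized as the mean of its concept vectors.
phenotype_vec = mean_vector([concept_vecs[c] for c in phenotype])
cohort = link_patients(patient_vecs, phenotype_vec)
print(phenotype, cohort)
```

In this toy setup the phenotype consists of the seed and the two concepts closest to it, and only the patient whose vector lies near that group is linked to the disease. How the published framework aggregates concepts and measures closeness may differ.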
