• search hit 11 of 1103
Back to Result List

Large-scale literature mining to assess the relation between anti-cancer drugs and cancer types

  • Background: There is a huge body of scientific literature describing the relation between tumor types and anti-cancer drugs. The vast amount of scientific literature makes it impossible for researchers and physicians to extract all relevant information manually. Methods: In order to cope with the large amount of literature we applied an automated text mining approach to assess the relations between 30 most frequent cancer types and 270 anti-cancer drugs. We applied two different approaches, a classical text mining based on named entity recognition and an AI-based approach employing word embeddings. The consistency of literature mining results was validated with 3 independent methods: first, using data from FDA approvals, second, using experimentally measured IC-50 cell line data and third, using clinical patient survival data. Results: We demonstrated that the automated text mining was able to successfully assess the relation between cancer types and anti-cancer drugs. All validation methods showed a good correspondenceBackground: There is a huge body of scientific literature describing the relation between tumor types and anti-cancer drugs. The vast amount of scientific literature makes it impossible for researchers and physicians to extract all relevant information manually. Methods: In order to cope with the large amount of literature we applied an automated text mining approach to assess the relations between 30 most frequent cancer types and 270 anti-cancer drugs. We applied two different approaches, a classical text mining based on named entity recognition and an AI-based approach employing word embeddings. The consistency of literature mining results was validated with 3 independent methods: first, using data from FDA approvals, second, using experimentally measured IC-50 cell line data and third, using clinical patient survival data. Results: We demonstrated that the automated text mining was able to successfully assess the relation between cancer types and anti-cancer drugs. All validation methods showed a good correspondence between the results from literature mining and independent confirmatory approaches. The relation between most frequent cancer types and drugs employed for their treatment were visualized in a large heatmap. All results are accessible in an interactive web-based knowledge base using the following link: . Conclusions: Our approach is able to assess the relations between compounds and cancer types in an automated manner. Both, cancer types and compounds could be grouped into different clusters. Researchers can use the interactive knowledge base to inspect the presented results and follow their own research questions, for example the identification of novel indication areas for known drugs.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Chris BauerORCiDGND, Ralf Herwig, Matthias LienhardGND, Paul PrasseORCiDGND, Tobias SchefferORCiD, Johannes Schuchhardt
DOI:https://doi.org/10.1186/s12967-021-02941-z
ISSN:1479-5876
Pubmed ID:https://pubmed.ncbi.nlm.nih.gov/34174885
Title of parent work (English):Journal of translational medicine
Publisher:BioMed Central
Place of publishing:London
Publication type:Article
Language:English
Date of first publication:2021/06/26
Publication year:2021
Release date:2024/01/22
Tag:Anti-cancer drugs; Database; Literature mining; Tumor types; Word embeddings
Volume:19
Issue:1
Article number:274
Number of pages:13
Funding institution:Bundesministerium fur Bildung und ForschungFederal Ministry of Education & Research (BMBF) [01IS18044A-C]
Organizational units:Mathematisch-Naturwissenschaftliche Fakultät / Institut für Informatik und Computational Science
DDC classification:6 Technik, Medizin, angewandte Wissenschaften / 61 Medizin und Gesundheit / 610 Medizin und Gesundheit
Peer review:Referiert
Publishing method:Open Access / Gold Open-Access
DOAJ gelistet
License (German):License LogoCC-BY - Namensnennung 4.0 International
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.