CUBCO+: prediction of protein complexes based on min-cut network partitioning into biclique spanned subgraphs
- High-throughput proteomics approaches have resulted in large-scale protein–protein interaction (PPI) networks that have been employed for the prediction of protein complexes. However, PPI networks contain false-positive as well as false-negative PPIs that affect the protein complex prediction algorithms. To address this issue, here we propose an algorithm called CUBCO+ that: (1) employs GO semantic similarity to retain only biologically relevant interactions with a high similarity score, (2) based on link prediction approaches, scores the false-negative edges, and (3) incorporates the resulting scores to predict protein complexes. Through comprehensive analyses with PPIs from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, we show that CUBCO+ performs as well as the approaches that predict protein complexes based on recently introduced graph partitions into biclique spanned subgraphs and outperforms the other state-of-the-art approaches. Moreover, we illustrate that in combination with GO semantic similarity,High-throughput proteomics approaches have resulted in large-scale protein–protein interaction (PPI) networks that have been employed for the prediction of protein complexes. However, PPI networks contain false-positive as well as false-negative PPIs that affect the protein complex prediction algorithms. To address this issue, here we propose an algorithm called CUBCO+ that: (1) employs GO semantic similarity to retain only biologically relevant interactions with a high similarity score, (2) based on link prediction approaches, scores the false-negative edges, and (3) incorporates the resulting scores to predict protein complexes. Through comprehensive analyses with PPIs from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, we show that CUBCO+ performs as well as the approaches that predict protein complexes based on recently introduced graph partitions into biclique spanned subgraphs and outperforms the other state-of-the-art approaches. Moreover, we illustrate that in combination with GO semantic similarity, CUBCO+ enables us to predict more accurate protein complexes in 36% of the cases in comparison to CUBCO as its predecessor.…
Verfasserangaben: | Sara OmranianORCiDGND, Zoran NikoloskiORCiDGND |
---|---|
DOI: | https://doi.org/10.1007/s41109-022-00508-5 |
ISSN: | 2364-8228 |
Titel des übergeordneten Werks (Englisch): | Applied Network Science |
Verlag: | Springer International Publishing |
Verlagsort: | Cham |
Publikationstyp: | Wissenschaftlicher Artikel |
Sprache: | Englisch |
Datum der Erstveröffentlichung: | 11.10.2022 |
Erscheinungsjahr: | 2022 |
Datum der Freischaltung: | 03.04.2023 |
Freies Schlagwort / Tag: | Network clustering; Protein complexes; Protein–protein interaction; Species comparison |
Band: | 7 |
Aufsatznummer: | 71 |
Seitenanzahl: | 12 |
Fördernde Institution: | Universität Potsdam |
Fördernde Institution: | Deutsche Forschungsgemeinschaft (DFG) |
Fördernummer: | PA 2022_189 |
Fördernummer: | Projektnummer 491466077 |
Organisationseinheiten: | Mathematisch-Naturwissenschaftliche Fakultät / Institut für Biochemie und Biologie |
DDC-Klassifikation: | 3 Sozialwissenschaften / 30 Sozialwissenschaften, Soziologie / 300 Sozialwissenschaften |
Peer Review: | Referiert |
Fördermittelquelle: | Publikationsfonds der Universität Potsdam |
Publikationsweg: | Open Access / Gold Open-Access |
Lizenz (Deutsch): | CC-BY - Namensnennung 4.0 International |
Externe Anmerkung: | Zweitveröffentlichung in der Schriftenreihe Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe ; 1315 |