The search result changed since you submitted your search request. Documents might be displayed in a different sort order.
  • search hit 9 of 95
Back to Result List

Biological cluster evaluation for gene function prediction

  • Recent advances in high-throughput omics techniques render it possible to decode the function of genes by using the "guilt-by-association" principle on biologically meaningful clusters of gene expression data. However, the existing frameworks for biological evaluation of gene clusters are hindered by two bottleneck issues: (1) the choice for the number of clusters, and (2) the external measures which do not take in consideration the structure of the analyzed data and the ontology of the existing biological knowledge. Here, we address the identified bottlenecks by developing a novel framework that allows not only for biological evaluation of gene expression clusters based on existing structured knowledge, but also for prediction of putative gene functions. The proposed framework facilitates propagation of statistical significance at each of the following steps: (1) estimating the number of clusters, (2) evaluating the clusters in terms of novel external structural measures, (3) selecting an optimal clustering algorithm, and (4)Recent advances in high-throughput omics techniques render it possible to decode the function of genes by using the "guilt-by-association" principle on biologically meaningful clusters of gene expression data. However, the existing frameworks for biological evaluation of gene clusters are hindered by two bottleneck issues: (1) the choice for the number of clusters, and (2) the external measures which do not take in consideration the structure of the analyzed data and the ontology of the existing biological knowledge. Here, we address the identified bottlenecks by developing a novel framework that allows not only for biological evaluation of gene expression clusters based on existing structured knowledge, but also for prediction of putative gene functions. The proposed framework facilitates propagation of statistical significance at each of the following steps: (1) estimating the number of clusters, (2) evaluating the clusters in terms of novel external structural measures, (3) selecting an optimal clustering algorithm, and (4) predicting gene functions. The framework also includes a method for evaluation of gene clusters based on the structure of the employed ontology. Moreover, our method for obtaining a probabilistic range for the number of clusters is demonstrated valid on synthetic data and available gene expression profiles from Saccharomyces cerevisiae. Finally, we propose a network-based approach for gene function prediction which relies on the clustering of optimal score and the employed ontology. Our approach effectively predicts gene function on the Saccharomyces cerevisiae data set and is also employed to obtain putative gene functions for an Arabidopsis thaliana data set.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Sebastian Klie, Zoran NikoloskiORCiDGND, Joachim SelbigGND
DOI:https://doi.org/10.1089/cmb.2009.0129
ISSN:1066-5277
ISSN:1557-8666
Pubmed ID:https://pubmed.ncbi.nlm.nih.gov/20059365
Title of parent work (English):Journal of computational biology
Publisher:Liebert
Place of publishing:New Rochelle
Publication type:Article
Language:English
Year of first publication:2014
Publication year:2014
Release date:2017/03/27
Tag:NP-completeness; algorithms; biochemical networks; combinatorics; computational molecular biology; databases; functional genomics; gene expression
Volume:21
Issue:6
Number of pages:18
First page:428
Last Page:445
Organizational units:Mathematisch-Naturwissenschaftliche Fakultät / Institut für Biochemie und Biologie
Peer review:Referiert
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.