Background: Protein phosphorylation is an important post-translational modification influencing many aspects of dynamic cellular behavior. Site-specific phosphorylation of the amino acid residues serine, threonine, and tyrosine can have profound effects on protein structure, activity, stability, and interaction with other biomolecules. Phosphorylation sites can be affected in diverse ways in members of any species; one such way is through single nucleotide polymorphisms (SNPs). The availability of large numbers of experimentally identified phosphorylation sites and of natural variation datasets in Arabidopsis thaliana prompted us to analyze the effect of non-synonymous SNPs (nsSNPs) on phosphorylation sites.
Results: From the analyses of 7,178 experimentally identified phosphorylation sites we found that: (i) Proteins with multiple phosphorylation sites occur more often than expected by chance. (ii) Phosphorylation hotspots show a preference to be located outside conserved domains. (iii) nsSNPs affected experimental phosphorylation sites as much as the corresponding non-phosphorylated amino acid residues. (iv) Losses of experimental phosphorylation sites by nsSNPs were identified in 86 A. thaliana proteins, among which receptor proteins were overrepresented.
These results were confirmed by similar analyses of predicted phosphorylation sites in A. thaliana. In addition, predicted threonine phosphorylation sites showed a significant enrichment of nsSNPs towards asparagines and a significant depletion of the synonymous substitution. Proteins in which predicted phosphorylation sites were affected by nsSNPs (loss and gain) were mainly receptor proteins, stress response proteins and proteins involved in nucleotide and protein binding. Proteins involved in metabolism, catalytic activity and biosynthesis were less affected.
Conclusions: We analyzed more than 7,100 experimentally identified phosphorylation sites in almost 4,300 protein-coding loci in silico, thus constituting the largest phosphoproteomics dataset for A. thaliana available to date. Our findings suggest a relatively high variability in the presence or absence of phosphorylation sites between different natural accessions in receptor and other proteins involved in signal transduction. Elucidating the effect of phosphorylation sites affected by nsSNPs on adaptive responses represents an exciting research goal for the future.
The main objective of this study was to identify genomic regions involved in biomass heterosis using QTL, generation means, and mode-of-inheritance classification analyses. In a modified North Carolina Design III we backcrossed 429 recombinant inbred line and 140 introgression line populations to the two parental accessions, C24 and Col-0, whose F1 hybrid exhibited 44% heterosis for biomass. Mid-parent heterosis in the RILs ranged from −31 to 99% for dry weight and from −58 to 143% for leaf area. We detected ten genomic positions involved in biomass heterosis at an early developmental stage, individually explaining between 2.4 and 15.7% of the phenotypic variation. While overdominant gene action was prevalent in heterotic QTL, our results suggest that a combination of dominance, overdominance and epistasis is involved in biomass heterosis in this Arabidopsis cross.
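The percentages quoted in this abstract can be read through the conventional definition of mid-parent heterosis: the deviation of the hybrid from the mean of its two parents, relative to that mean. The sketch below is only an illustration of that definition; the function name and the dry-weight values are hypothetical and are not taken from the study.

```python
def mid_parent_heterosis(hybrid, parent_a, parent_b):
    """Return mid-parent heterosis in percent.

    Assumes the conventional definition:
    100 * (hybrid - mid_parent) / mid_parent,
    where mid_parent is the mean of the two parental trait values.
    """
    mid_parent = (parent_a + parent_b) / 2.0
    if mid_parent == 0:
        raise ValueError("mid-parent value must be non-zero")
    return 100.0 * (hybrid - mid_parent) / mid_parent


# Hypothetical dry weights (arbitrary units), chosen only for illustration:
# parents average 10.0, the hybrid weighs 14.4, so heterosis is about 44 %.
mph = mid_parent_heterosis(hybrid=14.4, parent_a=9.0, parent_b=11.0)
print(f"mid-parent heterosis: {mph:.1f} %")
```

A negative value means the hybrid performs below the parental mean, which is how the lower bounds of the reported ranges should be read.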
Prediction of hybrid biomass in Arabidopsis thaliana by selected parental SNP and metabolic markers
(2009)
A recombinant inbred line (RIL) population, derived from two Arabidopsis thaliana accessions, and the corresponding testcrosses with these two original accessions were used for the development and validation of machine learning models to predict the biomass of hybrids. Genetic and metabolic information of the RILs served as predictors. Feature selection reduced the number of variables (genetic and metabolic markers) in the models by more than 80% without impairing the predictive power. Thus, potential biomarkers have been revealed. Metabolites were shown to bear information on inherited macroscopic phenotypes. This proof of concept could be interesting for breeders. The example population exhibits substantial mid-parent biomass heterosis. The results of feature selection could therefore be used to shed light on the origin of heterosis. In this respect, mainly dominance effects were detected.
Developing and investigating detailed mathematical models of metabolic processes is one of the primary challenges in systems biology. However, despite considerable advances in the topological analysis of metabolic networks, kinetic modeling is still often severely hampered by inadequate knowledge of the enzyme-kinetic rate laws and their associated parameter values. Here we propose a method that aims to give a quantitative account of the dynamical capabilities of a metabolic system, without requiring any explicit information about the functional form of the rate equations. Our approach is based on constructing a local linear model at each point in parameter space, such that each element of the model is either directly experimentally accessible or amenable to a straightforward biochemical interpretation. This ensemble of local linear models, encompassing all possible explicit kinetic models, then allows for a statistical exploration of the comprehensive parameter space. The method is exemplified on two paradigmatic metabolic systems: the glycolytic pathway of yeast and a realistic-scale representation of the photosynthetic Calvin cycle.
Background: The biological interpretation of large-scale gene expression data is one of the paramount challenges in current bioinformatics. In particular, placing the results in the context of other available functional genomics data, such as existing bio-ontologies, has already provided substantial improvement for detecting and categorizing genes of interest. One common approach is to look for functional annotations that are significantly enriched within a group or cluster of genes, as compared to a reference group.
Results: In this work, we suggest the information-theoretic concept of mutual information to investigate the relationship between groups of genes, as given by data-driven clustering, and their respective functional categories. Drawing upon related approaches (Gibbons and Roth, Genome Research 12: 1574-1581, 2002), we seek to quantify to what extent individual attributes are sufficient to characterize a given group or cluster of genes.
Conclusion: We show that the mutual information provides a systematic framework to assess the relationship between groups or clusters of genes and their functional annotations in a quantitative way. Within this framework, the mutual information allows us to address and incorporate several important issues, such as the interdependence of functional annotations and combinations of attributes. It thus supplements and extends the conventional search for overrepresented attributes within a group or cluster of genes. In particular, by taking combinations of attributes into account, the mutual information opens the way to uncover specific functional descriptions of a group of genes or clustering result. All datasets and functional annotations used in this study are publicly available. All scripts used in the analysis are provided as additional files.
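To make the idea concrete, mutual information between a cluster assignment and a functional annotation can be estimated from simple co-occurrence counts. The sketch below uses a plain plug-in estimator on two equally long label lists. It is an illustration under that assumption, not the scripts distributed with the paper, and the function name and toy labels are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(clusters, annotations):
    """Plug-in estimate of mutual information (in bits) between two labelings.

    clusters[i] is the cluster label of gene i and annotations[i] is its
    annotation label (for example True/False for one ontology term).
    """
    n = len(clusters)
    if n == 0 or n != len(annotations):
        raise ValueError("need two non-empty label lists of equal length")

    joint = Counter(zip(clusters, annotations))  # counts of (cluster, annotation)
    cluster_counts = Counter(clusters)           # marginal counts per cluster
    annotation_counts = Counter(annotations)     # marginal counts per annotation

    mi = 0.0
    for (c, a), n_ca in joint.items():
        p_ca = n_ca / n
        # p(c, a) / (p(c) * p(a)) written with raw counts
        ratio = n_ca * n / (cluster_counts[c] * annotation_counts[a])
        mi += p_ca * log2(ratio)
    return mi


# Toy example with hypothetical labels: the annotation matches the clustering
# perfectly in the first call and is independent of it in the second.
print(mutual_information([0, 0, 1, 1], [True, True, False, False]))
print(mutual_information([0, 0, 1, 1], [True, False, True, False]))
```

Combinations of attributes can be handled with the same function by passing tuples of annotation values as the labels, since any hashable label works as a dictionary key.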
More effort — more results
(2016)
The development of 'omics' technologies has progressed to address complex biological questions that underlie various plant functions thereby producing copious amounts of data. The need to assimilate large amounts of data into biologically meaningful interpretations has necessitated the development of statistical methods to integrate multidimensional information. Throughout this review, we provide examples of recent outcomes of 'omics' data integration together with an overview of available statistical methods and tools.
F2C2
(2012)
Background: Flux coupling analysis (FCA) has become a useful tool in the constraint-based analysis of genome-scale metabolic networks. FCA allows detecting dependencies between reaction fluxes of metabolic networks at steady-state. On the one hand, this can help in the curation of reconstructed metabolic networks by verifying whether the coupling between reactions is in agreement with the experimental findings. On the other hand, FCA can aid in defining intervention strategies to knock out target reactions.
Results: We present a new method F2C2 for FCA, which is orders of magnitude faster than previous approaches. As a consequence, FCA of genome-scale metabolic networks can now be performed in a routine manner.
Conclusions: We propose F2C2 as a fast tool for the computation of flux coupling in genome-scale metabolic networks. F2C2 is freely available for non-commercial use at https://sourceforge.net/projects/f2c2/files/.