TY - JOUR A1 - Schudoma, Christian A1 - Larhlimi, Abdelhalim A1 - Walther, Dirk T1 - The influence of the local sequence environment on RNA loop structures JF - RNA : a publication of the RNA Society N2 - RNA folding is assumed to be a hierarchical process. The secondary structure of an RNA molecule, signified by base-pairing and stacking interactions between the paired bases, is formed first. Subsequently, the RNA molecule adopts an energetically favorable three-dimensional conformation in the structural space determined mainly by the rotational degrees of freedom associated with the backbone of regions of unpaired nucleotides (loops). To what extent the backbone conformation of RNA loops also results from interactions within the local sequence context or rather follows global optimization constraints alone has not been addressed yet. Because the majority of base stacking interactions are exerted locally, a critical influence of local sequence on local structure appears plausible. Thus, local loop structure ought to be predictable, at least in part, from the local sequence context alone. To test this hypothesis, we used Random Forests on a nonredundant data set of unpaired nucleotides extracted from 97 X-ray structures from the Protein Data Bank (PDB) to predict discrete backbone angle conformations given by the discretized eta/theta-pseudo-torsional space. Predictions on balanced sets with four to six conformational classes using local sequence information yielded average accuracies of up to 55%, thus significantly better than expected by chance (17%-25%). Bases close to the central nucleotide appear to be most tightly linked to its conformation. Our results suggest that RNA loop structure does not only depend on long-range base-pairing interactions; instead, it appears that local sequence context exerts a significant influence on the formation of the local loop structure. KW - RNA KW - 3D structure KW - structure prediction KW - Random Forests KW - machine learning KW - backbone conformation Y1 - 2011 U6 - https://doi.org/10.1261/rna.2550211 SN - 1355-8382 VL - 17 IS - 7 SP - 1247 EP - 1257 PB - Cold Spring Harbor Laboratory Press CY - Cold Spring Harbor, NY ER - TY - GEN A1 - Durek, Pawel A1 - Schudoma, Christian A1 - Weckwerth, Wolfram A1 - Selbig, Joachim A1 - Walther, Dirk T1 - Detection and characterization of 3D-signature phosphorylation site motifs and their contribution towards improved phosphorylation site prediction in proteins N2 - Background: Phosphorylation of proteins plays a crucial role in the regulation and activation of metabolic and signaling pathways and constitutes an important target for pharmaceutical intervention. Central to the phosphorylation process is the recognition of specific target sites by protein kinases followed by the covalent attachment of phosphate groups to the amino acids serine, threonine, or tyrosine. The experimental identification as well as computational prediction of phosphorylation sites (P-sites) has proved to be a challenging problem. Computational methods have focused primarily on extracting predictive features from the local, one-dimensional sequence information surrounding phosphorylation sites. Results: We characterized the spatial context of phosphorylation sites and assessed its usability for improved phosphorylation site predictions. We identified 750 non-redundant, experimentally verified sites with three-dimensional (3D) structural information available in the protein data bank (PDB) and grouped them according to their respective kinase family. We studied the spatial distribution of amino acids around phosphorserines, phosphothreonines, and phosphotyrosines to extract signature 3D-profiles. Characteristic spatial distributions of amino acid residue types around phosphorylation sites were indeed discernable, especially when kinase-family-specific target sites were analyzed. To test the added value of using spatial information for the computational prediction of phosphorylation sites, Support Vector Machines were applied using both sequence as well as structural information. When compared to sequence-only based prediction methods, a small but consistent performance improvement was obtained when the prediction was informed by 3D-context information. Conclusion: While local one-dimensional amino acid sequence information was observed to harbor most of the discriminatory power, spatial context information was identified as relevant for the recognition of kinases and their cognate target sites and can be used for an improved prediction of phosphorylation sites. A web-based service (Phos3D) implementing the developed structurebased P-site prediction method has been made available at http://phos3d.mpimp-golm.mpg.de. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - paper 141 KW - Support vector machines KW - Microarray data KW - Docking interactions KW - Signal-transduction KW - Sequence alignment Y1 - 2009 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-45129 ER - TY - THES A1 - Schudoma, Christian T1 - Bioinformatic approaches to sequence-structure relationships in RNA loops Y1 - 2011 CY - Potsdam ER - TY - JOUR A1 - Sprenger, Heike A1 - Rudack, Katharina A1 - Schudoma, Christian A1 - Neumann, Arne A1 - Seddig, Sylvia A1 - Peters, Rolf A1 - Zuther, Ellen A1 - Kopka, Joachim A1 - Hincha, Dirk K. A1 - Walther, Dirk A1 - Koehl, Karin T1 - Assessment of drought tolerance and its potential yield penalty in potato JF - Functional plant biology : an international journal of plant function N2 - Climate models predict an increased likelihood of seasonal droughts for many areas of the world. Breeding for drought tolerance could be accelerated by marker-assisted selection. As a basis for marker identification, we studied the genetic variance, predictability of field performance and potential costs of tolerance in potato (Solanum tuberosum L.). Potato produces high calories per unit of water invested, but is drought-sensitive. In 14 independent pot or field trials, 34 potato cultivars were grown under optimal and reduced water supply to determine starch yield. In an artificial dataset, we tested several stress indices for their power to distinguish tolerant and sensitive genotypes independent of their yield potential. We identified the deviation of relative starch yield from the experimental median (DRYM) as the most efficient index. DRYM corresponded qualitatively to the partial least square model-based metric of drought stress tolerance in a stress effect model. The DRYM identified significant tolerance variation in the European potato cultivar population to allow tolerance breeding and marker identification. Tolerance results from pot trials correlated with those from field trials but predicted field performance worse than field growth parameters. Drought tolerance correlated negatively with yield under optimal conditions in the field. The distribution of yield data versus DRYM indicated that tolerance can be combined with average yield potentials, thus circumventing potential yield penalties in tolerance breeding. KW - performance prediction KW - Solanum tuberosum KW - tolerance index KW - target environment Y1 - 2015 U6 - https://doi.org/10.1071/FP15013 SN - 1445-4408 SN - 1445-4416 VL - 42 IS - 7 SP - 655 EP - 667 PB - CSIRO CY - Clayton ER -