publish.UP Search

Young Genes out of the Male: An Insight from Evolutionary Age Analysis of the Pollen Transcriptome (2015)

Cui, Xiao ; Lv, Yang ; Chen, Miaolin ; Nikoloski, Zoran ; Twell, David ; Zhang, Dabing

The birth of new genes in genomes is an important evolutionary event. Several studies reveal that new genes in animals tend to be preferentially expressed in male reproductive tissues such as testis (Betran et al., 2002; Begun et al., 2007; Dubruille et al., 2012), and thus an "out of testis" hypothesis for the emergence of new genes has been proposed (Vinckenbosch et al., 2006; Kaessmann, 2010). However, such phenomena have not been examined in plant species. Here, by employing a phylostratigraphic method, we dated the origin of protein-coding genes in rice and Arabidopsis thaliana and observed a number of young genes in both species. These young genes tend to encode short extracellular proteins, which may be involved in rapidly evolving processes, such as reproductive barriers, species specification, and antimicrobial processes. Further analysis of transcriptome age indexes across different tissues revealed that male reproductive cells express a phylogenetically younger transcriptome than other plant tissues. Compared with sporophytic tissues, the young transcriptomes of the male gametophyte displayed greater complexity and diversity, which included a higher ratio of anti-sense and inter-genic transcripts, reflecting a pervasive transcription state that facilitated the emergence of new genes. Here, we propose that pollen may act as an "innovation incubator" for the birth of de novo genes. With cases of male-biased expression of young genes reported in animals, the "new genes out of the male" model revealed a common evolutionary force that drives reproductive barriers, species specification, and the upgrading of defensive mechanisms against pathogens.
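
The transcriptome age index mentioned in this abstract is commonly defined as the expression-weighted mean of gene phylostrata, TAI = sum(ps_i * e_i) / sum(e_i), where larger phylostratum numbers denote younger genes. The sketch below illustrates that general definition only. The function name, the phylostratum numbering, and all values are hypothetical toy data, not taken from the study.

```python
def transcriptome_age_index(phylostrata, expression):
    """Expression-weighted mean phylostratum of one sample.

    phylostrata: one integer rank per gene (assumed convention:
                 1 = oldest, larger = younger).
    expression:  non-negative expression level per gene, same order.
    """
    if len(phylostrata) != len(expression):
        raise ValueError("phylostrata and expression must have equal length")
    total = sum(expression)
    if total == 0:
        raise ValueError("total expression must be positive")
    weighted = sum(ps * e for ps, e in zip(phylostrata, expression))
    return weighted / total


# Hypothetical toy data: four genes, two tissues.
phylostrata = [1, 1, 5, 12]
pollen = [10.0, 10.0, 20.0, 60.0]  # young gene (rank 12) dominates
leaf = [40.0, 40.0, 15.0, 5.0]     # old genes (rank 1) dominate

tai_pollen = transcriptome_age_index(phylostrata, pollen)
tai_leaf = transcriptome_age_index(phylostrata, leaf)
print(tai_pollen, tai_leaf)
```

Under this convention a higher index means a phylogenetically younger transcriptome, so the toy pollen sample scores higher than the toy leaf sample.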

Unraveling lipid metabolism in maize with time-resolved multi-omics data (2018)

de Abreu e Lima, Francisco Anastacio ; Li, Kun ; Wen, Weiwei ; Yan, Jianbing ; Nikoloski, Zoran ; Willmitzer, Lothar ; Brotman, Yariv

Maize is the cereal crop with the highest production worldwide, and its oil is a key energy resource. Improving the quantity and quality of maize oil requires a better understanding of lipid metabolism. To predict the function of maize genes involved in lipid biosynthesis, we assembled transcriptomic and lipidomic data sets from leaves of B73 and the high-oil line By804 in two distinct time-series experiments. The integrative analysis based on high-dimensional regularized regression yielded lipid-transcript associations indirectly validated by Gene Ontology and promoter motif enrichment analyses. The co-localization of lipid-transcript associations using the genetic mapping of lipid traits in leaves and seedlings of a B73 x By804 recombinant inbred line population uncovered 323 genes involved in the metabolism of phospholipids, galactolipids, sulfolipids and glycerolipids. The resulting association network further supported the involvement of 50 gene candidates in modulating levels of representatives from multiple acyl-lipid classes. Therefore, the proposed approach provides high-confidence candidates for experimental testing in maize and model plant species.

Unraveling gene regulatory networks from time-resolved gene expression data - a measures comparison study (2011)

Hempel, Sabrina ; Koseska, Aneta ; Nikoloski, Zoran ; Kurths, Jürgen

Background: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications. Results: Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study. 
Conclusions: Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices.
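
As a rough illustration of the relevance network approach with a rank-based similarity measure, the sketch below scores every gene pair by Spearman rank correlation and keeps pairs whose absolute score reaches a threshold. It is not one of the 21 measures or 6 scoring schemes as implemented in the study. The gene names, time series, and threshold are hypothetical, and this symmetric measure does not recover the direction of an interaction.

```python
from itertools import combinations
from math import sqrt


def ranks(values):
    """Return 1-based ranks, assigning tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def relevance_network(series, threshold):
    """Keep undirected edges whose |Spearman rho| is at least threshold."""
    edges = []
    for a, b in combinations(sorted(series), 2):
        rho = spearman(series[a], series[b])
        if abs(rho) >= threshold:
            edges.append((a, b, rho))
    return edges


# Hypothetical short time series (five time points per gene).
series = {
    "geneA": [1.0, 2.0, 4.0, 8.0, 16.0],
    "geneB": [0.5, 0.9, 1.1, 3.0, 3.2],
    "geneC": [5.0, 1.0, 4.0, 2.0, 3.0],
}
edges = relevance_network(series, threshold=0.9)
print(edges)
```

Because only ranks enter the score, the monotone but nonlinear relation between the toy series geneA and geneB is still detected, while geneC is left unconnected.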

Unraveling gene regulatory networks from time-resolved gene expression data (2017)

Hempel, Sabrina ; Koseska, Aneta ; Nikoloski, Zoran ; Kurths, Jürgen

Background: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications. Results: Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study. 
Conclusions: Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices.

The Rice Actin-Binding Protein RMD Regulates Light-Dependent Shoot Gravitropism (2019)

Song, Yu ; Li, Gang ; Nowak, Jacqueline ; Zhang, Xiaoqing ; Xu, Dongbei ; Yang, Xiujuan ; Huang, Guoqiang ; Liang, Wanqi ; Yang, Litao ; Wang, Canhua ; Bulone, Vincent ; Nikoloski, Zoran ; Hu, Jianping ; Persson, Staffan ; Zhang, Dabing

Light and gravity are two key determinants in orientating plant stems for proper growth and development. The organization and dynamics of the actin cytoskeleton are essential for cell biology and critically regulated by actin-binding proteins. However, the role of actin cytoskeleton in shoot negative gravitropism remains controversial. In this work, we report that the actin-binding protein Rice Morphology Determinant (RMD) promotes reorganization of the actin cytoskeleton in rice (Oryza sativa) shoots. The changes in actin organization are associated with the ability of the rice shoots to respond to negative gravitropism. Here, light-grown rmd mutant shoots exhibited agravitropic phenotypes. By contrast, etiolated rmd shoots displayed normal negative shoot gravitropism. Furthermore, we show that RMD maintains an actin configuration that promotes statolith mobility in gravisensing endodermal cells, and for proper auxin distribution in light-grown, but not dark-grown, shoots. RMD gene expression is diurnally controlled and directly repressed by the phytochrome-interacting factor-like protein OsPIL16. Consequently, overexpression of OsPIL16 led to gravisensing and actin patterning defects that phenocopied the rmd mutant. Our findings outline a mechanism that links light signaling and gravity perception for straight shoot growth in rice.

The hidden simplicity of metabolic networks is revealed by multireaction dependencies (2022)

Küken, Anika ; Langary, Damoun ; Nikoloski, Zoran

Understanding the complexity of metabolic networks has implications for manipulation of their functions. The complexity of metabolic networks can be characterized by identifying multireaction dependencies that are challenging to determine due to the sheer number of combinations to consider. Here, we propose the concept of concordant complexes that captures multireaction dependencies and can be efficiently determined from the algebraic structure and operational constraints of metabolic networks. The concordant complexes imply the existence of concordance modules based on which the apparent complexity of 12 metabolic networks of organisms from all kingdoms of life can be reduced by at least 78%. A comparative analysis against an ensemble of randomized metabolic networks shows that the metabolic network of Escherichia coli contains fewer concordance modules and is, therefore, more tightly coordinated than expected by chance. Together, our findings demonstrate that metabolic networks are considerably simpler than what can be perceived from their structure alone.

Systems analysis of the response of photosynthesis, metabolism, and growth to an increase in irradiance in the photosynthetic model organism Chlamydomonas reinhardtii (2014)

We investigated the systems response of metabolism and growth after an increase in irradiance in the nonsaturating range in the algal model Chlamydomonas reinhardtii. In a three-step process, photosynthesis and the levels of metabolites increased immediately, growth increased after 10 to 15 min, and transcript and protein abundance responded by 40 and 120 to 240 min, respectively. In the first phase, starch and metabolites provided a transient buffer for carbon until growth increased. This uncouples photosynthesis from growth in a fluctuating light environment. In the first and second phases, rising metabolite levels and increased polysome loading drove an increase in fluxes. Most Calvin-Benson cycle (CBC) enzymes were substrate-limited in vivo, and strikingly, many were present at higher concentrations than their substrates, explaining how rising metabolite levels stimulate CBC flux. Rubisco, fructose-1,6-bisphosphatase, and sedoheptulose-1,7-bisphosphatase were close to substrate saturation in vivo, and flux was increased by posttranslational activation. In the third phase, changes in abundance of particular proteins, including increases in plastidial ATP synthase and some CBC enzymes, relieved potential bottlenecks and readjusted protein allocation between different processes. Despite reasonable overall agreement between changes in transcript and protein abundance (R² = 0.24), many proteins, including those in photosynthesis, changed independently of transcript abundance.

System-wide organization of actin cytoskeleton determines organelle transport in hypocotyl plant cells (2017)

Breuer, David ; Nowak, Jacqueline ; Ivakov, Alexander ; Somssich, Marc ; Persson, Staffan ; Nikoloski, Zoran

The actin cytoskeleton is an essential intracellular filamentous structure that underpins cellular transport and cytoplasmic streaming in plant cells. However, the system-level properties of actin-based cellular trafficking remain tenuous, largely due to the inability to quantify key features of the actin cytoskeleton. Here, we developed an automated image-based, network-driven framework to accurately segment and quantify actin cytoskeletal structures and Golgi transport. We show that the actin cytoskeleton in both growing and elongated hypocotyl cells has structural properties facilitating efficient transport. Our findings suggest that the erratic movement of Golgi is a stable cellular phenomenon that might optimize distribution efficiency of cell material. Moreover, we demonstrate that Golgi transport in hypocotyl cells can be accurately predicted from the actin network topology alone. Thus, our framework provides quantitative evidence for system-wide coordination of cellular transport in plant cells and can be readily applied to investigate cytoskeletal organization and transport in other organisms.

Supervised learning of gene-regulatory networks based on graph distance profiles of transcriptomics data (2020)

Razaghi-Moghadam, Zahra ; Nikoloski, Zoran

Characterisation of gene-regulatory network (GRN) interactions provides a stepping stone to understanding how genes affect cellular phenotypes. Yet, despite advances in profiling technologies, GRN reconstruction from gene expression data remains a pressing problem in systems biology. Here, we devise a supervised learning approach, GRADIS, which utilises support vector machine to reconstruct GRNs based on distance profiles obtained from a graph representation of transcriptomics data. By employing the data from Escherichia coli and Saccharomyces cerevisiae as well as synthetic networks from the DREAM4 and five network inference challenges, we demonstrate that our GRADIS approach outperforms the state-of-the-art supervised and unsupervised approaches. This holds when predictions about target genes for individual transcription factors as well as for the entire network are considered. We employ experimentally verified GRNs from E. coli and S. cerevisiae to validate the predictions and obtain further insights in the performance of the proposed approach. Our GRADIS approach offers the possibility for usage of other network-based representations of large-scale data, and can be readily extended to help the characterisation of other cellular networks, including protein-protein and protein-metabolite interactions.

Supervised learning of gene regulatory networks (2020)

Razaghi-Moghadam, Zahra ; Nikoloski, Zoran

Identifying the entirety of gene regulatory interactions in a biological system offers the possibility to determine the key molecular factors that affect important traits on the level of cells, tissues, and whole organisms. Despite the development of experimental approaches and technologies for identification of direct binding of transcription factors (TFs) to promoter regions of downstream target genes, computational approaches that utilize large compendia of transcriptomics data are still the predominant methods used to predict direct downstream targets of TFs, and thus reconstruct genome-wide gene-regulatory networks (GRNs). These approaches can broadly be categorized into unsupervised and supervised, based on whether data about known, experimentally verified gene-regulatory interactions are used in the process of reconstructing the underlying GRN. Here, we first describe the generic steps of supervised approaches for GRN reconstruction, since they have been recently shown to result in improved accuracy of the resulting networks. We also illustrate how they can be used with data from model organisms to obtain more accurate prediction of gene regulatory interactions.

Stoichiometric Correlation Analysis: Principles of Metabolic Functionality from Metabolomics Data (2017)

Schwahn, Kevin ; Beleggia, Romina ; Omranian, Nooshin ; Nikoloski, Zoran

Recent advances in metabolomics technologies have resulted in high-quality (time-resolved) metabolic profiles with an increasing coverage of metabolic pathways. These data profiles represent read-outs from often non-linear dynamics of metabolic networks. Yet, metabolic profiles have largely been explored with regression-based approaches that only capture linear relationships, rendering it difficult to determine the extent to which the data reflect the underlying reaction rates and their couplings. Here we propose an approach termed Stoichiometric Correlation Analysis (SCA) based on correlation between positive linear combinations of log-transformed metabolic profiles. The log-transformation is due to the evidence that metabolic networks can be modeled by mass action law and kinetics derived from it. Unlike the existing approaches which establish a relation between pairs of metabolites, SCA facilitates the discovery of higher-order dependence between more than two metabolites. By using a paradigmatic model of the tricarboxylic acid cycle we show that the higher-order dependence reflects the coupling of concentration of reactant complexes, capturing the subtle difference between the employed enzyme kinetics. Using time-resolved metabolic profiles from Arabidopsis thaliana and Escherichia coli, we show that SCA can be used to quantify the difference in coupling of reactant complexes, and hence, reaction rates, underlying the stringent response in these model organisms. By using SCA with data from natural variation of wild and domesticated wheat and tomato accessions, we demonstrate that domestication is accompanied by loss of such couplings in these species. Therefore, application of SCA to metabolomics data from natural variation in wild and domesticated populations provides a mechanistic way to understanding domestication and its relation to metabolic networks.

Stoichiometric capacitance reveals the theoretical capabilities of metabolic networks (2012)

Larhlimi, Abdelhalim ; Basler, Georg ; Grimbs, Sergio ; Selbig, Joachim ; Nikoloski, Zoran

Motivation: Metabolic engineering aims at modulating the capabilities of metabolic networks by changing the activity of biochemical reactions. The existing constraint-based approaches for metabolic engineering have proven useful, but are limited only to reactions catalogued in various pathway databases. Results: We consider the alternative of designing synthetic strategies which can be used not only to characterize the maximum theoretically possible product yield but also to engineer networks with optimal conversion capability by using a suitable biochemically feasible reaction called 'stoichiometric capacitance'. In addition, we provide a theoretical solution for decomposing a given stoichiometric capacitance over a set of known enzymatic reactions. We determine the stoichiometric capacitance for genome-scale metabolic networks of 10 organisms from different kingdoms of life and examine its implications for the alterations in flux variability patterns. Our empirical findings suggest that the theoretical capacity of metabolic networks comes at a cost of dramatic system's changes.

Stability of metabolic correlations under changing environmental conditions in Escherichia coli: a systems approach (2009)

Szymanski, Jedrzej ; Jozefczuk, Szymon ; Nikoloski, Zoran ; Selbig, Joachim ; Nikiforova, Victoria ; Catchpole, Gareth ; Willmitzer, Lothar

Background: Biological systems adapt to changing environments by reorganizing their cellular and physiological program with metabolites representing one important response level. Different stresses lead to both conserved and specific responses on the metabolite level which should be reflected in the underlying metabolic network. Methodology/Principal Findings: Starting from experimental data obtained by a GC-MS based high-throughput metabolic profiling technology we here develop an approach that: (1) extracts network representations from metabolic condition-dependent data by using pairwise correlations, (2) determines the sets of stable and condition-dependent correlations based on a combination of statistical significance and homogeneity tests, and (3) can identify metabolites related to the stress response, which goes beyond simple observations about the changes of metabolic concentrations. The approach was tested with Escherichia coli as a model organism observed under four different environmental stress conditions (cold stress, heat stress, oxidative stress, lactose diauxie) and control unperturbed conditions. By constructing the stable network component, which displays a scale free topology and small-world characteristics, we demonstrated that: (1) metabolite hubs in the reconstructed correlation networks are significantly enriched for those contained in biochemical networks such as EcoCyc, (2) particular components of the stable network are enriched for functionally related biochemical pathways, and (3) independently of the response scale, based on their importance in the reorganization of the correlation network a set of metabolites can be identified which represent hypothetical candidates for adjusting to a stress-specific response. Conclusions/Significance: Network-based tools allowed the identification of stress-dependent and general metabolic correlation networks. This correlation-network-based approach does not rely on major changes in concentration to identify metabolites important for stress adaptation, but rather on the changes in network properties with respect to metabolites. This should represent a useful complementary technique in addition to more classical approaches.

Spatiotemporal dynamics of the Calvin cycle: multistationarity and symmetry breaking instabilities (2011)

Grimbs, Sergio ; Arnold, Anne ; Koseska, Aneta ; Kurths, Jürgen ; Selbig, Joachim ; Nikoloski, Zoran

The possibility of controlling the Calvin cycle has paramount implications for increasing the production of biomass. Multistationarity, as a dynamical feature of systems, is the first obvious candidate whose control could find biotechnological applications. Here we set out to resolve the debate on the multistationarity of the Calvin cycle. Unlike the existing simulation-based studies, our approach is based on a sound mathematical framework, chemical reaction network theory and algebraic geometry, which results in provable results for the investigated model of the Calvin cycle in which we embed a hierarchy of realistic kinetic laws. Our theoretical findings demonstrate that there is a possibility for multistationarity resulting from two sources, homogeneous and inhomogeneous instabilities, which partially settle the debate on multistability of the Calvin cycle. In addition, our tractable analytical treatment of the bifurcation parameters can be employed in the design of validation experiments.

Segmentation of biological multivariate time-series data (2015)

Omranian, Nooshin ; Müller-Röber, Bernd ; Nikoloski, Zoran

Time-series data from multicomponent systems capture the dynamics of the ongoing processes and reflect the interactions between the components. The progression of processes in such systems usually involves check-points and events at which the relationships between the components are altered in response to stimuli. Detecting these events together with the implicated components can help understand the temporal aspects of complex biological systems. Here we propose a regularized regression-based approach for identifying breakpoints and corresponding segments from multivariate time-series data. In combination with techniques from clustering, the approach also allows estimating the significance of the determined breakpoints as well as the key components implicated in the emergence of the breakpoints. Comparative analysis with the existing alternatives demonstrates the power of the approach to identify biologically meaningful breakpoints in diverse time-resolved transcriptomics data sets from the yeast Saccharomyces cerevisiae and the diatom Thalassiosira pseudonana.

Robustness of metabolic networks: a review of existing definitions (2011)

Larhlimi, Abdelhalim ; Blachon, Sylvain ; Selbig, Joachim ; Nikoloski, Zoran

Describing the determinants of robustness of biological systems has become one of the central questions in systems biology. Despite the increasing research efforts, it has proven difficult to arrive at a unifying definition for this important concept. We argue that this is due to the multifaceted nature of the concept of robustness and the possibility to formally capture it at different levels of systemic formalisms (e.g., topology and dynamic behavior). Here we provide a comprehensive review of the existing definitions of robustness pertaining to metabolic networks. As kinetic approaches have been excellently reviewed elsewhere, we focus on definitions of robustness proposed within graph-theoretic and constraint-based formalisms.

Revisiting ancestral polyploidy in plants (2017)

Ruprecht, Colin ; Lohaus, Rolf ; Vanneste, Kevin ; Mutwil, Marek ; Nikoloski, Zoran ; Van de Peer, Yves ; Persson, Staffan

Whole-genome duplications (WGDs) or polyploidy events have been studied extensively in plants. In a now widely cited paper, Jiao et al. presented evidence for two ancient, ancestral plant WGDs predating the origin of flowering and seed plants, respectively. This finding was based primarily on a bimodal age distribution of gene duplication events obtained from molecular dating of almost 800 phylogenetic gene trees. We reanalyzed the phylogenomic data of Jiao et al. and found that the strong bimodality of the age distribution may be the result of technical and methodological issues and may hence not be a "true" signal of two WGD events. By using a state-of-the-art molecular dating algorithm, we demonstrate that the reported bimodal age distribution is not robust and should be interpreted with caution. Thus, there exists little evidence for two ancient WGDs in plants from phylogenomic dating.

Resolving the central metabolism of Arabidopsis guard cells (2017)

Robaina-Estevez, Semidan ; Daloso, Danilo M. ; Zhang, Youjun ; Fernie, Alisdair R. ; Nikoloski, Zoran

Photosynthesis and water use efficiency, key factors affecting plant growth, are directly controlled by microscopic and adjustable pores in the leaf, the stomata. The size of the pores is modulated by the guard cells, which rely on molecular mechanisms to sense and respond to environmental changes. It has been shown that the physiology of mesophyll and guard cells differs substantially. However, the implications of these differences to metabolism at a genome-scale level remain unclear. Here, we used constraint-based modeling to predict the differences in metabolic fluxes between the mesophyll and guard cells of Arabidopsis thaliana by exploring the space of fluxes that are most concordant to cell-type-specific transcript profiles. An independent 13C-labeling experiment using isolated mesophyll and guard cells was conducted and provided support for our predictions about the role of the Calvin-Benson cycle in sucrose synthesis in guard cells. The combination of in silico with in vivo analyses indicated that guard cells have higher anaplerotic CO2 fixation via phosphoenolpyruvate carboxylase, which was demonstrated to be an important source of malate. Beyond highlighting the metabolic differences between mesophyll and guard cells, our findings can be used in future integrated modeling of multicellular plant systems and their engineering towards improved growth.

Regression-based modeling of complex plant traits based on metabolomics data (2018)

de Abreu e Lima, Francisco Anastacio ; Leifels, Lydia ; Nikoloski, Zoran

Bridging metabolomics with plant phenotypic responses is challenging. Multivariate analyses account for the existing dependencies among metabolites, and regression models in particular capture such dependencies in search for association with a given trait. However, special care should be undertaken with metabolomics data. Here we propose a modeling workflow that considers all caveats imposed by such large data sets.

99 search hits