publish.UP Search

Young Genes out of the Male: An Insight from Evolutionary Age Analysis of the Pollen Transcriptome (2015)

Cui, Xiao ; Lv, Yang ; Chen, Miaolin ; Nikoloski, Zoran ; Twell, David ; Zhang, Dabing

The birth of new genes in genomes is an important evolutionary event. Several studies reveal that new genes in animals tend to be preferentially expressed in male reproductive tissues such as testis (Betran et al., 2002; Begun et al., 2007; Dubruille et al., 2012), and thus an "out of testis" hypothesis for the emergence of new genes has been proposed (Vinckenbosch et al., 2006; Kaessmann, 2010). However, such phenomena have not been examined in plant species. Here, by employing a phylostratigraphic method, we dated the origin of protein-coding genes in rice and Arabidopsis thaliana and observed a number of young genes in both species. These young genes tend to encode short extracellular proteins, which may be involved in rapidly evolving processes, such as reproductive barriers, species specification, and antimicrobial processes. Further analysis of transcriptome age indexes across different tissues revealed that male reproductive cells express a phylogenetically younger transcriptome than other plant tissues. Compared with sporophytic tissues, the young transcriptomes of the male gametophyte displayed greater complexity and diversity, which included a higher ratio of anti-sense and inter-genic transcripts, reflecting a pervasive transcription state that facilitated the emergence of new genes. Here, we propose that pollen may act as an "innovation incubator" for the birth of de novo genes. With cases of male-biased expression of young genes reported in animals, the "new genes out of the male" model revealed a common evolutionary force that drives reproductive barriers, species specification, and the upgrading of defensive mechanisms against pathogens.
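
For readers unfamiliar with the transcriptome age index mentioned in this abstract, the following is a minimal sketch. It assumes the commonly used definition, namely the expression-weighted mean of gene phylostrata, with larger phylostratum numbers denoting younger genes. The function name, the phylostratum assignments, and the expression values are hypothetical and are not taken from the study.

```python
def transcriptome_age_index(phylostrata, expression):
    """Expression-weighted mean phylostratum of one sample.

    phylostrata: phylostratum rank per gene (1 = oldest; larger = younger)
    expression:  non-negative expression level per gene, in the same order
    """
    if len(phylostrata) != len(expression):
        raise ValueError("inputs must have the same length")
    total = sum(expression)
    if total <= 0:
        raise ValueError("total expression must be positive")
    return sum(ps * e for ps, e in zip(phylostrata, expression)) / total


# Hypothetical toy data: three old genes followed by two young genes.
ps = [1, 1, 2, 10, 12]
leaf = [50.0, 30.0, 15.0, 4.0, 1.0]      # young genes weakly expressed
pollen = [20.0, 10.0, 10.0, 30.0, 30.0]  # young genes strongly expressed

tai_leaf = transcriptome_age_index(ps, leaf)
tai_pollen = transcriptome_age_index(ps, pollen)
```

Under this definition a higher index means that younger genes contribute more of the expressed transcriptome, so the toy pollen sample scores higher than the toy leaf sample.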

Unraveling lipid metabolism in maize with time-resolved multi-omics data (2018)

de Abreu e Lima, Francisco Anastacio ; Li, Kun ; Wen, Weiwei ; Yan, Jianbing ; Nikoloski, Zoran ; Willmitzer, Lothar ; Brotman, Yariv

Maize is the cereal crop with the highest production worldwide, and its oil is a key energy resource. Improving the quantity and quality of maize oil requires a better understanding of lipid metabolism. To predict the function of maize genes involved in lipid biosynthesis, we assembled transcriptomic and lipidomic data sets from leaves of B73 and the high-oil line By804 in two distinct time-series experiments. The integrative analysis based on high-dimensional regularized regression yielded lipid-transcript associations indirectly validated by Gene Ontology and promoter motif enrichment analyses. The co-localization of lipid-transcript associations using the genetic mapping of lipid traits in leaves and seedlings of a B73 x By804 recombinant inbred line population uncovered 323 genes involved in the metabolism of phospholipids, galactolipids, sulfolipids and glycerolipids. The resulting association network further supported the involvement of 50 gene candidates in modulating levels of representatives from multiple acyl-lipid classes. Therefore, the proposed approach provides high-confidence candidates for experimental testing in maize and model plant species.

Unraveling gene regulatory networks from time-resolved gene expression data - a measures comparison study (2011)

Hempel, Sabrina ; Koseska, Aneta ; Nikoloski, Zoran ; Kurths, Jürgen

Background: Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications.
Results: Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has been shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study.
Conclusions: Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices.
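
To illustrate the relevance network idea with a rank-based measure, here is a minimal sketch that scores every gene pair by Spearman rank correlation and keeps pairs whose absolute score reaches a threshold. It is an assumption-laden illustration, not one of the 21 measures or 6 scoring schemes evaluated in the study. The resulting edges are undirected, the asymmetric weighting scheme is not reproduced, and the gene names, time series, and threshold are hypothetical.

```python
from itertools import combinations


def ranks(values):
    """Return 1-based ranks; tied values receive their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def spearman(x, y):
    """Spearman correlation, computed as Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def relevance_network(series, threshold):
    """Return (gene_a, gene_b, score) for pairs with |score| >= threshold."""
    edges = []
    for a, b in combinations(sorted(series), 2):
        score = spearman(series[a], series[b])
        if abs(score) >= threshold:
            edges.append((a, b, score))
    return edges


# Hypothetical short time series (five time points per gene).
series = {
    "geneA": [0.1, 0.4, 0.9, 1.6, 2.5],
    "geneB": [1.0, 1.1, 1.5, 2.0, 4.0],  # rises monotonically with geneA
    "geneC": [2.0, 0.5, 1.8, 0.4, 1.9],  # no consistent trend
}
edges = relevance_network(series, threshold=0.9)
```

Because the measure depends only on ranks, it is insensitive to monotone rescaling of the expression values, which is one reason rank-based scores can be attractive for short, noisy series.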

The Rice Actin-Binding Protein RMD Regulates Light-Dependent Shoot Gravitropism (2019)

Song, Yu ; Li, Gang ; Nowak, Jacqueline ; Zhang, Xiaoqing ; Xu, Dongbei ; Yang, Xiujuan ; Huang, Guoqiang ; Liang, Wanqi ; Yang, Litao ; Wang, Canhua ; Bulone, Vincent ; Nikoloski, Zoran ; Hu, Jianping ; Persson, Staffan ; Zhang, Dabing

Light and gravity are two key determinants in orientating plant stems for proper growth and development. The organization and dynamics of the actin cytoskeleton are essential for cell biology and critically regulated by actin-binding proteins. However, the role of actin cytoskeleton in shoot negative gravitropism remains controversial. In this work, we report that the actin-binding protein Rice Morphology Determinant (RMD) promotes reorganization of the actin cytoskeleton in rice (Oryza sativa) shoots. The changes in actin organization are associated with the ability of the rice shoots to respond to negative gravitropism. Here, light-grown rmd mutant shoots exhibited agravitropic phenotypes. By contrast, etiolated rmd shoots displayed normal negative shoot gravitropism. Furthermore, we show that RMD maintains an actin configuration that promotes statolith mobility in gravisensing endodermal cells, and for proper auxin distribution in light-grown, but not dark-grown, shoots. RMD gene expression is diurnally controlled and directly repressed by the phytochrome-interacting factor-like protein OsPIL16. Consequently, overexpression of OsPIL16 led to gravisensing and actin patterning defects that phenocopied the rmd mutant. Our findings outline a mechanism that links light signaling and gravity perception for straight shoot growth in rice.

The hidden simplicity of metabolic networks is revealed by multireaction dependencies (2022)

Küken, Anika ; Langary, Damoun ; Nikoloski, Zoran

Understanding the complexity of metabolic networks has implications for manipulation of their functions. The complexity of metabolic networks can be characterized by identifying multireaction dependencies that are challenging to determine due to the sheer number of combinations to consider. Here, we propose the concept of concordant complexes that captures multireaction dependencies and can be efficiently determined from the algebraic structure and operational constraints of metabolic networks. The concordant complexes imply the existence of concordance modules based on which the apparent complexity of 12 metabolic networks of organisms from all kingdoms of life can be reduced by at least 78%. A comparative analysis against an ensemble of randomized metabolic networks shows that the metabolic network of Escherichia coli contains fewer concordance modules and is, therefore, more tightly coordinated than expected by chance. Together, our findings demonstrate that metabolic networks are considerably simpler than what can be perceived from their structure alone.

Systems analysis of the response of photosynthesis, metabolism, and growth to an increase in irradiance in the photosynthetic model organism Chlamydomonas reinhardtii (2014)

We investigated the systems response of metabolism and growth after an increase in irradiance in the nonsaturating range in the algal model Chlamydomonas reinhardtii. In a three-step process, photosynthesis and the levels of metabolites increased immediately, growth increased after 10 to 15 min, and transcript and protein abundance responded by 40 and 120 to 240 min, respectively. In the first phase, starch and metabolites provided a transient buffer for carbon until growth increased. This uncouples photosynthesis from growth in a fluctuating light environment. In the first and second phases, rising metabolite levels and increased polysome loading drove an increase in fluxes. Most Calvin-Benson cycle (CBC) enzymes were substrate-limited in vivo, and strikingly, many were present at higher concentrations than their substrates, explaining how rising metabolite levels stimulate CBC flux. Rubisco, fructose-1,6-bisphosphatase, and sedoheptulose-1,7-bisphosphatase were close to substrate saturation in vivo, and flux was increased by posttranslational activation. In the third phase, changes in abundance of particular proteins, including increases in plastidial ATP synthase and some CBC enzymes, relieved potential bottlenecks and readjusted protein allocation between different processes. Despite reasonable overall agreement between changes in transcript and protein abundance (R² = 0.24), many proteins, including those in photosynthesis, changed independently of transcript abundance.

System-wide organization of actin cytoskeleton determines organelle transport in hypocotyl plant cells (2017)

Breuer, David ; Nowak, Jacqueline ; Ivakov, Alexander ; Somssich, Marc ; Persson, Staffan ; Nikoloski, Zoran

The actin cytoskeleton is an essential intracellular filamentous structure that underpins cellular transport and cytoplasmic streaming in plant cells. However, the system-level properties of actin-based cellular trafficking remain tenuous, largely due to the inability to quantify key features of the actin cytoskeleton. Here, we developed an automated image-based, network-driven framework to accurately segment and quantify actin cytoskeletal structures and Golgi transport. We show that the actin cytoskeleton in both growing and elongated hypocotyl cells has structural properties facilitating efficient transport. Our findings suggest that the erratic movement of Golgi is a stable cellular phenomenon that might optimize distribution efficiency of cell material. Moreover, we demonstrate that Golgi transport in hypocotyl cells can be accurately predicted from the actin network topology alone. Thus, our framework provides quantitative evidence for system-wide coordination of cellular transport in plant cells and can be readily applied to investigate cytoskeletal organization and transport in other organisms.

Supervised learning of gene-regulatory networks based on graph distance profiles of transcriptomics data (2020)

Razaghi-Moghadam, Zahra ; Nikoloski, Zoran

Characterisation of gene-regulatory network (GRN) interactions provides a stepping stone to understanding how genes affect cellular phenotypes. Yet, despite advances in profiling technologies, GRN reconstruction from gene expression data remains a pressing problem in systems biology. Here, we devise a supervised learning approach, GRADIS, which utilises a support vector machine to reconstruct GRNs based on distance profiles obtained from a graph representation of transcriptomics data. By employing the data from Escherichia coli and Saccharomyces cerevisiae as well as synthetic networks from the DREAM4 and DREAM5 network inference challenges, we demonstrate that our GRADIS approach outperforms the state-of-the-art supervised and unsupervised approaches. This holds when predictions about target genes for individual transcription factors as well as for the entire network are considered. We employ experimentally verified GRNs from E. coli and S. cerevisiae to validate the predictions and obtain further insights into the performance of the proposed approach. Our GRADIS approach offers the possibility for usage of other network-based representations of large-scale data, and can be readily extended to help the characterisation of other cellular networks, including protein-protein and protein-metabolite interactions.

Supervised learning of gene regulatory networks (2020)

Razaghi-Moghadam, Zahra ; Nikoloski, Zoran

Identifying the entirety of gene regulatory interactions in a biological system offers the possibility to determine the key molecular factors that affect important traits on the level of cells, tissues, and whole organisms. Despite the development of experimental approaches and technologies for identification of direct binding of transcription factors (TFs) to promoter regions of downstream target genes, computational approaches that utilize large compendia of transcriptomics data are still the predominant methods used to predict direct downstream targets of TFs, and thus reconstruct genome-wide gene-regulatory networks (GRNs). These approaches can broadly be categorized into unsupervised and supervised, based on whether data about known, experimentally verified gene-regulatory interactions are used in the process of reconstructing the underlying GRN. Here, we first describe the generic steps of supervised approaches for GRN reconstruction, since they have been recently shown to result in improved accuracy of the resulting networks. We also illustrate how they can be used with data from model organisms to obtain more accurate prediction of gene regulatory interactions.
