Phenomic experiments are carried out in large-scale plant phenotyping facilities that acquire a large number of pictures of hundreds of plants simultaneously. With the aid of automated image processing, the data are converted into genotype-feature matrices that cover many consecutive days of development. Here, we explore the possibility of predicting the biomass of the fully grown plant from image-derived features recorded at early developmental stages. We performed phenomic experiments on 195 inbred and 382 hybrid maize varieties and followed their progress from 16 days after sowing (DAS) to 48 DAS with 129 image-derived features. By applying sparse regression methods, we show that 73% of the variance in hybrid fresh weight of fully grown plants is explained by about 20 features at the three-leaf stage or earlier. Dry weight prediction explained over 90% of the variance. When phenomic features of parental inbred lines were used as predictors of hybrid biomass, the proportion of variance explained was 42% and 45% for fresh weight and dry weight models consisting of 35 and 36 features, respectively. These models were very robust, showing only a small amount of variation in performance over the time scale of the experiment. We also examined mid-parent heterosis in phenomic features. Feature heterosis displayed a large degree of variance, which resulted in prediction performance that was less robust than that of models based on either parental or hybrid predictors. Our results show that phenomic prediction is a viable alternative to genomic and metabolic prediction of hybrid performance. In particular, the utility of early-stage parental lines is very encouraging. (C) 2016 Elsevier Ireland Ltd. All rights reserved.
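The sparse regression idea in the abstract above can be illustrated with a small sketch. The abstract does not state which solver, penalty value, or software was used, so everything here is an assumption for illustration: the LASSO is taken as one common sparse regression method, the data are synthetic (60 hypothetical plants, 10 hypothetical features, two of which truly drive the response), and the function names are invented for this example. The sketch fits the model by coordinate descent and reports which features are kept and the proportion of variance explained.

```python
import random


def soft_threshold(value, penalty):
    """Shrink value toward zero by penalty; return 0 inside [-penalty, penalty]."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0


def center(values):
    """Subtract the mean so that no intercept term is needed."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def lasso_coordinate_descent(X, y, penalty, n_sweeps=200):
    """Minimise (1/2n)*||y - Xw||^2 + penalty*||w||_1 by cyclic coordinate descent.

    X is a list of rows; columns and y are assumed to be centered.
    """
    n, p = len(X), len(X[0])
    weights = [0.0] * p
    residual = list(y)  # residual = y - Xw, with w = 0 initially
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of column j with the residual that excludes feature j.
            rho = sum(X[i][j] * (residual[i] + weights[j] * X[i][j])
                      for i in range(n)) / n
            new_weight = soft_threshold(rho, penalty) / col_sq[j]
            delta = new_weight - weights[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= delta * X[i][j]
                weights[j] = new_weight
    return weights


def r_squared(y, predictions):
    """Proportion of variance in y explained by the predictions."""
    mean_y = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, predictions))
    ss_tot = sum((a - mean_y) ** 2 for a in y)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-in for a plant-by-feature matrix (not data from the study).
rng = random.Random(0)
n_plants, n_features = 60, 10
columns = [center([rng.gauss(0.0, 1.0) for _ in range(n_plants)])
           for _ in range(n_features)]
X = [[columns[j][i] for j in range(n_features)] for i in range(n_plants)]
# Hypothetical ground truth: only features 0 and 3 influence the response.
y = center([2.0 * X[i][0] - 1.5 * X[i][3] + rng.gauss(0.0, 0.1)
            for i in range(n_plants)])

weights = lasso_coordinate_descent(X, y, penalty=0.1)
selected = [j for j, w in enumerate(weights) if w != 0.0]
predictions = [sum(w * x for w, x in zip(weights, row)) for row in X]
r2 = r_squared(y, predictions)
print("selected features:", selected)
print("variance explained:", round(r2, 3))
```

The L1 penalty sets the weights of uninformative features exactly to zero, which is how a model can end up using only a small subset of the available features. In practice the penalty would be chosen by cross-validation, and variance explained would be measured on held-out plants rather than on the training data as in this sketch.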
Data integration has become a useful strategy for uncovering new insights into complex biological networks. We studied whether this approach can help to delineate the signal transducer and activator of transcription 6 (STAT6)-mediated transcriptional network driving T helper (Th) 2 cell fate decisions. To this end, we performed an integrative analysis of publicly available RNA-seq data of Stat6-knockout mouse studies together with STAT6 ChIP-seq data and our own gene expression time series data during Th2 cell differentiation. We focused on transcription factors (TFs), cytokines, and cytokine receptors and delineated 59 positively and 41 negatively STAT6-regulated genes, which were used to construct a transcriptional network around STAT6. The network illustrates that important and well-known TFs for Th2 cell differentiation are positively regulated by STAT6 and act either as activators for Th2 cells (e.g., Gata3, Atf3, Satb1, Nfil3, Maf, and Pparg) or as suppressors for other Th cell subpopulations such as Th1 (e.g., Ar), Th17 (e.g., Etv6), or iTreg (e.g., Stat3 and Hif1a) cells. Moreover, our approach reveals 11 TFs (e.g., Atf5, Creb3l2, and Asb2) with unknown functions in Th cell differentiation. This finding, together with the observed enrichment of asthma risk genes among those regulated by STAT6, underlines the potential value of the data integration strategy used here. Thus, our results clearly support the view that data integration is a useful tool to delineate complex physiological processes.
More effort — more results
(2016)
The development of 'omics' technologies has progressed to address complex biological questions that underlie various plant functions thereby producing copious amounts of data. The need to assimilate large amounts of data into biologically meaningful interpretations has necessitated the development of statistical methods to integrate multidimensional information. Throughout this review, we provide examples of recent outcomes of 'omics' data integration together with an overview of available statistical methods and tools.