Chrome Extension
WeChat Mini Program
Use on ChatGLM

Predicting the Natural Yeast Phenotypic Landscape with Machine Learning

biorxiv(2024)

Universite Cote d'Azur | Institute of Molecular Systems Biology | University of Gothenburg | EMBL-EBI | Institute of Research on Cancer and Ageing of Nice (IRCAN)

Cited 0|Views1
Abstract
Most organisms' traits result from the complex interplay of many genetic and environmental factors, making their prediction from genotypes difficult. Here, we used machine learning models to explore genotype-phenotype connections for 223 life history traits measured across 1011 genome-sequenced Saccharomyces cerevisiae strains. Firstly, we used genome-wide association studies to connect genetic variants with the phenotypes. Next, we benchmarked an automated machine learning pipeline that includes preprocessing, feature selection, and hyperparameters optimization in combination with multiple linear and complex machine learning methods. We determined gradient boosting machines as best performing in 65% of predictions and pangenome as best predictor, suggesting a considerable contribution of the accessory genome in controlling phenotypes. The accuracy broadly varied among the phenotypes (r = 0.2-0.9), consistent with varying levels of complexity, with stress resistance being easier to predict compared to growth across carbon and nitrogen nutrients. While no specific genomic features could be linked to the predictions for most phenotypes, machine learning identifies high-impact variants with established relationships to phenotypes despite being rare in the population. Near-perfect accuracies (r>0.95) were achieved when other phenomics data were used to aid predictions, suggesting shared useful information can be conveyed across phenotypes. Overall, our study underscores the power of machine learning to interpret the functional outcome of genetic variants. ### Competing Interest Statement The developed computational pipeline and scripts are available at: https://github.com/SakshiKhaiwal/Genotype-to-phenotype-mapping-in-yeast/tree/main. This software is provided solely for the purpose of scientific reproduction of the research described in the associated article and general academic research. Any other use, including commercial use, is prohibited without prior written consent from the authors. All rights reserved.
More
Translated text
PDF
Bibtex
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper
Summary is being generated by the instructions you defined