Advancements in genome sequencing have facilitated a deeper understanding of genetic variation in plants. However, predicting phenotypes from genomic data remains challenging. Machine learning (ML) techniques offer promising solutions in this context, particularly in deciphering genotype-phenotype relationships. Our study compares various ML methods to investigate this association using almond cultivar data. We preprocess the dataset, conduct feature selection, and employ regression models to predict almond yields. Results demonstrate strong genotype-phenotype associations, as indicated by evaluation metrics. Importantly, we employ explainable artificial intelligence algorithms to enhance model interpretability, identifying crucial genetic variations impacting yield. This underscores ML’s efficacy in predicting phenotypic traits from genomic data and highlights its significance in optimizing crop production for sustainable agriculture.

Machine Learning Approaches for Genotype to Phenotype Prediction in Plant Science

Pierfrancesco Novielli;Donato Romano;Stefano Pavan;Pasquale Losciale;Anna Maria Stellacci;Roberto Bellotti;Sabina Tangaro
2024-01-01

Abstract

Advancements in genome sequencing have facilitated a deeper understanding of genetic variation in plants. However, predicting phenotypes from genomic data remains challenging. Machine learning (ML) techniques offer promising solutions in this context, particularly in deciphering genotype-phenotype relationships. Our study compares various ML methods to investigate this association using almond cultivar data. We preprocess the dataset, conduct feature selection, and employ regression models to predict almond yields. Results demonstrate strong genotype-phenotype associations, as indicated by evaluation metrics. Importantly, we employ explainable artificial intelligence algorithms to enhance model interpretability, identifying crucial genetic variations impacting yield. This underscores ML’s efficacy in predicting phenotypic traits from genomic data and highlights its significance in optimizing crop production for sustainable agriculture.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/495860
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact