An Embarrassingly Simple Approach to Enhance Transformer Performance in Genomic Selection for Crop Breeding

Chen, Renqi; Han, Wenwei; Zhang, Haohao; Su, Haoyang; Wang, Zhefan; Liu, Xiaolei; Jiang, Hao; Ouyang, Wanli; Dong, Nanqing

arXiv:2405.09585 (cs)

[Submitted on 15 May 2024 (v1), last revised 24 Jun 2024 (this version, v3)]

Abstract: Genomic selection (GS), as a critical crop breeding strategy, plays a key role in enhancing food production and addressing the global hunger crisis. The predominant approaches in GS currently rely on statistical methods for prediction. However, statistical methods often come with two main limitations: strong statistical priors and linear assumptions. A recent trend is to capture the non-linear relationships between markers with deep learning. However, as crop datasets are commonly long sequences with limited samples, the robustness of deep learning models, especially Transformers, remains a challenge. In this work, to unleash the unexplored potential of the attention mechanism for the task of interest, we propose a simple yet effective Transformer-based framework that enables end-to-end training of the whole sequence. Via experiments on the rice3k and wheat3k datasets, we show that, with simple tricks such as k-mer tokenization and random masking, Transformers can achieve overall superior performance against seminal methods on GS tasks of interest.
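
The two tricks named in the abstract, k-mer tokenization and random masking, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the details, so the 0/1/2 genotype coding, the non-overlapping k-mers, the base-3 token ids, the reserved `MASK_ID`, and the function names `kmer_tokenize` and `random_mask` are all assumptions made here for illustration.

```python
import random
from typing import List, Optional

MASK_ID = 0  # hypothetical token id reserved for masked positions


def kmer_tokenize(genotypes: List[int], k: int = 3, n_states: int = 3) -> List[int]:
    """Group consecutive markers into non-overlapping k-mers, one integer id each.

    Assumes each marker is coded 0/1/2. A k-mer is read as a base-`n_states`
    number and shifted by 1 so that id 0 stays free for MASK_ID.
    """
    usable = len(genotypes) - len(genotypes) % k  # drop a trailing partial k-mer
    tokens = []
    for start in range(0, usable, k):
        token = 0
        for g in genotypes[start:start + k]:
            token = token * n_states + g
        tokens.append(token + 1)
    return tokens


def random_mask(tokens: List[int], mask_prob: float = 0.15,
                rng: Optional[random.Random] = None) -> List[int]:
    """Replace each token with MASK_ID independently with probability `mask_prob`."""
    rng = rng or random.Random()
    return [MASK_ID if rng.random() < mask_prob else t for t in tokens]


# Toy example: 10 markers, k = 3, so the last marker is dropped.
snps = [0, 1, 2, 2, 2, 0, 1, 1, 1, 0]
tokens = kmer_tokenize(snps, k=3)  # [6, 25, 14]
masked = random_mask(tokens, mask_prob=0.3, rng=random.Random(0))
```

Under these assumptions, tokenization shortens a sequence of n markers to roughly n/k tokens, which is one plausible way to make attention over long marker sequences tractable. Masking perturbs the input during training, which may help when samples are limited.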

Comments:	Accepted by IJCAI2024. Code is available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.09585 [cs.LG]
	(or arXiv:2405.09585v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.09585

Submission history

From: Renqi Chen
[v1] Wed, 15 May 2024 07:31:06 UTC (4,915 KB)
[v2] Sun, 19 May 2024 12:46:08 UTC (5,102 KB)
[v3] Mon, 24 Jun 2024 09:56:35 UTC (4,900 KB)
