Write a PREreview

Life as a Function: Why Transformer Architectures Struggle to Gain Genome-Level Foundational Capabilities

by Hassan Hassan, Kyle Puhger, Ali Saadat, Johannes Mayer, and Maximilian Sprang

Posted: January 14, 2025
Server: bioRxiv
DOI: 10.1101/2025.01.13.632745

Recent advances in generative models for nucleotide sequences have shown promise, but their practical utility remains limited. In this study, we explore DNA as a complex functional representation of evolutionary processes and assess the ability of transformer-based models to capture this complexity. Through experiments with both synthetic and real DNA sequences, we demonstrate that current transformer architectures, particularly auto-regressive models relying on next-token prediction, struggle to effectively learn the underlying biological functions. Our findings suggest that these models face inherent limitations, that cannot be overcome with scale, highlighting the need for alternative approaches that incorporate evolutionary constraints and structural information. We propose potential future directions, including the integration of topological methods or the switch of modelling paradigms, to enhance the generation of genomic sequences.

You can write a PREreview of Life as a Function: Why Transformer Architectures Struggle to Gain Genome-Level Foundational Capabilities. A PREreview is a review of a preprint and can vary from a few sentences to a lengthy report, similar to a journal-organized peer-review report.

Before you start

We will ask you to log in with your ORCID iD. If you don’t have an iD, you can create one.