paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago

DNA Language Models: An Assessment of Pre-Training for Fine-Tuning Tasks

Recent breakthroughs in foundation models and Large Language Models (LLMs) have introduced new opportunities for studying and decoding genomic sequences. Several state-of-the-art approaches, such as DNABERT2, rely on transformer-based architectures, while others, such as ConvNova, still build upon more conventional convolutional models. However, systematic benchmark comparisons across these methods remain scarce. Given that transformer-based models require extensive and costly pretraining, it is crucial to evaluate whether their performance gains justify this overhead. Moreover, LLMs such as D

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Related to

glossary_termTransformer

Implements

repovlm-starter

Related across the graph

glossary_termTransformer repovlm-starter

Topics

cs.CL