paperarXivTrust 82 · PrimaryPublished 4d agoLive · 3d ago
DNA Language Models: An Assessment of Pre-Training for Fine-Tuning Tasks
Recent breakthroughs in foundation models and Large Language Models (LLMs) have introduced new opportunities for studying and decoding genomic sequences. Several state-of-the-art approaches, such as DNABERT2, rely on transformer-based architectures, while others, such as ConvNova, still build upon more conventional convolutional models. However, systematic benchmark comparisons across these methods remain scarce. Given that transformer-based models require extensive and costly pretraining, it is crucial to evaluate whether their performance gains justify this overhead. Moreover, LLMs such as D
Lineage graph
Paper → model → repo connections mined from source citations (Tier-1 exact match).
