Read original ↗
paperarXivTrust 82 · PrimaryPublished 2d agoLive · 21h ago

Svarna: An Open Corpus Workbench for Modern Greek

This paper introduces Svarna, a free, open-source, web-based corpus workbench for modern Greek. Svarna integrates five databases covering various registers, institutional, literary, dialectal, social media, and historical, to provide a total of more than 507 million words and around 29 million sentences. This platform addresses the chronic gaps in Greek language technology. Although various corpus resources exist, they are scattered across different platforms, and in many cases, institutional access is restricted or they are no longer available online. Svarna integrates these resources into a

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

authored (incoming)

Implements (incoming)

Related across the graph

Topics