Read original ↗
paperarXivTrust 82 · PrimaryPublished yesterdayLive · 3h ago

LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

LLMs memorize sensitive training data, including personally identifiable information (PII), creating a pressing need for reliable post hoc removal methods. Unlearning has emerged as a promising solution, with state-of-the-art(SOTA) methods often following a localize-first, unlearn-second paradigm that targets specific model parameters. However, existing benchmarks evaluate unlearning solely at the output level, leaving open the question of whether unlearning truly erases knowledge from a model's parameters or merely obfuscates it, a concern reinforced by the success of resurfacing attacks. To

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

  • Linked via arxiv authorMatteo Boglioni

    LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

  • Linked via arxiv authorThibault Rousset

    LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

  • Linked via arxiv authorSiva Reddy

    LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

  • Linked via arxiv authorMarius Mosbach

    LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

  • Linked via arxiv authorVerna Dankers

    LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

Implements

Explains

Covers

authored (incoming)

Related across the graph

Topics