paper · arXiv · Trust 82 · Primary · Published yesterday · Live · 3h ago

LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

LLMs memorize sensitive training data, including personally identifiable information (PII), creating a pressing need for reliable post hoc removal methods. Unlearning has emerged as a promising solution, with state-of-the-art (SOTA) methods often following a localize-first, unlearn-second paradigm that targets specific model parameters. However, existing benchmarks evaluate unlearning solely at the output level, leaving open the question of whether unlearning truly erases knowledge from a model's parameters or merely obfuscates it, a concern reinforced by the success of resurfacing attacks. To

Lineage graph

Paper → model → repo connections mined from source citations (Tier-1 exact match).

Why these links exist

Linked via arXiv author Matteo Boglioni →
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
Linked via arXiv author Thibault Rousset →
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
Linked via arXiv author Siva Reddy →
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
Linked via arXiv author Marius Mosbach →
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning
Linked via arXiv author Verna Dankers →
LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

Implements

repo · chrisliu298/awesome-llm-unlearning

Explains

tutorial · Evaluate a model properly

Covers

news · Dataset of permissively licensed code released

authored (incoming)

person · Matteo Boglioni
person · Thibault Rousset
person · Siva Reddy
person · Marius Mosbach
person · Verna Dankers

Related across the graph

repo · chrisliu298/awesome-llm-unlearning
person · Matteo Boglioni
person · Marius Mosbach
news · Dataset of permissively licensed code released
person · Verna Dankers
person · Siva Reddy
tutorial · Evaluate a model properly
person · Thibault Rousset

Topics

cs.AI