Discussion about this post

User's avatar
Tanya Matanda's avatar

Totally agree — you nailed the production reality.

If “retraining” can’t reliably anchor the baseline, then we’re not measuring unlearning so much as measuring the quirks of whichever attack we picked that day. That calibration drift is exactly what makes these evaluations hard to defend in compliance reviews and audit trails.

That’s why I like the “zero-grounding” property here: it gives you a stable reference point, so improvements (or regressions) are actually interpretable — not just attack-dependent noise.

Appreciate you calling out the theory → privacy engineering bridge.

No posts

Ready for more?