Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models
Large language models are typically trained on datasets collected from the web, which may inadvertently contain harmful or sensitive personal information. To address growing privacy concerns, unlearning methods have been proposed to remove the influence of specific data from trained models. Of...