We found results for “”
WS-2024-0008
Date: April 11, 2024
Extraction attack - models are being trained over data that may be sensitive, there are techniques to edit the model in order to delete information from it. The suggested attack extracts a “deleted” answer with relatively high probability. Two attacks are published - whitebox, blackbox; for the whitebox attack the paper suggests a defense that lowers the attack success from 38% to 2.4%.
Language: ML
Severity Score
Related Resources (2)
Severity Score
CVSS v3.1
Base Score: |
|
---|---|
Attack Vector (AV): | LOCAL |
Attack Complexity (AC): | LOW |
Privileges Required (PR): | NONE |
User Interaction (UI): | NONE |
Scope (S): | UNCHANGED |
Confidentiality (C): | NONE |
Integrity (I): | HIGH |
Availability (A): | NONE |