WS-2024-0009
Date: April 11, 2024
Adversarial demonstration attack: the attack uses adversarial demonstrations (concrete examples of the desired task being performed) to degrade the model's performance on sentiment analysis, textual entailment, topic classification, and question classification tasks. For example, when given 8-shot demonstrations, a model gives a wrong sentiment prediction on an SST-2 sentence with 56%-82% probability, depending on the model.
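To make the setting concrete, the following is a minimal sketch of how a k-shot sentiment prompt might be assembled and how the resulting misprediction rate could be measured. The prompt template, the helper names `build_prompt` and `attack_success_rate`, and the label strings are illustrative assumptions, not taken from the advisory; the step that actually perturbs the demonstrations is deliberately not shown.

```python
from typing import List, Tuple

# A demonstration is a (sentence, label) pair placed in the prompt.
Demo = Tuple[str, str]


def build_prompt(demos: List[Demo], query: str) -> str:
    """Concatenate k labelled demonstrations, then the unlabelled query.

    The "Review:/Sentiment:" template is a hypothetical choice for
    illustration only.
    """
    blocks = [f"Review: {text}\nSentiment: {label}" for text, label in demos]
    blocks.append(f"Review: {query}\nSentiment:")
    return "\n\n".join(blocks)


def attack_success_rate(gold: List[str], predicted: List[str]) -> float:
    """Fraction of test sentences whose prediction differs from the gold label."""
    if not gold or len(gold) != len(predicted):
        raise ValueError("gold and predicted must be non-empty and equal length")
    wrong = sum(g != p for g, p in zip(gold, predicted))
    return wrong / len(gold)
```

In this sketch, an evaluator would build one prompt per test sentence using the (possibly adversarial) demonstrations, collect the model's predictions, and pass them to `attack_success_rate`; a result between 0.56 and 0.82 would correspond to the range quoted above.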
Language: ML
Severity Score
CVSS v3.1
Base Score: | |
---|---|
Attack Vector (AV): | NETWORK |
Attack Complexity (AC): | LOW |
Privileges Required (PR): | NONE |
User Interaction (UI): | NONE |
Scope (S): | UNCHANGED |
Confidentiality (C): | NONE |
Integrity (I): | HIGH |
Availability (A): | NONE |
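The base score can be derived from the metrics listed above. The sketch below applies the CVSS v3.1 base-score equations for the Scope: UNCHANGED case only; the function and dictionary names are illustrative, and the numeric weights are those defined in the CVSS v3.1 specification. The result is a computed value, not one stated on this page.

```python
import math

# Metric weights from the CVSS v3.1 specification (Scope UNCHANGED values for PR).
WEIGHTS = {
    "AV": {"NETWORK": 0.85, "ADJACENT": 0.62, "LOCAL": 0.55, "PHYSICAL": 0.2},
    "AC": {"LOW": 0.77, "HIGH": 0.44},
    "PR": {"NONE": 0.85, "LOW": 0.62, "HIGH": 0.27},
    "UI": {"NONE": 0.85, "REQUIRED": 0.62},
    "CIA": {"HIGH": 0.56, "LOW": 0.22, "NONE": 0.0},
}


def roundup(value: float) -> float:
    """Round up to one decimal place, as defined in the CVSS v3.1 specification."""
    scaled = int(round(value * 100000))
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score_scope_unchanged(av, ac, pr, ui, c, i, a) -> float:
    """CVSS v3.1 base score for vectors with Scope: UNCHANGED."""
    cia = WEIGHTS["CIA"]
    iss = 1 - (1 - cia[c]) * (1 - cia[i]) * (1 - cia[a])
    impact = 6.42 * iss
    exploitability = (
        8.22 * WEIGHTS["AV"][av] * WEIGHTS["AC"][ac]
        * WEIGHTS["PR"][pr] * WEIGHTS["UI"][ui]
    )
    if impact <= 0:
        return 0.0
    return roundup(min(impact + exploitability, 10))


score = base_score_scope_unchanged(
    "NETWORK", "LOW", "NONE", "NONE", c="NONE", i="HIGH", a="NONE"
)
print(score)  # 7.5
```

Under these equations the listed vector (AV:N/AC:L/PR:N/UI:N/S:U/C:N/I:H/A:N) works out to 7.5, which falls in the High band of the CVSS v3.1 qualitative rating scale.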