
red teaming
/ˈred ˌtiːmɪŋ/
adversarial testing to find vulnerabilities and failure modes in AI systems
“Red teaming uncovered that the chatbot could be manipulated into giving harmful advice.”
Origin: From military exercises where a red team plays the adversary