constitutional AI

constitutional AI

/ˌkɒnstɪˈtuːʃənəl ˌeɪ ˈaɪ/

🛡️ AI Safety & Alignment

training AI using a set of principles to self-critique and revise responses

Constitutional AI helped the model refuse harmful requests while remaining helpful.

Origin: Latin constitutio establishing + AI