Category: Safety & Ethics
Definition
Constitutional AI is a training method in which an AI system critiques and revises its own responses against a written set of principles, called a constitution.
How It Works
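The model is given a constitution: a written list of principles, such as being helpful, harmless, and honest. It generates a response, critiques that response against the principles, and then revises it.
This self-correction happens during training, not just at runtime: the revised responses become fine-tuning data, and in Anthropic's published method a second phase uses AI-generated preference judgments as the reward signal for reinforcement learning (often called RLAIF).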
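The generate, critique, and revise loop can be sketched in a few lines of Python. This is a minimal illustration, not Anthropic's implementation: the principle list, the `TrainingExample` type, and the `toy_*` functions are all hypothetical stand-ins for calls to a real language model.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical constitution: illustrative principles only, not an official list.
CONSTITUTION: List[str] = ["Be helpful.", "Be harmless.", "Be honest."]


@dataclass
class TrainingExample:
    prompt: str
    response: str


def critique_and_revise(
    prompt: str,
    generate: Callable[[str], str],
    critique: Callable[[str, str, str], str],
    revise: Callable[[str, str, str], str],
    principles: List[str],
) -> TrainingExample:
    """Draft a response, then critique and revise it once per principle."""
    response = generate(prompt)
    for principle in principles:
        feedback = critique(prompt, response, principle)
        if feedback:  # an empty string means no problem was found
            response = revise(prompt, response, feedback)
    return TrainingExample(prompt, response)


# Toy stand-ins for model calls (assumptions made so the sketch runs offline).
def toy_generate(prompt: str) -> str:
    return f"[UNSAFE DRAFT] answer to: {prompt}"


def toy_critique(prompt: str, response: str, principle: str) -> str:
    if principle == "Be harmless." and "[UNSAFE DRAFT]" in response:
        return "The draft contains unsafe content."
    return ""


def toy_revise(prompt: str, response: str, feedback: str) -> str:
    return response.replace("[UNSAFE DRAFT]", "[REVISED]")


def build_dataset(prompts: List[str]) -> List[TrainingExample]:
    """Collect revised responses; in training these would be fine-tuning data."""
    return [
        critique_and_revise(p, toy_generate, toy_critique, toy_revise, CONSTITUTION)
        for p in prompts
    ]
```

Calling `build_dataset(["How do I stay safe online?"])` returns one `TrainingExample` whose response is the revised text rather than the first draft. The key design point is that only the revised responses are kept, so the model is later trained on its corrected output.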
Why It Matters
Constitutional AI reduces the need for humans to review and label individual harmful outputs. Human oversight is concentrated in the written principles, and the model learns to evaluate its own responses against them.
Anthropic developed this method and uses it to train Claude, with the aim of making the model less likely to produce harmful content.