Category: Fundamentals
Definition
Training data is the information used to teach AI systems how to perform tasks, like labeled photos for image recognition or text examples for language models.
How It Works
AI learns by finding patterns in examples. For a spam filter, you'd show it thousands of emails labeled "spam" or "not spam." The AI spots what makes spam different from regular email.
The quality and quantity of training data determines how well AI performs. More diverse, accurate data usually means better AI performance.
Why It Matters
Training data shapes everything an AI system can do. Biased data creates biased AI. Missing examples mean the AI fails on those cases. Poor quality data leads to unreliable AI.
Understanding training data helps you evaluate AI systems and spot their limitations.
← Back to Fundamentals | All Terms