CoNLL
Category: Protocols & Standards
Definition
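CoNLL refers to a family of plain-text formats for annotated text data, named after the Conference on Computational Natural Language Learning (CoNLL), whose shared tasks introduced them. These formats are widely used in natural language processing tasks such as named entity recognition and dependency parsing.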
How It Works
CoNLL formats are column-based: each token occupies one line, with its annotations in columns separated by whitespace (tabs in CoNLL-X and CoNLL-U, single spaces in CoNLL-2003). A blank line separates sentences.
The number and meaning of the columns depend on the variant. Depending on the task, they can hold part-of-speech tags, dependency heads and relations, chunk tags, or entity labels.
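The layout described above can be sketched as a small reader. This is a minimal illustration, not a reference implementation: the function name `parse_conll`, the three-column layout (token, part-of-speech tag, entity label), and the sample sentences are all assumptions made for the example. It also assumes that no field contains whitespace and that the input has no comment or document-marker lines, which some variants do allow.

```python
def parse_conll(text, columns=("token", "pos", "ner")):
    """Parse CoNLL-style text into a list of sentences.

    Each sentence is a list of dicts, one per token, keyed by the
    (hypothetical) column names passed in `columns`.
    """
    sentences, current = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        # split() with no argument handles both tabs and spaces.
        fields = line.split()
        current.append(dict(zip(columns, fields)))
    # Flush the last sentence if the text does not end with a blank line.
    if current:
        sentences.append(current)
    return sentences


# Illustrative sample, not taken from any real corpus.
SAMPLE = (
    "Alice\tNNP\tB-PER\n"
    "visited\tVBD\tO\n"
    "Paris\tNNP\tB-LOC\n"
    "\n"
    "Bob\tNNP\tB-PER\n"
    "slept\tVBD\tO\n"
)

sentences = parse_conll(SAMPLE)
print(len(sentences))               # 2
print(sentences[0][2]["ner"])       # B-LOC
```

For real corpora, a dedicated reader for the specific variant is usually a safer choice, since details such as comment lines, multiword tokens, and column order differ between variants.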
Why It Matters
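Because many NLP tools read and write CoNLL formats, annotated corpora can be shared between tools and datasets with little conversion work. This makes the formats a common choice for distributing annotated corpora and evaluating NLP systems.
Many benchmarks for sequence labeling and parsing, including the CoNLL shared tasks, distribute their data in a CoNLL format, so systems evaluated on them can be compared on the same data.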