What is CoQA?
CoQA is a large-scale dataset for building Conversational Question Answering systems. The goal of the CoQA challenge is to measure the ability of machines to understand a text passage and answer a series of interconnected questions that appear in a conversation. CoQA is pronounced as coca.
CoQA contains 127,000+ questions with answers collected from 8,000+ conversations. Each conversation is obtained by pairing two crowdworkers to chat about a passage in the form of questions and answers.
The unique features of CoQA include:
- the questions are conversational;
- the answers can be free-form text;
- each answer also comes with an evidence subsequence highlighted in the passage; and
- the passages are collected from seven diverse domains.
CoQA contains many challenging phenomena that are not present in existing reading comprehension datasets, e.g., coreference and pragmatic reasoning.
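To make the structure described above concrete, here is a minimal Python sketch of how one conversation could be modeled: a passage, a domain label, and a sequence of question-answer turns, each with a free-form answer and an evidence span in the passage. The class names, field names, the helper `make_turn`, the toy passage, and the domain label are all hypothetical and chosen for illustration; this is not the official CoQA file format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Turn:
    question: str        # conversational question; may refer back to earlier turns
    answer: str          # free-form answer, not necessarily a verbatim span
    evidence_start: int  # character offset where the evidence span begins
    evidence_end: int    # character offset just past the end of the evidence span


@dataclass
class Conversation:
    passage: str
    domain: str          # hypothetical label for the passage's source domain
    turns: List[Turn] = field(default_factory=list)

    def evidence(self, turn_index: int) -> str:
        """Return the passage text highlighted as evidence for one turn."""
        turn = self.turns[turn_index]
        return self.passage[turn.evidence_start:turn.evidence_end]


def make_turn(passage: str, question: str, answer: str, evidence_text: str) -> Turn:
    """Hypothetical helper: locate the evidence text in the passage and store its offsets."""
    start = passage.index(evidence_text)  # raises ValueError if the span is absent
    return Turn(question, answer, start, start + len(evidence_text))


# Toy passage written for this sketch; it is not taken from the dataset.
passage = "Anna baked a cake. She gave it to her brother."

conv = Conversation(passage=passage, domain="example-domain")
conv.turns.append(make_turn(passage, "Who baked a cake?", "Anna", "Anna baked a cake"))
# "she" and "it" in the second question can only be resolved from the first turn.
conv.turns.append(
    make_turn(passage, "What did she do with it?", "Gave it to her brother",
              "She gave it to her brother")
)

for i, turn in enumerate(conv.turns):
    print(f"Q{i + 1}: {turn.question}")
    print(f"A{i + 1}: {turn.answer}  (evidence: {conv.evidence(i)!r})")
```

The second question illustrates the coreference phenomenon mentioned above: "she" and "it" have no referent unless the system tracks the conversation history. Storing evidence as character offsets rather than copied text is a design choice of this sketch, not a statement about how the dataset stores it.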