The full name of MKQA is Multilingual Knowledge Questions & Answers. It is an open-domain question-and-answer evaluation set that contains 10k question-answer pairs across 26 different types of languages (a total of 260k question-answer pairs). The goal of this dataset is to provide a challenging benchmark for question answering quality across multiple languages. The dataset MKQA contains 10,000 queries sampled from the Google Natural Questions dataset. For each query a new paragraph-independent answer is collected. These queries and answers were then human-translated into 25 non-English languages. MKQA data can be downloaded from here. Each of the data set… |
#Multilingual #Knowledge #Question #Answering #Dataset #MKQA