Skip to content

Latest commit

 

History

History
11 lines (6 loc) · 1.49 KB

README.md

File metadata and controls

11 lines (6 loc) · 1.49 KB

RUQuAD

A repository with information about Reykjavik University Question-Answering Dataset (RUQuAD).

The first version of RUQuAD (RUQuAD 22.02) was collected in 2021-2022 by about 1,000 crowd-workers who used the GameQA mobile app platform to generate about 23,000 questions of which about 20,800 passed a double peer review. For these 20,800 verified questions, the crowd-workers annotated about 12,700 answers, sourced from five sources in four separate domains: The Icelandic Wikipedia, The Icelandic Web of Science, the news websites mbl.is and visir.is, and The Icelandic Government Information website.

Please refer to the following paper regarding GameQA and the compilation of RUQuAD:

Njáll Skarphéðinsson, Breki Guðmundsson, Steinar Smári, Marta Kristín Lárusdóttir, Hafsteinn Einarsson, Abuzar Khan, Eric Nyberg, and Hrafn Loftsson. 2023. GameQA: Gamified Mobile App for Building Multiple-Domain Question-Answering Datasets. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL): System Demonstrations. Dubrovnik, Croatia.

RUQuAD 22.02 is available for download from CLARIN.is as two separate datasets: RUQuAD-1 and RUQuAD-2.