Step 0. Golden Query Dataset (Instruct dataset) Annotation

There are three options for providing a dataset to fine-tune or evaluate the Text to SQL (QueryCraft) pipeline:

  1. Bring your own dataset with golden queries in the following format: question, query, and db_id. Instructions for ingesting the dataset are provided in Step 1.

  2. Curate the golden query dataset using our annotation tool: https://annotator.superknowa.tsglwatson.buildlab.cloud/

  3. Use the example datasets provided below for testing: Spider and KaggleDBQA

    Unzip the example datasets using the following commands:

    unzip spider.zip
    unzip KaggleDBQA.zip
    cd ..
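
For option 1, it can help to check that a dataset has the question, query, and db_id fields before ingesting it. The following is a minimal sketch, assuming the golden queries are stored as a CSV file with a header row; the source does not state the file type. The function name `validate_golden_queries` and the sample row are hypothetical and are not part of QueryCraft.

```python
import csv
import io

# Field names taken from the format described above.
REQUIRED_COLUMNS = ("question", "query", "db_id")


def validate_golden_queries(csv_text):
    """Return a list of problems found in the CSV text (empty if none)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []

    # Stop early if any required column is absent from the header.
    missing = [c for c in REQUIRED_COLUMNS if c not in header]
    if missing:
        return ["missing column(s): " + ", ".join(missing)]

    problems = []
    # Record numbering starts at 2 because the header is record 1.
    for row_number, row in enumerate(reader, start=2):
        for column in REQUIRED_COLUMNS:
            if not (row[column] or "").strip():
                problems.append(f"row {row_number}: empty {column}")
    return problems


# Hypothetical sample row, for illustration only.
sample = (
    "question,query,db_id\n"
    "How many singers are there?,SELECT count(*) FROM singer,concert_singer\n"
)
print(validate_golden_queries(sample))
```

To check a file on disk, read its contents and pass the text to the function, for example `validate_golden_queries(open(path, newline="").read())`, where `path` points to your own dataset.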
    

Golden Query Annotation:

  1. Go to our annotation tool: https://annotator.superknowa.tsglwatson.buildlab.cloud/

Data annotator view

  2. Click on the Instruction Manual and follow the instructions for curating the golden query dataset: https://annotator.superknowa.tsglwatson.buildlab.cloud/documentation

Data annotation instruction manual