Skip to content

RaRa-digiLab/ocr-corrections-gpt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Testing OCR Post-Correction with GPT Models

Raw data should be unpacked into the data/raw folder.
To run GPT, create a file named ./env and add the line MY_API_KEY= (place your API key after the equals sign).

  • Preparation of training data based on raw data is performed in the notebook training_data.ipynb.
  • Fine-tuning of the models was conducted on the OpenAI platform.
  • Evaluation is performed in the notebook eval.ipynb.

About

Testing OCR post-correction with GPT

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published