Skip to content

Latest commit

 

History

History
55 lines (49 loc) · 1.74 KB

README.md

File metadata and controls

55 lines (49 loc) · 1.74 KB

small100

fine-tune SMaLL-100 model and make inference

fine-tune small100 model and test inference; translate English sentence to Korean using small100 model

SMaLL-100 ?

SMaLL-100 is multilingual neural machine translation model improved from M2M-100

Tools Used

  • Hugging Face
    • model
    • tokenizer
  • PyTorch
    • Dataset
    • DataLoader
    • SequentialSampler
  • Python

How to Execute

  1. install requirements
    recommend to use venv
    pip install -r requirements.txt
  2. start venv requirements installed in venv --> command source venv/bin/activate to start the virtual environment and then run files in this package
  3. run main file command python3 . in terminal (at /small100) or python3 small100 in parent directory to run __main__.py
  4. type your input as instruction, and get the result!

Directory Structure

  • /data: training data
  • /log: log files written during training
  • /model: model checkpoints
  • /utils: not included in this package automatically, but useful codes
    • clear_command.py
    • log_to_loss_plot.py
    • test_resume_training.py
  • __init__.py: initializer of this package. includes needed files
  • __main__.py: main functionality of this package. do training or inference
  • tokenization_small100.py: needed for model's tokenization, provided from the small100 model developer
  • training.py: fine-tuning pre-trained small100 model
  • setting.py: setting needed for training
  • log.py: logging during training
  • inference.py: inference using trained small100 model

TODO

  • freeze most of the model's parameters, and learn only a few parameters at fine-tuning