small100

Fine-tune the SMaLL-100 model and run inference with it.

This package fine-tunes the small100 model and tests inference by translating English sentences into Korean.

What is SMaLL-100?

SMaLL-100 is a multilingual neural machine translation model that improves on M2M-100.

model hub (hugging face): https://huggingface.co/alirezamsh/small100
paper: https://aclanthology.org/2022.emnlp-main.571/
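
As a rough illustration of English-to-Korean inference with the published checkpoint, the sketch below follows the usage pattern shown on the model hub page. The function name `translate_en_to_ko` is hypothetical, the code loads the pre-trained weights rather than a fine-tuned checkpoint, and this repository's inference.py may be organized differently. It assumes `transformers`, `torch`, and `sentencepiece` are installed and that tokenization_small100.py is importable.

```python
MODEL_NAME = "alirezamsh/small100"  # model hub ID from the link above


def translate_en_to_ko(text, model_name=MODEL_NAME):
    """Translate one English sentence into Korean.

    Hypothetical helper: downloads the model on first use.
    """
    # Imports are kept inside the function so that merely defining it
    # needs neither transformers nor network access.
    from transformers import M2M100ForConditionalGeneration
    from tokenization_small100 import SMALL100Tokenizer  # file shipped in this repository

    model = M2M100ForConditionalGeneration.from_pretrained(model_name)
    tokenizer = SMALL100Tokenizer.from_pretrained(model_name)
    tokenizer.tgt_lang = "ko"  # target language code for Korean

    encoded = tokenizer(text, return_tensors="pt")
    generated = model.generate(**encoded)
    return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]
```

Calling `translate_en_to_ko("How are you?")` should return a Korean string; the exact wording depends on the checkpoint.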

Tools Used

Hugging Face
- model
- tokenizer
PyTorch
- Dataset
- DataLoader
- SequentialSampler
Python
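
The PyTorch pieces listed above usually fit together as sketched below. This is not the code from training.py: the class `ParallelSentenceDataset`, the function `make_loader`, and the toy sentence pairs are all assumptions made for illustration.

```python
class ParallelSentenceDataset:
    """Map-style dataset of (English, Korean) sentence pairs.

    A real project would normally subclass torch.utils.data.Dataset.
    DataLoader only needs __len__ and __getitem__, so the class is kept
    plain here and can be used without PyTorch installed.
    """

    def __init__(self, pairs):
        self.pairs = list(pairs)

    def __len__(self):
        return len(self.pairs)

    def __getitem__(self, index):
        source, target = self.pairs[index]
        return {"source": source, "target": target}


def make_loader(dataset, batch_size=2):
    """Wrap the dataset in a DataLoader that visits samples in order."""
    # Imported lazily so that defining this function does not require torch.
    from torch.utils.data import DataLoader, SequentialSampler

    return DataLoader(
        dataset,
        sampler=SequentialSampler(dataset),  # fixed order, no shuffling
        batch_size=batch_size,
    )


# Toy pairs for illustration only, not taken from /data.
toy_pairs = [("Hello.", "안녕하세요."), ("Thank you.", "감사합니다.")]
toy_dataset = ParallelSentenceDataset(toy_pairs)
```

A `SequentialSampler` keeps the sample order fixed, which suits evaluation or reproducible runs; a shuffling sampler is the more common choice for training.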

How to Execute

Install the requirements. Using a venv is recommended.
- pip install -r requirements.txt
Activate the venv. If the requirements were installed in a venv, run source venv/bin/activate to start the virtual environment before running any file in this package.
Run the main file. Run python3 . in a terminal from /small100, or python3 small100 from the parent directory, to execute __main__.py.
Type your input as instructed and get the result.
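
Both run commands work because Python executes the `__main__.py` file of a directory that is passed to it as a script. The sketch below reproduces that mechanism with the standard `runpy` module on a throwaway directory; `run_directory` and `demo_pkg` are hypothetical names and have nothing to do with this package's real `__main__.py`.

```python
import runpy
import tempfile
from pathlib import Path


def run_directory(package_dir):
    """Execute package_dir/__main__.py, roughly as `python3 package_dir` would."""
    # run_name="__main__" makes `if __name__ == "__main__":` guards fire.
    return runpy.run_path(str(package_dir), run_name="__main__")


# Build a throwaway directory with its own __main__.py and run it.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "demo_pkg"
    demo.mkdir()
    (demo / "__main__.py").write_text("RESULT = 'ran __main__.py'\n")
    namespace = run_directory(demo)  # dict of the executed module's globals
    demo_result = namespace["RESULT"]
```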

Directory Structure

/data: training data
/log: log files written during training
/model: model checkpoints
/utils: helper scripts that are not included in this package automatically but are useful
- clear_command.py
- log_to_loss_plot.py
- test_resume_training.py
__init__.py: initializer of this package; includes the files it needs
__main__.py: main entry point of this package; runs training or inference
tokenization_small100.py: required for the model's tokenization; provided by the small100 model developers
training.py: fine-tunes the pre-trained small100 model
setting.py: settings used for training
log.py: logging during training
inference.py: inference using the trained small100 model
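
A script such as log_to_loss_plot.py presumably starts by pulling loss values out of the files in /log. The sketch below shows one way to do that. The log line format (`loss: <number>`), the pattern, and the sample lines are all assumptions, since the README does not document what log.py writes.

```python
import re

# Assumed format (hypothetical): each training line contains "loss: <number>".
LOSS_PATTERN = re.compile(r"loss[:=]\s*([0-9]+(?:\.[0-9]+)?(?:[eE][-+]?[0-9]+)?)")


def extract_losses(lines):
    """Return the loss values found in the given log lines, in order."""
    losses = []
    for line in lines:
        match = LOSS_PATTERN.search(line)
        if match:  # lines without a loss value are skipped
            losses.append(float(match.group(1)))
    return losses


# Made-up sample lines for illustration, not real training output.
sample_log = [
    "epoch 1 step 10 loss: 3.21",
    "saved checkpoint",
    "epoch 1 step 20 loss: 2.87",
]
sample_losses = extract_losses(sample_log)
```

The resulting list can be passed to any plotting library to draw the loss curve.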

TODO

Freeze most of the model's parameters and train only a few of them during fine-tuning.
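
One possible way to approach this TODO is to switch off gradients for every parameter whose name does not match a chosen prefix. The sketch below is not part of the repository: `freeze_except` is a hypothetical helper, and the parameter names are stand-ins so that the example runs without PyTorch. With a real model, the first argument would be `model.named_parameters()`.

```python
from types import SimpleNamespace


def freeze_except(named_parameters, trainable_prefixes):
    """Freeze every parameter whose name does not start with a given prefix.

    Works on any iterable of (name, parameter) pairs where the parameter
    has a `requires_grad` attribute, e.g. model.named_parameters().
    Returns the names of the parameters left trainable.
    """
    prefixes = tuple(trainable_prefixes)
    trainable = []
    for name, param in named_parameters:
        keep = name.startswith(prefixes)
        param.requires_grad = keep  # False means the parameter is frozen
        if keep:
            trainable.append(name)
    return trainable


# Stand-in parameters with hypothetical names, used instead of a real model.
fake_params = [
    ("encoder.block0.weight", SimpleNamespace(requires_grad=True)),
    ("decoder.block0.weight", SimpleNamespace(requires_grad=True)),
    ("decoder.block1.weight", SimpleNamespace(requires_grad=True)),
]
still_trainable = freeze_except(fake_params, ["decoder.block1."])
```

When this is applied to a real model, the optimizer would typically be given only the parameters whose `requires_grad` is still `True`.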
