-
Notifications
You must be signed in to change notification settings - Fork 480
Issues: pytorch/torchtune
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Potential issue in prompt handling in
generate()
in torchtune/recipes/generate.py
#2243
opened Jan 10, 2025 by
insop
Grad Norm Differences Across Nodes
discussion
Start a discussion
#2240
opened Jan 9, 2025 by
EugenHotaj
Finetune meta-llama/Llama-Guard-3-1B
triaged
This issue has been assigned an owner and appropriate label
#2237
opened Jan 8, 2025 by
jingzhaoou
quantization recipe should mimic checkpointer.save_checkpoint
better engineering
Tasks which help improve eng productivity e.g. building tools, cleaning up code, writing docs
#2229
opened Jan 4, 2025 by
felipemello1
Improvement: define a protocol to handle base loss and all chunked loss.
enhancement
New feature or request
#2226
opened Jan 2, 2025 by
insop
Improvement: add a "division by zero" check in chunked loss handling in kd_losses.py
enhancement
New feature or request
#2225
opened Jan 2, 2025 by
insop
Hugging Face from_pretrained() using merged weights KeyError: 'base_model_name_or_path'
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2224
opened Jan 2, 2025 by
chg0901
How to use train and test split with the recipes?
enhancement
New feature or request
triaged
This issue has been assigned an owner and appropriate label
#2222
opened Jan 1, 2025 by
7rabbit
Add a page explaining quickly setting up with custom data in live docs
#2221
opened Jan 1, 2025 by
RdoubleA
packed errors
bug
Something isn't working
triaged
This issue has been assigned an owner and appropriate label
#2218
opened Dec 31, 2024 by
chg0901
First example dataset for instruct datasets has no _component
#2215
opened Dec 30, 2024 by
johnowhitaker
hotw to estimate gpu memory needed for knowledge distillation?
discussion
Start a discussion
triaged
This issue has been assigned an owner and appropriate label
#2213
opened Dec 30, 2024 by
chuangzhidan
[Question] what to do when model doesn't have
tokenizer.model
?
high-priority
#2212
opened Dec 29, 2024 by
steveepreston
Support masking of partial dialogue in multi-turn chat datasets
#2207
opened Dec 25, 2024 by
jiatong-yu
Llama3.1 models do not allow configuring Something isn't working
triaged
This issue has been assigned an owner and appropriate label
max_seq_len
bug
#2202
opened Dec 23, 2024 by
akashc1
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.