-
Notifications
You must be signed in to change notification settings - Fork 244
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[cp] Add cudnn attention support to Context Parallel
CLA Signed
This label is managed by the Meta Open Source bot.
[do NOT land] CP+torch.compile debugging attempt
CLA Signed
This label is managed by the Meta Open Source bot.
Make CheckpointManager friendlier to custom StorageWriter/StorageReader
CLA Signed
This label is managed by the Meta Open Source bot.
#789
opened Jan 12, 2025 by
dimdi-y
Loading…
Register backward hook for the whole optim_dict to enable working at multi schedule pp
CLA Signed
This label is managed by the Meta Open Source bot.
[Not for land] Integrate float8nocompile, an experimental feature for high performance
CLA Signed
This label is managed by the Meta Open Source bot.
#778
opened Jan 7, 2025 by
danielvegamyhre
Loading…
[PoC] Typed JobConfig
CLA Signed
This label is managed by the Meta Open Source bot.
#767
opened Jan 1, 2025 by
jaysonfrancis
Loading…
[MoE][PoC] Expert Parallel: tp and tp2ep
CLA Signed
This label is managed by the Meta Open Source bot.
[Not for land] Show replicated fp32 norm weights
CLA Signed
This label is managed by the Meta Open Source bot.
First draft Auto-SAC workflow
CLA Signed
This label is managed by the Meta Open Source bot.
#710
opened Dec 2, 2024 by
sanketpurandare
•
Draft
[WIP] Allow benchmark between multiple configs
CLA Signed
This label is managed by the Meta Open Source bot.
#703
opened Nov 26, 2024 by
H-Huang
Loading…
[WIP] Adding OBELICS DataLoader
CLA Signed
This label is managed by the Meta Open Source bot.
#663
opened Oct 30, 2024 by
TJ-Solergibert
Loading…
[not for land] torch.compile individual linears
CLA Signed
This label is managed by the Meta Open Source bot.
#661
opened Oct 29, 2024 by
vkuzo
Loading…
Init weights only if not loading a checkpoint
CLA Signed
This label is managed by the Meta Open Source bot.
[DO NOT REVIEW] gaps to enable FDSP2 cpu offloading
CLA Signed
This label is managed by the Meta Open Source bot.
#622
opened Oct 16, 2024 by
weifengpy
Loading…
[Not for land] Settings to make Llama3-8B on 8 GPUs faster
CLA Signed
This label is managed by the Meta Open Source bot.
[not for land] TE experiments, take 2
CLA Signed
This label is managed by the Meta Open Source bot.
#614
opened Oct 14, 2024 by
vkuzo
Loading…
[DO NOT REVIEW] --experimental.fsdp_sharding_on_largest_dim
CLA Signed
This label is managed by the Meta Open Source bot.
#607
opened Oct 9, 2024 by
weifengpy
Loading…
fix mixed precision for This label is managed by the Meta Open Source bot.
replicate
/ pure DDP
CLA Signed
#591
opened Sep 29, 2024 by
152334H
Loading…
[not for land yet] hack max and abs out of ops eligible for AC
CLA Signed
This label is managed by the Meta Open Source bot.
#580
opened Sep 17, 2024 by
vkuzo
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2024-12-16.