-
Notifications
You must be signed in to change notification settings - Fork 242
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Make CheckpointManager friendlier to custom StorageWriter/StorageReader
CLA Signed
This label is managed by the Meta Open Source bot.
#789
opened Jan 12, 2025 by
dimdi-y
Loading…
Register backward hook for the whole optim_dict to enable working at multi schedule pp
CLA Signed
This label is managed by the Meta Open Source bot.
Do not aggregate the losses since last log step
CLA Signed
This label is managed by the Meta Open Source bot.
#779
opened Jan 7, 2025 by
carmocca
Loading…
[Not for land] Integrate float8nocompile, an experimental feature for high performance
CLA Signed
This label is managed by the Meta Open Source bot.
#778
opened Jan 7, 2025 by
danielvegamyhre
Loading…
[PoC] Typed JobConfig
CLA Signed
This label is managed by the Meta Open Source bot.
#767
opened Jan 1, 2025 by
jaysonfrancis
Loading…
[MoE][PoC] Expert Parallel: tp and tp2ep
CLA Signed
This label is managed by the Meta Open Source bot.
[Not for land] Show replicated fp32 norm weights
CLA Signed
This label is managed by the Meta Open Source bot.
First draft Auto-SAC workflow
CLA Signed
This label is managed by the Meta Open Source bot.
#710
opened Dec 2, 2024 by
sanketpurandare
•
Draft
[WIP] Allow benchmark between multiple configs
CLA Signed
This label is managed by the Meta Open Source bot.
#703
opened Nov 26, 2024 by
H-Huang
Loading…
[WIP] Adding OBELICS DataLoader
CLA Signed
This label is managed by the Meta Open Source bot.
#663
opened Oct 30, 2024 by
TJ-Solergibert
Loading…
[not for land] torch.compile individual linears
CLA Signed
This label is managed by the Meta Open Source bot.
#661
opened Oct 29, 2024 by
vkuzo
Loading…
Init weights only if not loading a checkpoint
CLA Signed
This label is managed by the Meta Open Source bot.
[DO NOT REVIEW] gaps to enable FDSP2 cpu offloading
CLA Signed
This label is managed by the Meta Open Source bot.
#622
opened Oct 16, 2024 by
weifengpy
Loading…
[Not for land] Settings to make Llama3-8B on 8 GPUs faster
CLA Signed
This label is managed by the Meta Open Source bot.
[not for land] TE experiments, take 2
CLA Signed
This label is managed by the Meta Open Source bot.
#614
opened Oct 14, 2024 by
vkuzo
Loading…
[DO NOT REVIEW] --experimental.fsdp_sharding_on_largest_dim
CLA Signed
This label is managed by the Meta Open Source bot.
#607
opened Oct 9, 2024 by
weifengpy
Loading…
fix mixed precision for This label is managed by the Meta Open Source bot.
replicate
/ pure DDP
CLA Signed
#591
opened Sep 29, 2024 by
152334H
Loading…
[not for land yet] hack max and abs out of ops eligible for AC
CLA Signed
This label is managed by the Meta Open Source bot.
#580
opened Sep 17, 2024 by
vkuzo
Loading…
add pp validation for schedule
CLA Signed
This label is managed by the Meta Open Source bot.
#568
opened Sep 5, 2024 by
H-Huang
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.