-
Notifications
You must be signed in to change notification settings - Fork 411
Pull requests: pytorch/torchtitan
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[WIP] Document MX FP8 recipe
CLA Signed
This label is managed by the Meta Open Source bot.
#1350
opened Jun 27, 2025 by
lessw2020
Loading…
Autoparallel support for DP-only, DP+TP, or TP-only
CLA Signed
This label is managed by the Meta Open Source bot.
#1349
opened Jun 27, 2025 by
wconstab
Loading…
[WIP] Enable causal block mask for sdpa
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP][RFC] Always flatten model state_dict
CLA Signed
This label is managed by the Meta Open Source bot.
#1347
opened Jun 26, 2025 by
fegin
Loading…
unit test for flux_dataset dataloader checkpointing
CLA Signed
This label is managed by the Meta Open Source bot.
#1346
opened Jun 26, 2025 by
wesleytruong
Loading…
[SimpleFSDP] Add support for hsdp+tp
CLA Signed
This label is managed by the Meta Open Source bot.
#1343
opened Jun 26, 2025 by
ruisizhang123
Loading…
Only calls destroy_process_group if the trainer exist successfully
CLA Signed
This label is managed by the Meta Open Source bot.
#1342
opened Jun 26, 2025 by
fegin
Loading…
[DSV3] Apply TP on DSV3
CLA Signed
This label is managed by the Meta Open Source bot.
#1341
opened Jun 26, 2025 by
wwwjn
Loading…
[WIP] Refactor Tokenizer -> BaseTokenizer
CLA Signed
This label is managed by the Meta Open Source bot.
[kernels][blackwell] add cutlass/cute group gemm forward for blackwell
CLA Signed
This label is managed by the Meta Open Source bot.
#1327
opened Jun 22, 2025 by
lessw2020
Loading…
Support finetuning from a pretrained model
CLA Signed
This label is managed by the Meta Open Source bot.
#1321
opened Jun 20, 2025 by
vwxyzjn
Loading…
[float8] add _auto_filter_for_recipe for float8 training
CLA Signed
This label is managed by the Meta Open Source bot.
#1319
opened Jun 18, 2025 by
danielvegamyhre
Loading…
Support different tokenizers
CLA Signed
This label is managed by the Meta Open Source bot.
#1318
opened Jun 18, 2025 by
H-Huang
Loading…
[not for land] testing out float8 128_1_128_128 blockwise scaling
CLA Signed
This label is managed by the Meta Open Source bot.
#1317
opened Jun 18, 2025 by
vkuzo
Loading…
Do not submit: Multinode training seems to be working
CLA Signed
This label is managed by the Meta Open Source bot.
#1314
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
Do not submit: Multinode is working with multiple controllers
CLA Signed
This label is managed by the Meta Open Source bot.
#1313
opened Jun 17, 2025 by
ahmadsharif1
•
Draft
[llama4][auxiliary-loss-free load balancing] update expert_bias without backward hooks
CLA Signed
This label is managed by the Meta Open Source bot.
#1304
opened Jun 16, 2025 by
hann-wang
Loading…
Finetune from pre-trained models
CLA Signed
This label is managed by the Meta Open Source bot.
#1300
opened Jun 15, 2025 by
vwxyzjn
Loading…
[not for land] Use new AC
CLA Signed
This label is managed by the Meta Open Source bot.
#1294
opened Jun 13, 2025 by
soulitzer
Loading…
WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1288
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
Titan changes to use DCP ZOC instead of titan default Async + Pinned Memory
CLA Signed
This label is managed by the Meta Open Source bot.
#1287
opened Jun 12, 2025 by
Saiteja64
Loading…
DO NOT SUBMIT: WIP: Try to use monarch to run torchtitan.
CLA Signed
This label is managed by the Meta Open Source bot.
#1286
opened Jun 12, 2025 by
ahmadsharif1
•
Draft
[deepseek][kernels][blackwell] Cutlass blackwell grouped gemm using cute dsl (forward,backward)
CLA Signed
This label is managed by the Meta Open Source bot.
#1276
opened Jun 8, 2025 by
lessw2020
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.