Actions: deepspeedai/DeepSpeed

hpu-gaudi2

1,348 workflow runs

Precisely track nvme optimizer offload
hpu-gaudi2 #1522: Pull request #6963 opened by tjruwase
January 20, 2025 17:00 57m 4s olruwase/ds_4998
Using explicit GPU upcast for ZeRO-Offload
hpu-gaudi2 #1521: Pull request #6962 opened by xylian86
January 20, 2025 13:25 57m 18s xylian86:explicit_upcast
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
hpu-gaudi2 #1519: Pull request #6553 synchronize by gyou2021
January 20, 2025 10:03 Action required gyou2021:configurable_autoTP
Autotp training
hpu-gaudi2 #1518: Pull request #6922 synchronize by inkcherry
January 20, 2025 09:24 59m 58s inkcherry:autotp_training
Autotp training
hpu-gaudi2 #1517: Pull request #6922 synchronize by inkcherry
January 20, 2025 07:50 52m 24s inkcherry:autotp_training
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
hpu-gaudi2 #1516: Pull request #6553 synchronize by gyou2021
January 20, 2025 06:23 Action required gyou2021:configurable_autoTP
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
hpu-gaudi2 #1515: Pull request #6553 synchronize by gyou2021
January 20, 2025 05:25 Action required gyou2021:configurable_autoTP
hpu-gaudi2
hpu-gaudi2 #1514: Scheduled
January 20, 2025 00:11 2h 2m 11s master
hpu-gaudi2
hpu-gaudi2 #1513: Scheduled
January 19, 2025 00:12 2h 4m 29s master
hpu-gaudi2
hpu-gaudi2 #1512: Scheduled
January 18, 2025 00:10 2h 3m 21s master
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
hpu-gaudi2 #1511: Pull request #6931 synchronize by loadams
January 17, 2025 22:20 57m 53s loadams/fix-torch-issues
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
hpu-gaudi2 #1510: Pull request #6931 synchronize by loadams
January 17, 2025 18:21 58m 41s loadams/fix-torch-issues
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
hpu-gaudi2 #1509: Pull request #6931 synchronize by loadams
January 17, 2025 18:08 13m 12s loadams/fix-torch-issues
Enabled configurable auto Tensor Parallelism (TP) for the inference of diverse models
hpu-gaudi2 #1506: Pull request #6553 synchronize by gyou2021
January 17, 2025 10:26 Action required gyou2021:configurable_autoTP
hpu-gaudi2
hpu-gaudi2 #1505: Scheduled
January 17, 2025 00:11 2h 0m 18s master
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
hpu-gaudi2 #1504: Pull request #6931 synchronize by loadams
January 16, 2025 20:47 57m 39s loadams/fix-torch-issues
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1503: Pull request #6932 synchronize by oelayan7
January 16, 2025 15:42 59m 6s oelayan7:linear
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
hpu-gaudi2 #1501: Pull request #6931 synchronize by loadams
January 16, 2025 00:40 3h 26m 40s loadams/fix-torch-issues
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1500: Pull request #6932 synchronize by loadams
January 16, 2025 00:23 2h 30m 11s oelayan7:linear
hpu-gaudi2
hpu-gaudi2 #1499: Scheduled
January 16, 2025 00:11 2h 0m 58s master
warn to warning
hpu-gaudi2 #1498: Pull request #6952 opened by qgallouedec
January 15, 2025 18:32 55m 3s qgallouedec:warn_to_warning
Addressing ipg Buffer Data Race Condition in Zero Stage2
hpu-gaudi2 #1497: Pull request #3727 synchronize by loadams
January 15, 2025 17:09 Action required xxr3376:master
[inf] Add config var to enable keeping module on host
hpu-gaudi2 #1496: Pull request #6846 synchronize by loadams
January 15, 2025 17:06 1h 12m 27s oelayan7:keep_module_on_host
generalize deepspeed linear and implement it for non cuda systems
hpu-gaudi2 #1495: Pull request #6932 synchronize by loadams
January 15, 2025 16:24 57m 32s oelayan7:linear