Skip to content

Releases: pytorch/test-infra

v20240724-192912

24 Jul 19:30
06e8bd7
Compare
Choose a tag to compare
Fix bug with emitting logs from runners (#5503)

v20240724-185114

24 Jul 18:52
5e075d8
Compare
Choose a tag to compare
On ScaleDown remove old/stale SSM parameters (#5479)

v20240724-160740

24 Jul 16:09
0087fbd
Compare
Choose a tag to compare
Small improvements designed to increase the resiliency of ephemeral i…

v20240719-185908

19 Jul 19:00
a979143
Compare
Choose a tag to compare
Emit Runner info at the start of every job on self hosted linux runne…

v20240718-180515

18 Jul 18:06
576f39c
Compare
Choose a tag to compare
Remove workaround to install kernel-devel (#5480)

This workaround might not be necessary any longer as kernel-devel
appears to be available again in package managers. This also fixes the
LF ALI runners as they are failing to fetch from the s3://ossci-linux
bucket due to no permissions.

I verified by doing a `yum install kernel-devel` on one of the EC2
instances that was stuck and saw that it returned a newer kernel-devel
package.

Relates to pytorch/pytorch#129880 and pytorch/ci-infra#244.

Signed-off-by: Thanh Ha <[email protected]>

v20240715-170934

15 Jul 17:10
3609c4c
Compare
Choose a tag to compare
Adds metada options on launch instance templates to set idsv1 optiona…

v20240712-180343

12 Jul 18:05
920f85f
Compare
Choose a tag to compare
Move user-data and daemon monitoring to a separate CW namespace (#5424)

v20240712-140548

12 Jul 14:07
e523942
Compare
Choose a tag to compare
Improves visibility for AMI experiments (#5421)

v20240710-170951

10 Jul 17:11
4e4c73e
Compare
Choose a tag to compare
Adds support to experiment with AMI runner-by-runner (#5406)

v20240704-191103

04 Jul 19:12
e8625c3
Compare
Choose a tag to compare
Userdata send cloudwatch metrics to setup alerts on failures (#5399)