Performance issue regarding Sagemaker Estimator.deploy() #2356

UniSabina · 2019-08-28T09:33:58Z

UniSabina
Aug 28, 2019

System Information

Python Version: 3.6
CPU or GPU: CPU
Python SDK Version: Anaconda

Describe the problem

I'm quite new to the SageMaker algorithms and estimators so please bear with me.
I'm running a script very similar to this example script for DeepAR
https://github.com/awslabs/amazon-sagemaker-examples/blob/master/introduction_to_amazon_algorithms/deepar_electricity/DeepAR-Electricity.ipynb
And want to start more than hundred of such training + prediction jobs.
The cell

predictor = estimator.deploy(
    initial_instance_count=1,
    instance_type='ml.m4.xlarge',
    predictor_cls=DeepARPredictor)

takes up 70% (~8.5min) of the time of the overall training and predicting job (~12min). Is there a possibility to reduce that time? What is the reason for this deploy job taking so long?

Thanks!

ChoiByungWook · 2019-08-30T21:16:41Z

ChoiByungWook
Aug 30, 2019

Hello @UniSabina,

Apologies on the slow experience.

I'll reach out to the team that handles the hosting platform and let them comment.

0 replies

vedal · 2019-12-16T11:56:48Z

vedal
Dec 16, 2019

I'm also interested in this

0 replies

lexxx233 · 2020-09-09T22:44:52Z

lexxx233
Sep 9, 2020

Any update on this?

0 replies

rishabh1212 · 2020-11-12T15:25:27Z

rishabh1212
Nov 12, 2020

Hii @ChoiByungWook,
Any update on this?

0 replies

nrapanos · 2021-01-05T15:15:01Z

nrapanos
Jan 5, 2021

Is there any update on this? I'm experiencing a similar issue.

0 replies

dsow86 · 2021-02-23T09:52:24Z

dsow86
Feb 23, 2021

Any update about this issue? I am still having the same problem, the model deployment take too long.

0 replies

zlufy90 · 2021-04-22T07:22:01Z

zlufy90
Apr 22, 2021

It is too slow for training and deploying model via SageMaker Studio. I just tested with Iris dataset.
Especially when debugging. It makes us give up easily.

0 replies

yitzchak-ben-ezra-ecoplant · 2021-06-30T12:17:25Z

yitzchak-ben-ezra-ecoplant
Jun 30, 2021

really slow

0 replies

avi-shapiro · 2022-05-05T06:44:30Z

avi-shapiro
May 5, 2022

Still no update? I can build and push a custom PyTorch training image, and then train a model in a shorter time than deploying.

0 replies

hinthornw · 2022-06-28T03:34:31Z

hinthornw
Jun 28, 2022

Takes over 90 minutes for me.

0 replies

Soroush-aali-bagi · 2023-10-27T21:25:23Z

Soroush-aali-bagi
Oct 27, 2023

How much should I wait before killing the process?!

0 replies

justintimewatkins · 2024-03-15T23:46:03Z

justintimewatkins
Mar 15, 2024

so slow still

0 replies

ArthurDeleu · 2024-05-01T08:09:10Z

ArthurDeleu
May 1, 2024

terribly slow... tring to deploy an ml.m4.xlarge, since the max health_check_timeout is topped at 3600 and my deployment currently takes over an hour, I am not able to deploy llama3 70B. What to do?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Performance issue regarding Sagemaker Estimator.deploy() #2356

{{title}}

Replies: 13 comments

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply