
Tutorial for AOTI Python runtime #2997

Merged: 26 commits merged on Aug 23, 2024
Conversation

agunapal
Contributor

@agunapal agunapal commented Aug 12, 2024

Description

We already have an AOT Inductor tutorial showing inference with the C++ runtime here.

This tutorial shows how to run AOTI with the Python runtime.

  • Shows support for dynamic_shapes for the batch dimension
  • Shows how to include torch.compile options such as max-autotune mode
  • Reverts the Docker image back to the devel image (needed for AOT compile)

Checklist

  • The issue that is being fixed is referenced in the description (see "Fixes #ISSUE_NUMBER" above)
  • Only one issue is addressed in this pull request
  • Labels from the issue that this PR is fixing are added to this pull request
  • No unnecessary issues are included in this pull request


pytorch-bot bot commented Aug 12, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/2997

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 194388e with merge base 96b9c27 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@agunapal
Contributor Author

Hi @svekars, do I ignore this https://github.com/pytorch/tutorials/actions/runs/10360994812/job/28680461319?pr=2997 or do I need to add some checks in the tutorial?

The failure is because the machine doesn't support Triton.

@svekars
Contributor

svekars commented Aug 13, 2024

@agunapal you need to put it on a different worker, similar to this

Contributor

@svekars svekars left a comment


Just a few editorial nits. Also, it feels a bit short for a full-size intermediate tutorial; we should either add more or move it to recipes. We also need to add entries to either index.rst or recipes_source/recipes_index.rst (depending on whether it's a recipe or a tutorial).

#
# .. note::
#
# This API also supports :func:`torch.compile` options like `mode`
Contributor


Suggested change
# This API also supports :func:`torch.compile` options like `mode`
# This API also supports :func:`torch.compile` options like ``mode`` and others.

@agunapal
Contributor Author

Just a few editorial nits. Also, it feels a bit short for a full-size intermediate tutorial; we should either add more or move it to recipes. We also need to add entries to either index.rst or recipes_source/recipes_index.rst (depending on whether it's a recipe or a tutorial).

Sure, once the content is finalized and looks good, we can move it wherever you think it's appropriate.

# a shared library that can be run in a non-Python environment.
#
#
# In this tutorial, you will learn an end-to-end example of how to use AOTInductor for the Python runtime.
Contributor


It will make the story more complete by explaining the "why" part here, e.g. eliminating recompilation at run time, max-autotune ahead of time, etc.

Contributor Author


Done. I haven't mentioned eliminating recompilation, since the tutorial doesn't show that.

example_inputs = (torch.randn(2, 3, 224, 224, device=device),)

# min=2 is not a bug and is explained in the 0/1 Specialization Problem
batch_dim = torch.export.Dim("batch", min=2, max=32)
Contributor


I believe it is ok to use min=1 here, but we can't feed in an example input with batch size 1.

Contributor Author

@agunapal agunapal Aug 16, 2024


A batch size of 1 is often tried as an example input, hence I set min=2.

agunapal and others added 2 commits August 16, 2024 14:24
Contributor

@svekars svekars left a comment


Please double-check the formatting here. We also need to add it to recipes_index.rst. Otherwise, from the publishing perspective, LGTM.

@agunapal
Contributor Author

Please double-check the formatting here. We also need to add it to recipes_index.rst. Otherwise, from the publishing perspective, LGTM.

@svekars I fixed the indentation of Pre-requisites. It's still not rendering correctly. Any suggestions?


######################################################################
# We see that there is a drastic speedup in first inference time using AOTInductor compared
# to ``torch.compile``
Contributor


Do you have some example numbers to share here? So readers can get some rough idea without actually running the code.

Contributor Author


On the rendered HTML, the tutorial shows 2.92 ms vs. 7000 ms. It might be good to collect this number over a range of models, similar to how we show the perf difference between compile and eager.

@svekars svekars merged commit ea2dfc6 into pytorch:main Aug 23, 2024
18 checks passed
svekars added a commit that referenced this pull request Aug 23, 2024
* Tutorial for AOTI Python runtime
---------

Co-authored-by: Svetlana Karslioglu <[email protected]>
Co-authored-by: Angela Yi <[email protected]>
c-p-i-o pushed a commit that referenced this pull request Sep 6, 2024
6 participants