Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Release/2.5.0] Timm model jx_nest_base amp_fp16 inference got fail_accuracy #900

Open
mengfei25 opened this issue Sep 12, 2024 · 1 comment
Assignees
Milestone

Comments

@mengfei25
Copy link
Contributor

🐛 Describe the bug

xpu eval jx_nest_base
[WARNING] Failed to create Level Zero tracer: 2013265921
(I): Detected 1024 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 8192 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 8192 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 4096 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 4096 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
E0912 00:16:10.029000 3264502 site-packages/torch/_dynamo/utils.py:1802] RMSE (res-fp64): 0.00087, (ref-fp64): 0.00036 and shape=torch.Size([8, 1000]). res.dtype: torch.float16, multiplier: 2.000000, tol: 0.001000, use_larger_multiplier_for_smaller_tensor: 0
fail_accuracy

Versions

pytorch: 2.5.0-rc1 (https://download.pytorch.org/whl/test/xpu)
torch-xpu-ops: 1206590 (main)

@jianyizh
Copy link

jianyizh commented Sep 24, 2024

@mengfei25 I can pass this test locally on pvc 1550.
xpu eval jx_nest_base
(I): Detected 5504 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 5504 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 2304 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
(I): Detected 1792 spills, recompiling the kernel using large GRF mode
(I): Kernel has now 0 spills
Compiled module path: /home/sdp/jianyi/pytorch/inductor_log/timm_models/jx_nest_base/inference/amp_fp16/22/tmpdnoznzfe/ew/cewlskfnrewrjvkri3yjlsyn2wgwtgmzpz3s2ihdmqao5ack4a3p.py
W0926 19:40:25.206000 1566446 torch/_inductor/debug.py:434] [0/0] jx_nest_base__0_inference_0 debug trace: /home/sdp/jianyi/pytorch/inductor_log/timm_models/jx_nest_base/inference/amp_fp16/22/torch_compile_debug/torch_compile_debug/run_2024_09_26_19_39_25_399754-pid_1566446/torchinductor/jx_nest_base__0_inference_0.0
pass

pytorch-triton-xpu 3.0.0+cc981feba1
torch 2.5.0a0+git4a3dabd /home/sdp/jianyi/pytorch

@jianyizh jianyizh assigned jianyizh and unassigned retonym Sep 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants