
Multi-queries Paged attn fails with continuator halted #8459

Draft
wants to merge 10 commits into base: master
Conversation

vanbasten23
Collaborator

root@t1v-n-408567d9-w-0:/workspaces/persist#  python pytorch/xla/test/benchmarks/test_paged_attention_benchmark.py --kernel multi-queries-paged-attn-v1 --profile
WARNING:root:libtpu.so and TPU device found. Setting PJRT_DEVICE=TPU.
Warming up...
Traceback (most recent call last):
  File "/workspaces/persist/pytorch/xla/test/benchmarks/test_paged_attention_benchmark.py", line 258, in <module>
    benchmark(args)
  File "/workspaces/persist/pytorch/xla/test/benchmarks/test_paged_attention_benchmark.py", line 239, in benchmark
    run_benchmark(num_iters=10, profile=False)
  File "/workspaces/persist/pytorch/xla/test/benchmarks/test_paged_attention_benchmark.py", line 230, in run_benchmark
    jax.block_until_ready(actual_output)
  File "/usr/local/lib/python3.10/site-packages/jax/_src/api.py", line 2763, in block_until_ready
    try_to_block(arrays[0])
  File "/usr/local/lib/python3.10/site-packages/jax/_src/api.py", line 2746, in try_to_block
    return x.block_until_ready()
jaxlib.xla_extension.XlaRuntimeError: FAILED_PRECONDITION: The program continuator has halted unexpectedly.
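For context: JAX dispatches device programs asynchronously, so a runtime failure like the `FAILED_PRECONDITION` above often surfaces only at a synchronization point such as `jax.block_until_ready` (the call in the traceback), not at the line that launched the kernel. A minimal sketch of that synchronization pattern, using a toy matmul instead of the paged-attention kernel (runs on CPU; no TPU needed):

```python
import jax
import jax.numpy as jnp

# JAX returns from jnp.dot immediately; the computation runs asynchronously
# on the backing device. Errors from the device program can therefore be
# raised later, when the result is forced.
x = jnp.ones((4, 4))
y = jnp.dot(x, x)

# block_until_ready waits for device execution to finish; this is where an
# XlaRuntimeError from a failing kernel would be raised.
y = jax.block_until_ready(y)
print(float(y[0, 0]))  # 4.0
```

This is why the traceback points at `jax.block_until_ready` inside `run_benchmark` rather than at the paged-attention kernel invocation itself; the actual failure happens earlier, inside the compiled program on the TPU.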
