-
Notifications
You must be signed in to change notification settings - Fork 183
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[FA2] fa2/hgemm manually smem swizzle🎉 (#185)
* Update flash_attn_mma.py * Create makefile * Create README.md * Update and rename matrix_trans_swizzle.cu to mat_trans_swizzle.cu * Update hgemm_mma_swizzle.cu * Update mat_trans_swizzle.cu * Update and rename flash_attn_mma_swizzle_qkv.cu to flash_attn_mma_share_kv_swizzle.cu * Create flash_attn_mma_share_qkv_swizzle.cu * Create flash_attn_mma_split_q_swizzle.cu * Create flash_attn_mma_split_kv_swizzle.cu * Create flash_attn_mma_tiling_qk_swizzle.cu * Create flash_attn_mma_tiling_qkv_swizzle.cu * Update flash_attn_mma_share_qkv_swizzle.cu * Update flash_attn_mma_split_kv_swizzle.cu * Update flash_attn_mma_split_q_swizzle.cu * Update flash_attn_mma_tiling_qk_swizzle.cu * Update flash_attn_mma_tiling_qkv_swizzle.cu * Update README.md * Update hgemm_mma_swizzle.cu * Update makefile * Update README.md * Update README.md * Update mat_trans_swizzle.cu * Update makefile * Update hgemm_mma_swizzle.cu * Update hgemm_mma_swizzle.cu * Update README.md * Update hgemm_mma_stage.cu * Update hgemm_mma.cu * Update makefile * Update utils.h * Create mma_simple_swizzle.cu * Update makefile * Update mma_simple_swizzle.cu * Update hgemm_mma_swizzle.cu * Update makefile * Update utils.py * Update makefile * Create hgemm_mma_stage_swizzle.cu * Update hgemm.py * Update hgemm.cc * Update mat_trans_swizzle.cu * Update flash_attn_mma_tiling_qk_swizzle.cu * Update flash_attn.cc * Update flash_attn_mma.py * Update flash_attn_mma.py * Update flash_attn_mma.py * Update flash_attn_mma_tiling_qk_swizzle.cu * Update flash_attn_mma_tiling_qk_swizzle.cu * Update flash_attn_mma_share_kv_swizzle.cu * Update README.md * Update README.md * Create print_swizzle_layout.py * Update flash_attn_mma_tiling_qk_swizzle.cu * Update flash_attn_mma_share_kv_swizzle.cu * Update README.md * Update hgemm_mma_stage_swizzle.cu * Update README.md * Update README.md * Update README.md * Update mma_simple_swizzle.cu * Create print_swizzle_layout.py * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md
- Loading branch information
Showing
27 changed files
with
3,940 additions
and
179 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.