kernels-community/metal-flash-sdpa

[WIP] Add sliding-window attention support to the varlen kernel

#5 opened about 1 month ago by

Add flash_attn_with_kvcache paged-decode kernel (port from huggingface/transformers#45977)

#4 opened 2 months ago by

Add flash_attn_func + harden MPS dispatch for transformers compatibility

#3 opened 2 months ago by

Test code with transformers

#1 opened 9 months ago by