Mastodon Share
Sharing on Mastodon:

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning – PaperGrep

https://papergrep.dev/paper/flashattention-2-faster-attention-with-better-1253e9

HomeAbout