Fast Transformer Decoding: One Write-Head is All You Need – PaperGrep

https://papergrep.dev/paper/fast-transformer-decoding-one-write-head-is-all-8cd251
