This talk by Cameron Buckner in a seminar organized by @UlrikeHahn has the most insightful way of explaining attention in the transformer architecture. A really cool talk as well, sad that the discussion is not on YouTube 馃ゲ
This talk by Cameron Buckner in a seminar organized by @UlrikeHahn has the most insightful way of explaining attention in the transformer architecture. A really cool talk as well, sad that the discussion is not on YouTube 馃ゲ
A space for Bonfire maintainers and contributors to communicate