1
1Pinned
Understanding transformer models for NLP tasks
I've been studying transformer architecture and while I understand the basics of attention mechanisms, I'm struggling with some concepts. Can someone explain: 1. Why positional encoding is necessary?...