dxd-log
🤖 AI/ML

papers | Grokking of Hierarchical Structure in Vanilla Transformers