Home
Tags
optimization
Tag
Cancel
optimization
1
Lost in Backpropagation: The LM Head is a Gradient Bottleneck
Mar 10, 2026
Trending Tags
thesis
anisotropy
softmax
data-curation
language-models
pretraining
attention
award
biomedical
compression