•
A deep dive into hyperball geometry and pretraining optimization strategies.
1 min read · January 02, 2025
2025 · weight decay hyperball optimization deep learning · research