-
Abstract: In this talk, I will explore key research contributions in efficient deep learning, with a focus on training smaller yet highly capable language models. I will discuss approaches such as curating high-quality datasets and designing effective training curricula. The talk will cover different stages of training, including pre-training, mid-training, and agentic reasoning and highlight…