Anant Sahai
UC Berkeley Qualcomm Chair Professor of Electrical Engineering and Computer Sciences and (Part-time) Visiting Faculty Researcher at Google.

Research Themes
Machine Learning
Wireless
Information Theory
Control
Recent Papers
Synthetic Error Injection Fails to Elicit Self-Correction In Language Models
arXiv preprint: 2512.02389 • 2025
Reinforcement learning has become the dominant paradigm for eliciting reasoning and self-correction capabilities in large language models, but its computational expense motivates exploration of alternatives. Inspired by techniques from autonomous driving and robotics, we investigate whether supervised learning with synthetic error injection can induce self-correction abilities in language models. Our approach inserts artificial errors into reasoning chains, masks them, and supervises the model to recognize and correct these mistakes. Despite the intuitive appeal of this method, we find that it fails to significantly improve performance even on simple synthetic tasks across multiple models. Moreover, even when the model catches its own error, it often parrots the original mistake. We find that the distribution shift from synthetic errors to on-policy errors significantly degrades the error-correction capabilities of the fine-tuned model, even with good synthetic coverage of on-policy errors. Our results help explain why on-policy reinforcement learning methods have proven uniquely effective for eliciting self-correction.
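The error-injection recipe described above can be illustrated with a minimal sketch. This is not the paper's pipeline: the corruption rule, the toy arithmetic chain, and the function names are all hypothetical stand-ins for whatever error distribution and supervision format the actual method uses.

```python
import random

def perturb_first_digit(step):
    """Toy 'error': bump the first digit found in a reasoning step."""
    for ch in step:
        if ch.isdigit():
            return step.replace(ch, str((int(ch) + 1) % 10), 1)
    return step

def inject_error(steps, error_fn, seed=0):
    """Corrupt one step of a reasoning chain and record the fix.

    Returns (corrupted_chain, error_index, correction), the raw
    material for a supervised example that asks the model to flag
    the bad step and emit the corrected version.
    """
    rng = random.Random(seed)
    i = rng.randrange(len(steps))
    corrupted = list(steps)
    corrupted[i] = error_fn(steps[i])
    return corrupted, i, steps[i]

chain = ["2 + 2 = 4", "4 * 3 = 12", "12 - 5 = 7"]
bad, idx, fix = inject_error(chain, perturb_first_digit)
# Schematic supervision target: "Step idx is wrong; it should read fix."
```

The paper's finding is that models fine-tuned on examples built this way still fail to correct their own on-policy mistakes, because on-policy errors are distributed differently from any fixed synthetic `error_fn`.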
Different simultaneous mechanisms for in-context recall have distinct learning dynamics
ICML 3rd Workshop on High-dimensional Learning Dynamics (HiLD) • 2025
We introduce a new family of toy problems that combine features of linear-regression-style continuous in-context learning (ICL) with discrete associative recall. We pretrain transformer models on sample traces from this toy, specifically symbolically-labeled interleaved state observations from randomly drawn linear deterministic dynamical systems. We study whether the transformer models can recall the state of a sequence previously seen in their context when prompted to do so with the corresponding in-context label. Taking a closer look at this task, it becomes clear that the model must perform two functions: (1) identify which system's state should be recalled and apply that system to its last seen state, and (2) continue applying the correct system to predict the subsequent states. Training dynamics reveal that the first capability emerges well into a model's training. Surprisingly, the second capability, of continuing the prediction of a resumed sequence, develops much earlier. Via out-of-distribution experiments, and a mechanistic analysis of model weights via edge pruning, we find that next-token prediction for this toy problem involves at least two separate mechanisms. One mechanism uses the discrete symbolic labels to do the associative recall required to predict the start of a resumption of a previously seen sequence. The second mechanism, which is largely agnostic to the discrete symbolic labels, performs a 'Bayesian-style' prediction based on the previous token and the context. These two mechanisms have different learning dynamics. To confirm that this multi-mechanism phenomenon (manifesting as separate phase transitions) is not just an artifact of our toy setting, we used OLMo training checkpoints on an ICL translation task to observe a similar phenomenon: a decisive gap in the emergence of first-task-token performance vs. second-task-token performance.
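The toy data described above can be sketched as follows. This is my guessed rendering of the construction, not the paper's exact trace format: the interleaving order, dimensions, and the spectral-radius normalization are all assumptions made so the example is self-contained and bounded.

```python
import numpy as np

def make_trace(n_systems=2, dim=2, steps=4, seed=0):
    """Interleaved, symbolically-labeled observations from randomly
    drawn linear deterministic systems x_{t+1} = A_k x_t.

    Each trace element is a (label, state) pair; the recall task is
    to emit a system's next state when prompted with its label.
    """
    rng = np.random.default_rng(seed)
    systems = []
    for _ in range(n_systems):
        A = rng.normal(size=(dim, dim))
        # Scale to spectral radius < 1 so states stay bounded.
        A /= 1.1 * max(abs(np.linalg.eigvals(A)))
        systems.append(A)
    states = [rng.normal(size=dim) for _ in range(n_systems)]
    trace = []
    for _ in range(steps):
        for k, A in enumerate(systems):
            states[k] = A @ states[k]
            trace.append((k, states[k].copy()))
    return systems, trace
```

Under this layout, predicting the element that resumes system k tests associative recall (capability 1), while predicting the elements after it only requires continuing to apply A_k (capability 2).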
Provable weak-to-strong generalization via benign overfitting
International Conference on Learning Representations (ICLR) • 2025
The classic teacher-student model in machine learning posits that a strong teacher supervises a weak student to improve the student's capabilities. We instead consider the inverted situation, where a weak teacher supervises a strong student with imperfect pseudolabels. This paradigm was recently brought forth by Burns et al.'23 and termed weak-to-strong generalization. We theoretically investigate weak-to-strong generalization for binary and multilabel classification in a stylized overparameterized spiked covariance model with Gaussian covariates where the weak teacher's pseudolabels are asymptotically like random guessing. Under these assumptions, we provably identify two asymptotic phases of the strong student's generalization after weak supervision: (1) successful generalization and (2) random guessing. Our techniques should eventually extend to weak-to-strong multiclass classification. Towards doing so, we prove a tight lower tail inequality for the maximum of correlated Gaussians, which may be of independent interest. Understanding the multilabel setting reinforces the value of using logits for weak supervision when they are available.
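The stylized setup above can be sketched for binary classification. This is my simplified rendering, not the paper's exact scaling regime: the spike direction, signal strength, and flip-noise model for the weak teacher are illustrative assumptions.

```python
import numpy as np

def weak_to_strong_data(n=200, d=2000, spike=5.0, flip=0.45, seed=0):
    """Overparameterized (d >> n) spiked-covariance binary data with
    noisy pseudolabels from a weak teacher.

    x = y * spike * mu + Gaussian noise, where mu is the spike
    direction; the weak teacher flips the true label y with
    probability `flip`, so as flip -> 1/2 its pseudolabels approach
    random guessing.
    """
    rng = np.random.default_rng(seed)
    mu = np.zeros(d)
    mu[0] = 1.0                            # spike direction
    y = rng.choice([-1, 1], size=n)
    X = y[:, None] * spike * mu + rng.normal(size=(n, d))
    flips = rng.random(n) < flip
    y_weak = np.where(flips, -y, y)        # weak pseudolabels
    return X, y, y_weak
```

The paper's question is when a strong student that interpolates the pseudolabels (X, y_weak) nonetheless generalizes to the true labels y, i.e., when overfitting the weak teacher's noise is benign.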
On the Impossibility of Convergence of Mixed Strategies with Optimal No-Regret Learning
Mathematics of Operations Research • 2024
We study the limiting behavior of the mixed strategies that result from optimal no-regret learning in a repeated game setting where the stage game is any 2x2 competitive game. We consider optimal no-regret algorithms that are mean-based and monotonic in their argument. We show that for any such algorithm, the limiting mixed strategies of the players cannot converge almost surely to any Nash equilibrium. This negative result is also shown to hold under a broad relaxation of these assumptions, including popular variants of Follow-the-Regularized-Leader with optimism or adaptive step sizes. Finally, we provide partial evidence that the monotonicity and mean-based assumptions can be removed or relaxed. Our results identify the inherent stochasticity in players' realizations as a critical factor underlying this divergence, and demonstrate a crucial difference in outcomes between using the opponent's mixtures and realizations to make updates.
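The realization-based updates at the heart of this result can be illustrated with a toy simulation. This is a sketch of the setting, not the paper's proof machinery: it runs Hedge (multiplicative weights) in self-play on matching pennies, with each player updating from the opponent's realized action rather than their mixed strategy.

```python
import math
import random

def hedge_matching_pennies(T=500, eta=0.1, seed=0):
    """Hedge self-play on matching pennies with realization-based
    feedback: the row player wins on a match, the column player on
    a mismatch, payoffs in {-1, +1}.

    Returns the trajectory of mixed strategies (p, q), which
    oscillates rather than settling at the (1/2, 1/2) equilibrium.
    """
    rng = random.Random(seed)
    w1, w2 = [1.0, 1.0], [1.0, 1.0]
    traj = []
    for _ in range(T):
        p = w1[0] / sum(w1)                  # row's prob. of action 0
        q = w2[0] / sum(w2)                  # column's prob. of action 0
        a = 0 if rng.random() < p else 1     # realized actions
        b = 0 if rng.random() < q else 1
        for i in range(2):                   # row updates vs realized b
            w1[i] *= math.exp(eta * (1.0 if i == b else -1.0))
        for j in range(2):                   # column updates vs realized a
            w2[j] *= math.exp(eta * (1.0 if j != a else -1.0))
        traj.append((p, q))
    return traj
```

Replacing the realized actions a and b in the update loops with the mixtures p and q themselves changes the dynamics, which is exactly the mixtures-vs-realizations distinction the abstract highlights.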
From Foe to Friend: The Surprising Turn of Mega Constellations in Radio Astronomy
ACM Workshop on Hot Topics in Networks • 2024
Cheap spaceflight has ushered in an explosive growth era for Low Earth Orbit (LEO) satellites. While this has brought us LEO satellite megaconstellations for ubiquitous high-speed data, it has also enabled a proliferation of nanosatellites (e.g. CubeSats) launched by diverse organizations. An unfortunate side-effect is harmful interference to sensitive receivers like those of radio astronomy: no place on Earth is safe. How can we enjoy the fruits of the satellite revolution without blinding ourselves to the secrets of the universe? Networking is the key. This paper proposes InOrbitNet, which aggregates and backhauls traffic from low-capability nanosatellites using highly-capable LEO megaconstellations. By simulating LEO and nanosatellite orbit transitions, we show that orders-of-magnitude reductions in latency and significant increases in capacity are possible as compared to the current non-networked direct-to-ground approach. But more importantly, because LEO megaconstellations are highly capable and tightly managed, this consolidation of RF footprints also allows radio astronomy to be protected from interference.